Data and Analytics

  • PDF

Data & Analytics

data_solution1Semcasting provides data that delivers multi-channel contact information for individuals in the United States. Semcasting's database was created with 100% FCRA compliant and non-regulated data. The data sources include publicly available property data, census data, federal government survey data and Federal Reserve reporting. The database includes large samples of households nationwide with known incomes, assets, discretionary spending and automobile ownership, along with local tax rates and cost-of-living figures.

Semcasting's variables are part of what makes the Semcasting modeling process so powerful for direct marketing campaigns. Our variables set us apart from our competitors and are influential in creating the strongest predictive models and targeted lists for our client's campaigns – no matter the industry or issue they are facing.

For more information on these data variables or to learn more about other variables in Semcasting's data universe, please go to the Resources page where you can download PDFs about the individual variables.

 

Democratizing the Modeling Process - Making Analytics Available to All

Predictive models have been used for customer acquisition, cross selling and collections programs for years. It is widely accepted that lists built from predictive models outperform lists based on selection criteria by double digit percentages. However, the development of a model typically involves expensive tools, data and high levels of statistical expertise. Semcasting Modeler changes all that by automating the predictive modeling process and turning what is often weeks of development into an hour of compute time.

Semcasting Modeler employs machine learning and automation to enhance the speed and efficiency of predictive modeling. Based on patented genetic algorithms, Semcasting Modeler software allows models to be created using hundreds of data variables rather than just a few. Data predictors are the product of a process where thousands of models are built simultaneously to determine which of the variables, and combination of variables, will offer the strongest contribution to the final model. Since the software uses a much broader set of data during the model building process, there is a greater likelihood that subtle predictors will be found. The process takes hours rather than days or weeks to complete, often producing models that measurably outperform traditional regression-based approaches.

Modeler Process: How it Works

Loading the current customer data set into the Modeler software, initiates a process where the data is cleaned and sampled automatically, eliminating any manual data preparation. Next, a dependent variable is selected and the model building process begins. The Modeler software creates and tests 250 individual models simultaneously, graduating the "most fit" to the next generations of models. This process continues for thousands of generations and for millions of candidate models until the best combination of variable and predictive power emerges.

Semcasting Modeler is not a black box. Analysts have complete access to the application toolset where they can take advantage of the automated variable selection capabilities and generation building process while also directing how the final model is composed. If different parameters are preferred or more input is required, it is all part of a learning process for the model as it continuously learns and adapts. The final product of the modeling process is a scoring formula which is accessible as SAS, SPSS or XML code. Modeler is also capable of scoring a file at high speed directly through the application.

Market Database

A Turnkey Market Database Solution

The Marketing Database Solution includes unlimited access to data hygiene services, enhancement data, and analytical services at a subscription price. Semcasting's Marketing Database Solution can help to maintain CRM or customer lists while ensuring that the addresses and contact information on the customer file is up-to-date and correct using NCOA and CASS Certification services. Custom and prospect files are enhanced with the Semcasting data, and rapid Profile and Model reporting is included as part of the solution. Semcasting's Marketing Database solution has proven popular with regional and national retail organizations, political candidates, home improvement and pharmaceutical stores.

List Services

The prospect mailing list is the single most important component of a successful direct mail campaign. If you mail to the wrong address or to the wrong prospect the result for either good or bad creative is typically the same. Ensuring that the customer file is up-to-date and has the correct contact information for prospects is just as important to the return on investment of a marketing campaign. Semcasting provides the following services to ensure that consumer lists are kept up-to-date:

NCOA: The NCOA product ensures that change-of-address information is available to mailer. This can help reduce undeliverable mail pieces before the mail campaign enters the mail stream.

CASS: The CASS system can help to improve the accuracy of carrier routes, five-digit ZIP, ZIP + 4, and delivery point codes that appear on mail pieces. This can help to improve the quality of address-matching in customer lists.

Contact us today for more information on how Semcasting can help manage your data and help with your next direct marketing campaign.

  • Case Studies

  • FAQ

Modeler Software Case Studies

2Utilities Provider Collects $25 Million
An east coast electric provider has a number of low-income urban customers who have difficulty paying their electric bills every winter.  Using Modeler the electric provider was about to increase their ROI by $250. Learn More

 

 

2Energy Assistance Success with Predictive Modeling
A Midwest electric company was looking for a better way to serve its customers, particularly their lower income households who had a hard time paying their bills during the colder months.  Modeler created a $59 ROI for every dollar spent on the marketing campaign. Learn More

 

3Isolating Bad Debt and Fraud Before it Happens
One bank set out to regain customer trust in financial institutions by attempting to identify potential fraudulent transactions before they occurred.  Modeler software was able to identify transactions that were 276 time more likely to be fraudulent.  Learn More

 

 

4Targeting Lapsed Donor Reactivation
A major non-profit organization with a large lapsed donors file needed to determine which ones would be the most cost effective to pursue for reactivation.  Modeler was able to increase the average gift per donor to $15.75. Learn More

 

 

 

5Wealth ID Identifies High Value Donors
A non-profit group that handles several religious organizations was looking to increase its marketing campaign responses and donations.  Campaigns created by Modeler created an average response rate greater than 2% and had an average gift of $15 per responder. Learn More

 

 

 

.

What is the underlying theory of genetic algorithms?


Genetic algorithms (GA) are computational problem-solving procedures modeled after the evolutionary theories put forth in Charles Darwin’s The Origin of Species, where he introduced the concept of “Survival of the Fittest”. In the 1970s, John Holland of the University of Michigan expanded upon this concept when he presented the genetic algorithm as a way of logically reproducing the workings of evolution to perform optimization functions on a computer. In his book Adaptation in Natural and Artificial Systems, Holland introduced the creation of new offspring using natural selection together with genetics-based operators of crossover and mutation.


How is the theory of genetic algorithms applied within the Semcasting software?


Semcasting software is based on the broad concept of genetic algorithms, applied specifically to predictive modeling. As potential solutions to business problems, models are genetically encoded into digital chromo¬somes (patent pending). Gene groups are used to represent data attributes, with separate genes used for modeling transformations of the data (e.g., coefficients, outlier trimming thresholds, missing values substi¬tutions, and categorical combinations).


Is this a black box solution?


No. All models are available, including the variables, coefficients, and transformations. The modeling ana¬lyst can review the evolutionary progress of the king model from the first generation to the latest. In addi¬tion, the model equation is available in either SAS or SPSS format.


What is the right sample size?


There is not an exact answer to this question. The goal is to have enough data so that the model holds up under validation and that there is enough information to explain the independent variables. Larger data sets, however, may not be the answer, especially when response rates are low. Most modeling techniques do not produce good models when most of the data represents non-events. In addition, larger data sets take longer to process. Analysts’ time could be better spent investigating other aspects of the business problem instead of processing and manipulating data.


You are here: SOLUTIONS Data & Analytics