direct
281.667.4200

training
888.742.2454

fax
281.652.5721

email
send a message
 
newsletter
Receive quarterly training schedule updates and informative articles

 
 
 
 
 
 
 
 
 
The Modeling Agency Quarterly Newsletter
2006-Q4 Release
 

[ December 12, 2006  |  This Edition: ]

1.  Training Schedule Update: Learn How Experts Mine Data in
Orlando, January 29 - February 2 or Las Vegas, April 16 - 20

2.  Feature Article:  "Adding the Environmental Context to Predictive Analytics: From Predicting the Past to Predicting The Future" by Kenneth D. Levin, PhD

3.  Article:  "Data Quality Defined" by Thomas C. Redman, Senior Consultant of The Cutter Consortium

4.  Announcement:  TDWI World Conference in Las Vegas
February 18 - 23, 2007

5.  Newsletter Summary

 
 

1.  TRAINING SCHEDULE UPDATE 

 

  
COURSE SERIES ON DATA MINING STRATEGY, METHODS AND APPLICATION
Learn how experts mine data by attending The Modeling Agency's vendor-neutral, application-oriented data mining courses.  Participants will enjoy a balanced and broad presentation of predictive analytics without restriction through a particular tool or product.  Attendees will learn about data mining capabilities, limitations, methods, tools, strategies, techniques, applications, and costly pitfalls. Those in attendance will leave with a comprehensive binder of notes, illustrations and references to valuable resources.

Don't leave a powerful competitive advantage untapped: harness the valuable information and profits hidden in your data.  Each offering is limited to just 18 seats.  A current status of remaining space may be viewed at TMA's main training page.   Submit an unofficial registration and reserve your seat today while your training request is processed.  

Since The Modeling Agency is not a tools vendor, participants enjoy a balanced, broad and
non-promotional perspective of predictive analytics at desirable venues throughout the USA.

CHOOSE THE TRAINING THAT'S RIGHT FOR YOU
The Modeling Agency offers three data mining courses with distinct objectives.  The courses are designed to be attended independently, or as a progressive series.  While the three levels are staged as a progression, they should not be viewed simply as "introductory, intermediate and advanced."  Refer to the table below to ensure that your experience, situation and objectives align properly with the intent, scope and depth of each offering:

Course

Focus

Scope

Geared To

Data Mining: Level I Strategy An intensive overview of strategy, best practices and case studies Project leaders,
Stakeholders,
Functional Managers
Data Mining: Level II Methods A tactical drill-down of the data mining process, methods, techniques and resources Business Analysts,
Functional Analysts,
IT Professionals
Data Mining: Level III Application A hands-on application workshop as an extension to Data Mining: Level II Practitioners,
Model-builders,
Decision Support
Developers

 

FULL COURSE DETAILS

The featured course schedule for this section is outdated.  For current course dates, locations, pricing and detailed outlines, please visit the main training page.

web
http://www.the-modeling-agency.com/training

email
training@the-modeling-agency.com

phone
888-742-2454 (toll free)
281-667-4200 (direct)
281-652-5721 (fax)
 

Courses May Be Delivered At Your Site

Call (888) 742-2454 or send an email inquiry to receive a value-based
spreadsheet quotation for training at your site.

 

 

2.  FEATURE ARTICLE
 

Adding the Environmental Context to Customer Analytics
From Predicting the Past to Predicting the Future

by
Kenneth D. Levin, PhD
 

INTRODUCTION
The tagline for customer analytics has long been the right offer to the right customer at the right time and through the right channel. Marketing investment rides a fine line between two complementary goals: a) creating opportunity by convincing consumers to behave in a certain way; and b) capturing opportunity by being in the right place at the right time when the customer is ready to buy. Customer analytics helps businesses focus their marketing investment on customers who may be persuaded to buy a product they would not have otherwise purchased or customers who are already likely to make the purchase. All too often, however, marketing strategy is based on pre-determined goals of the company rather than filling the needs of the marketplace. In this regard, customer analytics is used to identify customers who are likely to respond to a pre-determined offer, at a time convenient to the business making the offer, through a pre-budgeted channel. The offer, timing and channel are too often determined by the plans of the business rather than the reality of the marketplace and the environment in which the customer behaves.

Companies often limit their analysis to micro-level data and focus on fitting the customer into the box they have drawn. The businesses that stay ahead of the curve examine micro-level data in the context of macro-level data. They understand the macroeconomic trends that create the environment in which customers behave, and bridge the gap between predicting how customers have behaved in the past and how customers will behave in the future. They are proactive in predicting behavior change based on larger social or economic trends (e.g., the popularity of SUVs moving to more fuel-efficient vehicles).

Marketing organizations typically use data mining to develop marketing strategy by focusing on customer-level data, for example customers’ previous product choices, frequency of purchase, age, income, etc. These variables provide insight into the factors that are associated with customers’ past behavior and have some predictive utility in predicting how customers will behave in the future. Macro-level forces, however, have a large impact on how customer behavior changes over time. Variables such as energy prices, mortgage rates, inflation, housing values and unemployment, just to name a few, can add insight into how customer behavior will be different in the present and future than it may have been in the past. These environmental variables can be added to predictive models and segmentation schemes to enhance their predictive utility and make the models more robust. Examples of micro-level and macro-level variables are listed below. 

Micro-Level Variables
Attributes specific to particular
consumers and their past behavior
 
  • Age
  • Income
  • Tenure
  • Recency
  • Frequency
  • Monetary Value
  • Revolving Debt
  • Lease vehicle history
  • Purchase vehicle history
Macro-Level Variables
Characteristics of the economic or
social normative environment
 
  • Interest Rates
  • Energy Prices
  • Health Care Costs
  • Housing Values
  • Industry Trends
  • Tax Policy
  • Unemployment
  • Inflation

Customer analytics is, at heart, a comparison of the characteristics associated with different people behaving differently. By selecting the characteristics associated with the desired behavior and not associated with the undesired behavior, marketers can direct their energy and resources towards incenting the behavior they want and winning higher market share. Micro-level variables help to explain why two different people will behave differently in the same environment. Macro-level variables, on the other hand, help explain why the same person may behave differently in different situations. By adding macro-level dimensions to predictive models and the interpretation of predictive models, analysts can better leverage historical data to predict how people will behave in the present or future, based on the combination of micro-level attributes and the current environment.
 

EXAMPLE # 1: THE AUTOMOTIVE INDUSTRY
The automotive industry provides a great example of how individual customer patterns interact with the economic environment to drive behavior. A number of micro-level customer attributes can be combined to segment the population based on different automotive behavioral patterns. Meaningful segments can be identified by examining age, income, marital status, presence and number of children, recency of last purchase, the vehicle purchased most recently, the cost of their most recent vehicle purchase, and other customer-level and vehicle-level attributes. Figure 2 illustrates what four of these segments might look like and examples of their possible differences in vehicle purchasing behavior.

Single Urban Young Professionals Dual Income Young Professional Couples First Time Parents New Empty Nesters
  • Sporty Vehicles (e.g. coupes)
  • Buy and hold or lease new
  • Near-luxury vehicles
  • Lease new
  • Minivan, SUV, station wagon
  • Buy and hold
  • Luxury vehicles
  • Lease new

Each of these segments is likely to have different vehicle purchase patterns. They will differ in purchase frequency, vehicle partition choices and the factors that influence the vehicle that they drive home when their purchase is complete. Although the micro-level variables drive the segmentation scheme, the macro-level variables will impact the ultimate behavioral differences between the segments and should help to drive marketing strategy. For example, interest rates can impact the frequency that individuals purchase vehicles. When interest rates were high in the late 1990s, banks and captive finance companies priced vehicles in a way that made auto leasing quite attractive. As interest rates plummeted to historical lows, financing a vehicle purchase became more attractive, particularly to segments that were looking for the best value rather than committed to getting a new car every three years. The marketing organizations that understood various segments’ priorities and understood the impact that interest rate trends would have on consumer choices were able to capture market share from the marketing organizations that did not. 

Another macro-level force that has impacted the way that automotive customers behave is energy prices. The unprecedented spike in gas prices over the last few years altered car buyers’ behavior, placing a premium on fuel efficiency. For example, the typical family vehicles are migrating from SUVs and minivans to more fuel efficient station wagons and crossover vehicles. The entry into the market of hybrid gas/electric automobiles is enabling manufacturers to take advantage of this changing landscape. Similar to adjusting to changes in interest rates, the marketing organizations that push products that are aligned with the changing environment are much better positioned than marketing organizations that orient themselves towards selling for yesterday’s environment.
 

EXAMPLE #2: MORTGAGE LENDING
Mortgage lending provides another industry example of the key interaction of micro and macro-level data. There are volumes of customer-level data that provide predictive insight into a consumer’s behavior. For example the customer’s mortgage balance, interest rate, term, amount of revolving debt and the trajectory of their credit scores all help to predict behavior. However, this insight is most useful in the context of the macro-level variables. Predictive models identifying which mortgage customers are most likely to attrite by refinancing their mortgage with a different lender are likely to include the amount of equity the customer has in their home and the amount of revolving debt they are carrying. Predictive models identifying customers who are likely to respond to a cross-sell home equity offer will have similar attributes. Current interest rates relative to the customer’s interest rate on their current mortgage will play a key role in determining whether refinance or home equity products make the most sense for that customer. The likely length of time a customer will stay in their home (which can be predicted by micro-level variables) will interact with current interest rate trends to indicate whether a fixed-rate or adjustable rate product is most likely to resonate with a customer. As interest rates rise, most customers are better off in home equity products because their current mortgage rate is lower than what they could obtain at the present time. In a downward rate environment, many customers are best-served by an adjustable rate refinance.

Additionally, homeowners’ amount and percent of equity in their home will greatly impact whether and which of these products are the right option. Understanding customers’ equity requires an understanding of housing markets and the likely trajectory of housing prices. Customers in rapidly appreciating housing markets are in a different position to pull cash out of their homes than customers in markets appreciating much slower. Macro-level housing appreciation data can help provide a general understanding of customers’ equity positions without necessarily incurring the data costs of purchasing automated valuation model results for an entire database of customers. Understanding the customers’ individual differences in the context of the broader environment can help marketers make more strategic investments in their customer incentives and marketing programs.
  

STRATEGIC APPLICATION OF MACRO-LEVEL DATA
Macro-level data can function as a lens through which micro-level models can be interpreted and applied. In the automotive industry example above, a micro-level segmentation of auto customers can be applied with particular consideration of the impact that energy prices are likely to have on particular segments’ purchase choices. Higher income segments will likely have a much higher tolerance for high energy prices than value-driven segments. Similarly, particular segments are likely to respond differently to the interest rate environment. In particular, customers who are motivated by value may be attracted to lower lease payments in a higher rate environment and longer term ownership potential in a lower rate environment.

Alternatively, macro-level data can be utilized to create predictor variables. In the mortgage industry example above, the spread between the current prevailing mortgage rates and the interest rate on the customer’s current mortgage is a good predictor of customers’ likelihood to refinance their mortgage. This application can help identify specific spread thresholds at which customers will act. Similarly, the rate of property appreciation in a particular geographic area can be used as a predictor in order to understand the impact of particular real-estate markets on customer behavior.
 

CONCLUSION
These industry examples illustrate the importance of putting customer-level analysis in the context of the broader environment. Data mining is a specialized skill set, and the techniques and methods can be applied without much difference across multiple vertical markets. However, knowledge of the specific vertical market and overall economic trends is essential to ensure that customer analytics include all of the most important measurable factors that impact customer behavior, and that customer-level analysis can be viewed through the appropriate lens and in the proper context. Micro-level variables can be used to understand how different customers will behave differently in the same environment. However, predicting the present and future requires the ability to predict how customers will behave when the environment is different. Understanding the individual through micro-level variables and the environment through macro-level variables enables data mining to bridge the gap between the past and the future.
 

ABOUT THE AUTHOR
Kenneth D. Levin, PhD is Principal of Levin Analytics, a data mining consulting firm serving multiple vertical markets, including retail automotive sales, consumer lending, travel and health care. He can be reached via e-mail at ken.levin@adelphia.net.
 

All Rights Reserved by Kenneth D. Levin, PhD Copyright © 2006

 

3.  ARTICLE

 
DATA QUALITY DEFINED


by
Thomas C. Redman
Senior Consultant
Cutter Consortium
 

DATA QUALITY DEFINED
As with any other sensitive subject, there are many competing definitions for the word "quality." For several years, many (myself included) have employed the following definition, adapted from the insights of the great quality guru Joseph Juran:

Data are of high quality if they are fit for their intended uses by customers in operations, decision-making, and planning.

 
Further, "fitness for use" involves "freedom from defects" (such as being incorrect, out of date, or improperly defined) and possessing the "desired features" (such as being relevant to the task at hand, comprehensive, and at the proper level of detail).

It is clear that data quality is a multifaceted notion -- indeed, there are hundreds of potential dimensions of data quality. Among the most important are those that bear on the data model, data
values, and data presentation. For example, "clear definition" and "relevancy" depend on the data model; accuracy relates to data values; and "ease of interpretation" bears on data presentation.

It's important to remember that customers may have needs that are related to data but don't involve the data per se. For example, most customers desire fast access to data. This need is legitimate, of course, but it bears on supporting technologies, not the data themselves.

The definition of data quality above recognizes the preeminence of the "customer." This can be challenging in practice, as customers are surprisingly fickle. Not only does each have different needs, but each changes his or her needs all the time.

 
LESSONS LEARNED
A number of leading-edge companies have made dramatic improvements in data quality. A body of best practices is emerging, and a few of the lessons learned, most pertinent to technologists, are summarized here.

1. The Customer Is King. Understanding changing customer needs is time-consuming and difficult. But those with the best data recognize the primacy of the customer.

2. Prevent Errors at Their Sources. Leading-edge companies have learned that they cannot base their data quality efforts on "finding and fixing errors." They simply create and/or acquire too many new data each day. For example, if an organization creates or acquires one million new data records a day (which may seem like a lot but isn't, even for a moderate-sized organization), and 10% are in error, then it creates 10,000 new errors a day. So instead of focusing on fixing existing errors, leading companies are adopting a philosophy of "preventing errors at their sources." This approach not only yields better data, it ends up costing far less.

3. Management Accountability. If an organization is to prevent errors at their sources, it follows that those who create data must be held accountable for their quality. Applying this principle to data models, data values, and data presentation, we see that:

  • Those who create data models must be held accountable for the quality (clear definition, relevancy) of the models.

  • Those who create data values must be held accountable for the accuracy of those data values.

  • Those who develop applications must be held accountable for quality dimensions associated with data presentation.

 
Companies both create their own models, values, and/or applications and obtain them from outside sources. Thus, the internal business processes and external suppliers must be actively managed.

Many companies ask their technology departments to assume responsibility for data quality. In response, IT departments "clean up" dirty databases, correcting erred data values they did not create. They have not adopted the philosophy of prevention nor have they learned the lesson that those who create data, not downstream technology departments, must be accountable for their quality. Furthermore, they find that the benefits of the cleanup are short-lived -- since newly acquired data values are of poor quality -- and thus the cleaned database is soon dirty once again.

4. Measure, Control, Improve. A third important lesson involves measurement. You simply cannot manage what you don't measure.  Related to measurement are control and improvement. Simple edit controls are essential to prevent errors from leaking through.  But the real power comes from statistical controls and a spirit of continuous improvement.

5. Roles of Technology. This lesson is especially important for technologists. Experience shows that automating a poorly designed and executed process does not improve quality. Technology can "lock in the gains," increase capacity, and lower unit cost, but if the current process produces junk data, automating it will only enable it to produce more junk data faster.

6. The Hard Issues Are Soft. All who work on data in any way know just how "political" even the simplest issue can be. At the very least, those with the best data recognize and avoid political traps. More proactively, they recognize data as essential business assets and work to create a culture that values them.

 
ABOUT THE AUTHOR
This excerpt, by Cutter Consortium Senior Consultant Thomas C. Redman, originated from Cutter's Business Intelligence Advisory Service.  Through this subscription-based service, you are assured of expert analyses of the latest business intelligence strategies, products, and technologies. For more information or to find out how you can become a client, please visit Cutter's web site, or contact Dennis Crowley at + 1 781-641-5125 or e-mail
dcrowley@cutter.com.

Published with permission from Cutter ConsortiumCopyright © 2006.

 


4.
  ANNOUNCEMENT
 

TDWI World Conference
The Premier Event for Business Intelligence
and Data Warehousing Education

February 18 - 23, 2007
Las Vegas, Nevada

 
CONFERENCE HIGHLIGHTS
The TDWI World Conference in Las Vegas brings together leading industry visionaries to deliver a unique program of cutting-edge education, best practices, one-on-one consulting, peer networking, business intelligence certification, and product demos. From business intelligence fundamentals to business analytics, TDWI’s program of more than 50 full-day, half-day, and night school courses offers something for your entire team.
 


HOT TOPICS

  • Special track on IT governance
  • Special track on business by the numbers
  • One full day of healthcare-specific topics
  • Business intelligence career courses
  • CBIP certification program

 

CONFERENCE REGISTRATION

View additional information, and register for the TDWI World Conference in Las Vegas.
 

Produced with permission from The Data Warehousing Institute Copyright © 2006
   
 


 
5.  NEWSLETTER SUMMARY
 

The Modeling Agency newsletter is a quarterly publication which provides course announcements, training schedule updates and informative articles.  This newsletter may be shared in its entirety and subscriptions are free. For additional information on TMA's training, consulting services and solutions, follow corresponding links at the top of this page.

This newsletter is shared with those who have activated a subscription, or have supplied their Email address to The Modeling Agency when requesting product information. If you wish not to receive future releases, simply send an empty email with cancel as he subject from the account which you were subscribed.

    address
One Oxford Centre
301 Grant St, Ste 4300
Pittsburgh, PA 15219 USA
 
phone: 281.667.4200
fax: 281.652.5721
training: 888.742.2454
Copyright © 2000 - 2008 The Modeling Agency. All rights reserved.