|
|
|
| |
|
| |
|
[ December 12, 2006 |
This Edition: ]
1.
Training Schedule Update: Learn How Experts Mine Data
in
Orlando, January 29 - February 2
or Las Vegas, April 16 - 20
2.
Feature Article: "Adding the Environmental
Context to Predictive Analytics: From Predicting the Past to
Predicting The Future"
by Kenneth D. Levin,
PhD
3.
Article: "Data Quality Defined" by
Thomas C. Redman, Senior Consultant of
The Cutter Consortium
4.
Announcement: TDWI World Conference in
Las Vegas
February 18 - 23, 2007
5.
Newsletter Summary
|
| |
 |
| |
|
1.
TRAINING SCHEDULE UPDATE
COURSE SERIES ON DATA MINING STRATEGY, METHODS AND APPLICATION
Learn how experts
mine data by attending The Modeling Agency's vendor-neutral,
application-oriented data mining courses. Participants
will enjoy a balanced and broad presentation of predictive
analytics without restriction through a particular tool or
product. Attendees will learn about data mining
capabilities, limitations, methods, tools, strategies,
techniques, applications, and costly pitfalls. Those in attendance
will leave with a comprehensive binder of notes,
illustrations and references to valuable resources.
Don't leave a
powerful competitive advantage untapped: harness the valuable information and
profits hidden in your data. Each offering is limited to just 18 seats. A current status of remaining space may be
viewed at TMA's main training page.
Submit an
unofficial registration and reserve your seat today while
your training request is processed.
Since The Modeling Agency is not a
tools vendor, participants enjoy a balanced, broad and
non-promotional perspective of predictive analytics at desirable
venues throughout the USA. |
CHOOSE THE TRAINING
THAT'S RIGHT FOR YOU
The Modeling Agency offers three data mining courses with
distinct objectives. The courses are designed to be
attended independently, or as a progressive series. While the
three levels are staged as a progression, they should not be viewed
simply as "introductory, intermediate and advanced." Refer to the table
below to ensure that your experience, situation and objectives align
properly with the intent, scope and depth of each offering:
|
Course |
Focus |
Scope |
Geared To |
|
Data Mining:
Level I |
Strategy |
An intensive overview
of strategy, best practices and case studies |
Project
leaders,
Stakeholders,
Functional Managers |
|
Data Mining:
Level II |
Methods |
A tactical drill-down
of the data mining process, methods, techniques and
resources |
Business
Analysts,
Functional Analysts,
IT Professionals |
|
Data Mining:
Level III |
Application |
A hands-on
application workshop as an extension to Data Mining: Level
II |
Practitioners,
Model-builders,
Decision Support
Developers |
|
|
|
 |
|
2.
FEATURE ARTICLE
Adding the Environmental Context to
Customer Analytics
From Predicting the Past to
Predicting the Future
by
Kenneth D. Levin, PhD
INTRODUCTION
The tagline for
customer analytics has long been the right offer to the right customer at
the right time and through the right channel. Marketing investment rides a
fine line between two complementary goals: a) creating opportunity by
convincing consumers to behave in a certain way; and b) capturing
opportunity by being in the right place at the right time when the customer
is ready to buy. Customer analytics helps businesses focus their marketing
investment on customers who may be persuaded to buy a product they would not
have otherwise purchased or customers who are already likely to make the
purchase. All too often, however, marketing strategy is based on
pre-determined goals of the company rather than filling the needs of the
marketplace. In this regard, customer analytics is used to identify
customers who are likely to respond to a pre-determined offer, at a time
convenient to the business making the offer, through a pre-budgeted channel.
The offer, timing and channel are too often determined by the plans of the
business rather than the reality of the marketplace and the environment in
which the customer behaves.
Companies
often limit their analysis to micro-level data and focus on fitting the
customer into the box they have drawn. The businesses that stay ahead of the
curve examine micro-level data in the context of macro-level data. They
understand the macroeconomic trends that create the environment in which
customers behave, and bridge the gap between predicting how customers have
behaved in the past and how customers will behave in the future. They are
proactive in predicting behavior change based on larger social or economic
trends (e.g., the popularity of SUVs moving to more fuel-efficient
vehicles).
Marketing
organizations typically use data mining to develop marketing strategy by
focusing on customer-level data, for example customers’ previous product
choices, frequency of purchase, age, income, etc. These variables provide
insight into the factors that are associated with customers’ past behavior
and have some predictive utility in predicting how customers will behave in
the future. Macro-level forces, however, have a large impact on how customer
behavior changes over time. Variables such as energy prices, mortgage rates,
inflation, housing values and unemployment, just to name a few, can add
insight into how customer behavior will be different in the present and
future than it may have been in the past. These environmental variables can
be added to predictive models and segmentation schemes to enhance their
predictive utility and make the models more robust. Examples of micro-level
and macro-level variables are listed below.
Micro-Level Variables
Attributes specific to particular
consumers and their past behavior
- Age
- Income
- Tenure
- Recency
- Frequency
- Monetary Value
|
- Revolving Debt
- Lease vehicle history
- Purchase vehicle history
|
|
Macro-Level Variables
Characteristics of the economic or
social normative environment
- Interest Rates
- Energy Prices
- Health Care Costs
|
- Housing Values
- Industry Trends
- Tax Policy
- Unemployment
- Inflation
|
|
Customer
analytics is, at heart, a comparison of the characteristics associated with
different people behaving differently. By selecting the characteristics
associated with the desired behavior and not associated with the undesired
behavior, marketers can direct their energy and resources towards incenting
the behavior they want and winning higher market share. Micro-level
variables help to explain why two different people will behave differently
in the same environment. Macro-level variables, on the other hand, help
explain why the same person may behave differently in different situations.
By adding macro-level dimensions to predictive models and the interpretation
of predictive models, analysts can better leverage historical data to
predict how people will behave in the present or future, based on the
combination of micro-level attributes and the current environment.
EXAMPLE
# 1: THE AUTOMOTIVE INDUSTRY
The automotive industry provides a great
example of how individual customer patterns interact with the economic
environment to drive behavior. A number of micro-level customer attributes
can be combined to segment the population based on different automotive
behavioral patterns. Meaningful segments can be identified by
examining age, income, marital status, presence and number of children, recency of last purchase, the vehicle purchased most recently, the cost of
their most recent vehicle purchase, and other customer-level and
vehicle-level attributes. Figure 2 illustrates what four of these segments
might look like and examples of their possible differences in vehicle
purchasing behavior.
|
Single Urban Young Professionals |
Dual
Income Young Professional Couples |
First Time Parents |
New
Empty Nesters |
- Sporty Vehicles (e.g. coupes)
- Buy and hold or lease new
|
- Near-luxury vehicles
- Lease new
|
- Minivan, SUV, station wagon
- Buy and hold
|
- Luxury vehicles
- Lease new
|
Each of these segments is likely to have
different vehicle purchase patterns. They will differ in purchase frequency,
vehicle partition choices and the factors that influence the vehicle that
they drive home when their purchase is complete. Although the micro-level
variables drive the segmentation scheme, the macro-level variables will
impact the ultimate behavioral differences between the segments and should
help to drive marketing strategy. For example, interest rates can impact the
frequency that individuals purchase vehicles. When interest rates were high
in the late 1990s, banks and captive finance companies priced vehicles in a
way that made auto leasing quite attractive. As interest rates plummeted to
historical lows, financing a vehicle purchase became more attractive,
particularly to segments that were looking for the best value rather than
committed to getting a new car every three years. The marketing
organizations that understood various segments’ priorities and understood
the impact that interest rate trends would have on consumer choices were
able to capture market share from the marketing organizations that did not.
Another macro-level force that has impacted the
way that automotive customers behave is energy prices. The unprecedented
spike in gas prices over the last few years altered car buyers’ behavior,
placing a premium on fuel efficiency. For example, the typical family
vehicles are migrating from SUVs and minivans to more fuel efficient station
wagons and crossover vehicles. The entry into the market of hybrid
gas/electric automobiles is enabling manufacturers to take advantage of this
changing landscape. Similar to adjusting to changes in interest rates, the
marketing organizations that push products that are aligned with the
changing environment are much better positioned than marketing organizations
that orient themselves towards selling for yesterday’s environment.
EXAMPLE #2: MORTGAGE LENDING
Mortgage lending provides another
industry example of the key interaction of micro and macro-level data. There
are volumes of customer-level data that provide predictive insight into a
consumer’s behavior. For example the customer’s mortgage balance, interest
rate, term, amount of revolving debt and the trajectory of their credit
scores all help to predict behavior. However, this insight is most useful in
the context of the macro-level variables. Predictive models identifying
which mortgage customers are most likely to attrite by refinancing their
mortgage with a different lender are likely to include the amount of equity
the customer has in their home and the amount of revolving debt they are
carrying. Predictive models identifying customers who are likely to respond
to a cross-sell home equity offer will have similar attributes. Current
interest rates relative to the customer’s interest rate on their current
mortgage will play a key role in determining whether refinance or home
equity products make the most sense for that customer. The likely length of
time a customer will stay in their home (which can be predicted by
micro-level variables) will interact with current interest rate trends to
indicate whether a fixed-rate or adjustable rate product is most likely to
resonate with a customer. As interest rates rise, most customers are better
off in home equity products because their current mortgage rate is lower
than what they could obtain at the present time. In a downward rate
environment, many customers are best-served by an adjustable rate refinance.
Additionally, homeowners’ amount and percent of
equity in their home will greatly impact whether and which of these products
are the right option. Understanding customers’ equity requires an
understanding of housing markets and the likely trajectory of housing
prices. Customers in rapidly appreciating housing markets are in a different
position to pull cash out of their homes than customers in markets
appreciating much slower. Macro-level housing appreciation data can help
provide a general understanding of customers’ equity positions without
necessarily incurring the data costs of purchasing automated valuation model
results for an entire database of customers. Understanding the customers’
individual differences in the context of the broader environment can help
marketers make more strategic investments in their customer incentives and
marketing programs.
STRATEGIC APPLICATION OF MACRO-LEVEL DATA
Macro-level data can function as a
lens through which micro-level models can be interpreted and applied. In the
automotive industry example above, a micro-level segmentation of auto
customers can be applied with particular consideration of the impact that
energy prices are likely to have on particular segments’ purchase choices.
Higher income segments will likely have a much higher tolerance for high
energy prices than value-driven segments. Similarly, particular segments are
likely to respond differently to the interest rate environment. In
particular, customers who are motivated by value may be attracted to lower
lease payments in a higher rate environment and longer term ownership
potential in a lower rate environment.
Alternatively, macro-level data can be utilized
to create predictor variables. In the mortgage industry example above, the
spread between the current prevailing mortgage rates and the interest rate
on the customer’s current mortgage is a good predictor of customers’
likelihood to refinance their mortgage. This application can help identify
specific spread thresholds at which customers will act. Similarly, the rate
of property appreciation in a particular geographic area can be used as a
predictor in order to understand the impact of particular real-estate
markets on customer behavior.
CONCLUSION
These
industry examples illustrate the importance of putting customer-level
analysis in the context of the broader environment. Data mining is a
specialized skill set, and the techniques and methods can be applied without
much difference across multiple vertical markets. However, knowledge of the
specific vertical market and overall economic trends is essential to ensure
that customer analytics include all of the most important measurable factors
that impact customer behavior, and that customer-level analysis can be
viewed through the appropriate lens and in the proper context. Micro-level
variables can be used to understand how different customers will behave
differently in the same environment. However, predicting the present and
future requires the ability to predict how customers will behave when the
environment is different. Understanding the individual through micro-level
variables and the environment through macro-level variables enables data
mining to bridge the gap between the past and the future.
ABOUT THE AUTHOR
Kenneth D. Levin, PhD is Principal of Levin
Analytics, a data mining consulting firm serving multiple vertical markets,
including retail automotive sales, consumer lending, travel and health care.
He can be reached via e-mail at
ken.levin@adelphia.net.
All Rights Reserved by Kenneth D.
Levin, PhD.
Copyright ©
2006
3.
ARTICLE
DATA QUALITY
DEFINED
by
Thomas C. Redman
Senior Consultant
Cutter Consortium
DATA QUALITY DEFINED
As with any other sensitive subject, there are
many competing definitions for the word
"quality." For several years, many (myself
included) have employed the following
definition, adapted from the insights of the
great quality guru Joseph Juran:
Data are of high quality if they are
fit for their intended uses by customers in
operations, decision-making, and planning.
Further, "fitness for use" involves "freedom
from defects" (such as being incorrect, out of
date, or improperly defined) and possessing the
"desired features" (such as being relevant to
the task at hand, comprehensive, and at the
proper level of detail).
It is clear that data quality is a
multifaceted notion -- indeed, there are
hundreds of potential dimensions of data
quality. Among the most important are those that
bear on the data model, data
values, and data presentation. For example,
"clear definition" and "relevancy" depend on the
data model; accuracy relates to data values; and
"ease of interpretation" bears on data
presentation.
It's important to remember that customers may
have needs that are related to data but don't
involve the data per se. For example, most
customers desire fast access to data. This need
is legitimate, of course, but it bears on
supporting technologies, not the data
themselves.
The definition of data quality above
recognizes the preeminence of the "customer."
This can be challenging in practice, as
customers are surprisingly fickle. Not only does
each have different needs, but each changes his
or her needs all the time.
LESSONS LEARNED
A number of leading-edge companies have made
dramatic improvements in data quality. A body of
best practices is emerging, and a few of the
lessons learned, most pertinent to
technologists, are summarized here.
1. The Customer Is King. Understanding
changing customer needs is time-consuming and
difficult. But those with the best data
recognize the primacy of the customer.
2. Prevent Errors at Their Sources.
Leading-edge companies have learned that they
cannot base their data quality efforts on
"finding and fixing errors." They simply create
and/or acquire too many new data each day. For
example, if an organization creates or acquires
one million new data records a day (which may
seem like a lot but isn't, even for a
moderate-sized organization), and 10% are in
error, then it creates 10,000 new errors a day.
So instead of focusing on fixing existing
errors, leading companies are adopting a
philosophy of "preventing errors at their
sources." This approach not only yields better
data, it ends up costing far less.
3. Management Accountability. If an
organization is to prevent errors at their
sources, it follows that those who create data
must be held accountable for their quality.
Applying this principle to data models, data
values, and data presentation, we see that:
-
Those who create
data models must be held accountable for the
quality (clear definition, relevancy) of the
models.
-
Those who create
data values must be held accountable for the
accuracy of those data values.
-
Those who develop
applications must be held accountable for
quality dimensions associated with data
presentation.
Companies both create their own models, values,
and/or applications and obtain them from outside
sources. Thus, the internal business processes
and external suppliers must be actively managed.
Many companies ask
their technology departments to assume
responsibility for data quality. In response, IT
departments "clean up" dirty databases,
correcting erred data values they did not
create. They have not adopted the philosophy of
prevention nor have they learned the lesson that
those who create data, not downstream technology
departments, must be accountable for their
quality. Furthermore, they find that the
benefits of the cleanup are short-lived -- since
newly acquired data values are of poor quality
-- and thus the cleaned database is soon dirty
once again.
4. Measure,
Control, Improve. A third important lesson
involves measurement. You simply cannot manage
what you don't measure. Related to
measurement are control and improvement. Simple
edit controls are essential to prevent errors
from leaking through. But the real power
comes from statistical controls and a spirit of
continuous improvement.
5. Roles of
Technology. This lesson is especially
important for technologists. Experience shows
that automating a poorly designed and executed
process does not improve quality. Technology can
"lock in the gains," increase capacity, and
lower unit cost, but if the current process
produces junk data, automating it will only
enable it to produce more junk data faster.
6. The Hard Issues
Are Soft. All who work on data in any way
know just how "political" even the simplest
issue can be. At the very least, those with the
best data recognize and avoid political traps.
More proactively, they recognize data as
essential business assets and work to create a
culture that values them.
ABOUT THE AUTHOR
This excerpt, by Cutter Consortium Senior Consultant
Thomas C. Redman, originated
from Cutter's Business Intelligence Advisory Service. Through this
subscription-based service, you are assured of expert analyses of
the latest business intelligence strategies, products, and technologies.
For more information or to find out how you can become a client,
please visit
Cutter's web site,
or contact Dennis Crowley at + 1 781-641-5125 or e-mail
dcrowley@cutter.com.
Published
with permission from
Cutter Consortium. Copyright
© 2006.
|
 |
|
TDWI World Conference
The Premier Event for Business Intelligence
and Data Warehousing Education
February 18 - 23, 2007
Las Vegas, Nevada |
CONFERENCE HIGHLIGHTS
The TDWI World Conference in Las Vegas brings together leading
industry visionaries to deliver a unique program of cutting-edge
education, best practices, one-on-one consulting, peer
networking, business intelligence certification, and product
demos. From business intelligence fundamentals to business
analytics, TDWI’s program of more than 50 full-day, half-day,
and night school courses offers something for your entire team.
-
Special track on
IT governance
-
Special track on
business by the numbers
-
One full day of
healthcare-specific topics
-
Business
intelligence career courses
-
CBIP certification
program
Produced
with permission from
The Data Warehousing Institute.
Copyright ©
2006
|
 |
5.
NEWSLETTER
SUMMARY
The Modeling Agency newsletter is a quarterly publication which
provides course announcements, training schedule updates and informative
articles. This newsletter may be shared in its entirety and subscriptions
are free. For additional information on TMA's training,
consulting services and solutions, follow corresponding links at
the top of this page.This newsletter
is shared with those who have activated a subscription, or have
supplied their Email address to The Modeling Agency when requesting
product information. If you wish not to receive future releases,
simply send an empty
email
with cancel as he subject from the account which you were subscribed.
|
|