direct
281.667.4200

training
888.742.2454

fax
281.652.5721

email
send a message
 
newsletter
Receive quarterly training schedule updates and informative articles

 
DATA MINING: LEVEL III
A Hands-On Application Workshop for Data Mining Practitioners

by Dean Abbott
One Day: $695, or $595 with Level II


 
SCHEDULE AND SITE DETAILS  

June 6, 2008: Washington, DC
October 3, 2008: San Diego, CA
December 12, 2008: Las Vegas, NV

 

Seating is limited to 18 participants.  Register early! 
Proceed to the
On-Line Registration Form to reserve your space today.

 
   
ABOUT THIS COURSE
This third-level offering takes the tactical and methodological presentation of Data Mining: Level II and puts the material into action through team-driven, live data mining exercises. Comparative review sessions then reveal real-world obstacles, breakthroughs and results from which to interpret, learn and apply.
 
Data Mining: Level III is a hands-on application workshop, applying data mining methods and techniques presented in Data Mining: Level II to real-world data. Although the workshop may be attended exclusively, registrants should have experience with the breadth of material covered in the Level II offering.

Throughout the workshop, the CRISP-DM model will be used to guide participants through the steps of the data mining process, and the attendees themselves will complete the entire data mining process during the workshop by solving simple data mining problems through a staged progression.

In the morning, participants will begin with a database containing multiple tables of information. Participants will each have a networked computer and may choose to work on exercises in pairs or individually. Attendees will determine which business questions will be considered, how they will be addressed using data mining, and how the data will be prepared for data mining. A divide-and-conquer approach will be used to carry out the “data understanding” and “data preprocessing” steps as participants work alone or in pairs to prepare the data for data mining.

In the afternoon, regression, decision tree, and neural network models will be created, and performance assessed. Participants may optimize these models by using advanced algorithm options, and report model performance on held-out data and summaries of key variables used in the models. Data preprocessing will be re-applied if models do not meet performance requirements. The team will determine which model best addresses the business question, and score the model on validation data.

Throughout the day, emphasis will be placed not only on the data mining process from a technical perspective, but also how to interpret, explain and apply results that have been discovered during the process.
 

   
WHO SHOULD ATTEND

LEVEL II COURSE PARTICIPANTS with an interest in applying the methods and techniques first-hand as presented and illustrated in the course

DATA MINING PRACTITIONERS who wish to expand their skills and analytical toolbox as well as hone proficiencies in maneuvering elusive data mining obstacles that stand in the way of superior model accuracy

BUSINESS ANALYSTS who must develop and interpret models, communicate the results and make actionable recommendations

FUNCTIONAL ANALYSTS: Customer Relationship Managers, Risk Analysts, Statistical Analysts, Business Forecasters, Inventory Flow Analysts, Direct Marketing Analysts, Medical Diagnostic Analysts, Market Timers, e-commerce System Architects and Web Data Analysts

 
BENEFITS OF ATTENDING
  • Driving the Level II presentation material through team exercises
  • Hands-on experience through the data mining process via a staged progression of exercises using application data
  • First-hand vendor-neutral exposure to various data mining tools
  • Real-world perspective of data preparation for data mining, model optimization and results interpretation
  • Cross-learning through team exercise comparisons to reveal what worked, what didn't, and why?

WHAT MAKES THIS COURSE UNIQUE


Unlike any other application-oriented offering on the market, Data Mining: Level III offers a structured approach to team-oriented data mining exercises in a lab environment. Since The Modeling Agency is not a tools vendor, participants enjoy a balanced, broad, and non-promotional perspective of data mining.
 


COURSE OUTLINE

MORNING

  • Team Meeting #1
    • Introduction
         • Purpose of Data Mining III: Practice
         • CRISP-DM
         • Description of Data Source for Modeling
    • Business Understanding
         • Prioritize Questions to be Addressed
         • Determine Method to Score Results
    • Assign Data Understanding Responsibilities
    • Result: List of Prioritized Business Questions and Corresponding Data
    • Mining Approaches
     
  • Breakout Session #1: Data Understanding
    • Summary Statistics
    • Visualization
    • Outlier Analysis
    • Missing Data Analysis
    • Create Mini-Report
     
  • Team Meeting #2
    • Data Assessment Summary
    • Assign Data Preprocessing Responsibilities
    • Result: Summaries of Available Modeling Data, and Recommendations for
    Their Use
     
  • Breakout Session #2: Data Preprocessing
    Correct Data Problems
    Create Features
    Create Mini-Report
     
  • Team Meeting #3
    Data Preprocessing Summary
    Join Data Modified During Breakout Sessions
    Determine Sampling Strategy
    Assign Modeling Responsibilities
    Result: Single Modeling Dataset

AFTERNOON

  • Breakout Session #3: Modeling and Evaluation
    • Build Decision Trees, Regression, and Neural Networks
    • Assess Results
    • Rebuild Models, Changing Modeling Parameters
     
  • Breakout Session #4: Model Evaluation and Assessment
    • Score Models on Testing Data
    • Rank Variable Importance to Models
    • Create Mini-Report
     
  • Team Meeting #4
    • Summarize Modeling Results
    • Assess Which Modeling Techniques Worked and Didn’t Work
    • Determine Needs for More Data Pre-Processing and Modeling
    • Assign Responsibilities
     
  • Breakout Session #5: Re-visit Modeling
    • Create Final Models
    • Create Mini-Report
    • Score Models on Validation Data
     
  • Team Meeting #5
    • Select Final Model to Use (if any)
         • Explain Reason for Selection
         • Assess Trade-offs Between Models
    • Assign List of Action Items Pending
     


   

Courses May Be Delivered At Your Site

Call (888) 742-2454 or send an email inquiry to receive a value-based
spreadsheet quotation for training at your site.

   
THE PRESENTER

DEAN ABBOTT has been applying advanced data mining, data preparation, and data visualization methods in real-world data intensive problems since 1986, including fraud detection, response modeling, survey analysis, planned giving, predictive toxicology, signal process, and missile guidance. In addition, he has developed and evaluated algorithms for use in commercial data mining and pattern recognition products, including polynomial networks, neural networks, radial basis functions, and clustering algorithms, and has consulted with data mining software companies to provide critiques and assessments of their current features and future enhancements.

Mr. Abbott is a seasoned instructor, having taught a wide range of data mining tutorials and seminars for a decade to audiences of up to 400, including DAMA, KDD, AAAI, and IEEE conferences. He possesses a unique talent to explain concepts in language readily understood by a wide range of audiences, including analytics novices, data analysts, statisticians, and business professionals. Mr. Abbott also has taught applied data mining courses for major software vendors, including Clementine (SPSS), Affinium Model (Unica Corporation), Model 1 (Group1 Software), and hands-on courses using S-Plus and Insightful Miner (Insightful Corporation), and CART (Salford Systems).  Participants' feedback consistently reinforces how Dean's energy and enthusiasm for data mining is contagious.
 
 
ATTENDEES' COMMENTS

"There is no better way to learn Data Mining than to do it yourself. Take this course for an invaluable hands-on experience. Even if your organization does not adopt Data Mining procedures as standard, you will have acquired a much needed skill set."

Raymond D. Mooring, PhD
Wage and Investment Research
Internal Revenue Service
 

"This course was fabulous. It was everything I hoped it would be and then some - technical and practical. Dean is amazing and the resources he gave to further my studies was also helpful.

Jenifer Underwood
Solutions Architect
Bayshore Solutions

 
"Dean Abbott’s effective communication and presentation skills provided us the confidence to understand the proper use of data mining in our new roles as analysts. His genuine interest and concerns for adaptation to each student level in addition to his experience and understanding enabled proactive class participation conducive to learning. This was extremely helpful for those of us with no data mining background. Our sincere thanks for an overall excellent experience!"

Ana Lemmon and Elisabetta Halkard
Office of Special Investigations
US Air Force

 
"The instructor and course material are first rate. Any organization that believes data mining should be a part of their business operations portfolio would be making a wise investment by attending this course."

Eric Rickard
Information Computing Sciences
SRI International
 

"Dean's presentation was quite thoughtful and very well organized. I came away with a solid map for the ever changing data mining landscape."

David Cousins
Divisional Scientist
BBN Technologies
 

Dean Abbott has successfully made an otherwise difficult subject matter easy to understand.

Roger Chua
Executive Vice President
Sencor Analytics
 

"The class was great! I was really impressed with Dean's knowledge, experience, and ability. He was able to answer everyone's questions thoroughly and tailor the class to individual needs. I learned so much about the data mining process, the different methods, and available tools. I highly recommend this course to both technical and non-technical people interested in leading-edge data mining methodologies and the application of current data mining software to marketing, business, and research endeavors."

Stephen Pearce
Preventive Medicine
Kaiser Permanente
 

Seating is limited to 18 participants.  Register early!
Proceed to the
On-Line Registration Form to reserve your space today.

    address
One Oxford Centre
301 Grant St, Ste 4300
Pittsburgh, PA 15219 USA
 

  training: 888.742.2454

 direct: 281.667.4200
 
Copyright © 2000 - 2008 The Modeling Agency. All rights reserved.