DATA MINING TOOLS FOR THE SIX SIGMA PRACTITIONER

Training materials: XL Miner software (trial version); Data Mining Tools student participant guide. Recommended (but not required) texts: The Elements of Statistical Learning by T. Hastie, R. Tibshirani, J. H. Friedman; Principles of Data Mining (Adaptive Computation and Machine Learning) by David J. Hand, Heikki Mannila, Padhraic Smyth.

Course agenda:

I. OVERVIEW OF DATA MINING
Definitions and tasks
Steps in the data mining process
Facts versus myths
Data mining applications

II. PREPATORY STEPS
Data preparation (missing data, outlier analysis, data types)
Data visualization
Data dictionaries

III. BACKGROUND ON MODELING
The curse of dimensionality
Notation and terms
Bias-variance tradeoff
Control of the bias-variance tradeoff
Error functions

IV. TRADITIONAL MODELS
Linear regression procedures
Logistic regression
Discriminant analysis
Nearest neighbors
Clustering algorithms

V. MODERN MODELS
Classification and Regression Trees
Neural networks
Bump hunting
Association rules
Evaluating and combining models (Bagging, boosting, MART)
Survey of recent developments (as time permits)


Home | Contact Us | FAQ | Site Map | Privacy Statement | Legal Statement