2nd ed. — John Wiley & Sons, Inc., 2014. — 336 p. — ISBN: 0470908742, 9780470908747.
The second edition of a highly praised, successful reference on data mining, with thorough coverage of big data applications, predictive analytics, and statistical analysis.
Includes new chapters on Multivariate Statistics, Preparing to Model the Data, and Imputation of Missing Data, and an Appendix on Data Summarization and Visualization.
Offers extensive coverage of the R statistical programming language.
Contains 280 end-of-chapter exercises.
Includes a companion website with further resources for all readers, and PowerPoint slides, a solutions manual, and suggested projects for instructors who adopt the book.
An Introduction to Data Mining
Data Preprocessing
Exploratory Data Analysis
Univariate Atatistical Analysis
Multivariate Statistics
Preparing to Model the Data
k-nearest Neighbor Algorithm
Decision Trees
Neural Networks
Hierarchical and k-Means Clustering
Kohonen Networks
Association Rules
Imputation of Missing Data
Model Evaluation Techniques
Appendix: Data Summarization and Visualization