Data Mining: Practical Machine Learning Tools and Techniques


  • Ian Witten, University of Waikato, Hamilton, New Zealand.
  • Eibe Frank, University of Waikato, Hamilton, New Zealand. Recipient of the 2005 ACM SIGKDD Service Award.
  • Mark Hall

Data Mining: Practical Machine Learning Tools and Techniques offers a thorough grounding in machine learning concepts as well as practical advice on applying machine learning tools and techniques in real-world data mining situations. This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.

Thorough updates reflect the technical changes and modernizations that have taken place in the field since the last edition, including new material on Data Transformations, Ensemble Learning, Massive Data Sets, Multi-instance Learning, plus a new version of the popular Weka machine learning software developed by the authors. Witten, Frank, and Hall include both tried-and-true techniques of today as well as methods at the leading edge of contemporary research.

View full description


Information systems practitioners, programmers, consultants, developers, information technology managers, specification writers, data analysts, data modelers, database R&D professionals, data warehouse engineers, data mining professionals, as well as professors and students of upper-level undergraduate and graduate-level data mining and machine learning courses who want to incorporate data mining as part of their data management knowledge base and expertise.


Book information

  • Published: January 2011
  • ISBN: 978-0-12-374856-0


"Co-author Witten is the author of other well-known books on data mining, and he and his co-authors of this book excel in statistics, computer science, and mathematics. Their in- depth backgrounds and insights are the strengths that have permitted them to avoid heavy mathematical derivations in explaining machine learning algorithms so they can help readers from different fields understand algorithms. I strongly recommend this book to all newcomers to data mining, especially to those who wish to understand the fundamentals of machine learning algorithms."--INFORMS Journal of Computing

"The third edition of this practical guide to machine learning and data mining is fully updated to account for technological advances since its previous printing in 2005 and is now even more closely aligned with the use of the Weka open source machine learning, data mining and data modeling application. Beginning with an introduction to data mining, the volume explores basic inputs, outputs and algorithms, the implementation of machine learning schemes and in-depth exploration of the many uses of the Weka data analysis software. Numerous illustration, tables and equations are included throughout and additional resources are available through a companion website. Witten, Frank and Hall are academics with the department of computer science at the University of Waikato, New Zealand, the home of the Weka software project."--Book News, Reference & Research

"I would recommend this book to anyone who is getting started in either data mining or machine learning and wants to learn how the fundamental algorithms work. I liked that the book slowly teaches you the different algorithms piece by piece and that there are also a lot of examples. I plan on taking a machine learning course this upcoming fall semester and feel that the book gave me great insight that the course will be based on mathematics more than I had originally expected. My favorite part of the book was the last chapter where it explains how you can solve different practical data mining scenarios using the different algorithms. If there were more chapters like the last one, the book would have been perfect. This book might not be that useful if you do not plan on using the Weka software or if you are already familiar with the various machine learning algorithms. Overall, Data Mining: Practical Machine Learning Tools and Techniques is a great book to learn about the core concepts of data mining and the Weka software suite."-- ACM SIGSOFT Software Engineering Notes

"This book is a must-read for every aspiring data mining analyst. Its many examples and the technical background it imparts would be a unique and welcome addition to the bookshelf of any graduate or advanced undergraduate student. The book is written for both academic and application-oriented readers, and I strongly recommend it to any reader working in the area of machine learning and data mining."--Computing

Table of Contents

PART I: Introduction to Data Mining

Ch 1 What's It All About?
Ch 2 Input: Concepts, Instances, Attributes
Ch 3 Output: Knowledge Representation
Ch 4 Algorithms: The Basic Methods
Ch 5 Credibility: Evaluating What's Been Learned

PART II: Advanced Data Mining

Ch 6 Implementations: Real Machine Learning Schemes
Ch 7 Data Transformation
Ch 8 Ensemble Learning
Ch 9 Moving On: Applications and Beyond

PART III: The Weka Data MiningWorkbench

Ch 10 Introduction to Weka
Ch 11 The Explorer
Ch 12 The Knowledge Flow Interface
Ch 13 The Experimenter
Ch 14 The Command-Line Interface
Ch 15 Embedded Machine Learning
Ch 16 Writing New Learning Schemes
Ch 17 Tutorial Exercises for the Weka Explorer