Data Mining: Concepts, Models, Methods, and Algorithms

ISBN: 0471228524
Pages: 360
Date: 2002-10-25
Publisher: Wiley-IEEE Press
Rating: 3.0


   

Book Description

A comprehensive introduction to the exploding field of data mining

We are surrounded by data, numerical and otherwise, which must be analyzed and processed to convert it into information that informs, instructs, answers, or otherwise aids understanding and decision-making. Due to the ever-increasing complexity and size of today's data sets, a new term, data mining, was created to describe the indirect, automatic data analysis techniques that utilize more complex and sophisticated tools than those which analysts used in the past to do mere data analysis.

Data Mining: Concepts, Models, Methods, and Algorithms discusses data mining principles and then describes representative state-of-the-art methods and algorithms originating from different disciplines such as statistics, machine learning, neural networks, fuzzy logic, and evolutionary computation. Detailed algorithms are provided with necessary explanations and illustrative examples.

This text offers guidance on how and when to use a particular software tool (with its companion data sets) from among the hundreds offered when faced with a data set to mine. This allows analysts to create and perform their own data mining experiments using their knowledge of the methodologies and techniques provided.

This book emphasizes the selection of appropriate methodologies and data analysis software, as well as parameter tuning. These critically important, qualitative decisions can only be made with the deeper understanding of parameter meaning and its role in the technique that is offered here. Data mining is an exploding field and this book offers much-needed guidance to selecting among the numerous analysis programs that are available.

Reviews From AMAZON.COM


Pattern recognition or machine learning, not data mining


This book can be used as an introduction to pattern recognition or machine learning rather than to data mining. Data mining does appear here and there, but mostly it is the classical pattern recognition and machine learning material (data reduction, clustering, neural networks) with very few illustrations from data mining. An introduction to genetic algorithms and fuzzy sets is also in the book, just in case, I suppose. If you'd like more specific data mining knowledge, look elsewhere.

Survey, not how-to

The subtitle advertises "concepts, models, methods, and algorithms". Concepts and models, yes; methods, a few; algorithms, nearly none that you could actually code.

This book's strength is its breadth. It offers brief tastes of many topics. It discusses early data preparation, including reduction of dimension and handling of outliers and missing values. It emphasizes that different kinds of questions must be addressed in different ways. The rest of the book then covers decision rules of different sorts, clustering, neural networks, genetic algorithms, fuzzy logic, and data visualization. Each chapter includes references and comments on what to expect from each reference - a nice touch. The end of the book names a wide variety of web sites, products, and companies dedicated to data mining.

The big problem, however, is that the book is shallow. With a few exceptions, it just names techniques instead of giving descriptions that a programmer can use. For example, the discussion of missing data barely mentions the idea that imputed (made-up) values must be tailored to the specific analysis technique, so as to minimize their effect on results. There are exceptions, of course. Neural nets get a relatively detailed treatment. The author gives illustrative examples of genetic algorithms, but those are thin and tangential to data mining. The section on data visualization could have been much more lively. There is a huge body of visual technique, some bordering on artistry, that can present high-dimensional data to the human pattern-detection faculty, and samples are readily available. This book's examples are small and drab, though. Also, it completely ignores the research in auditory and tactile data representation, and omits discussion of the graphic design principles required for effective presentation.

What really bothered me were examples of sheer carelessness. A number of figures, including 4.8 and 9.9, contain errors severe enough to interfere with the point being made. Important relationships are simply illegible. Books like this aren't cheap - I would have hoped that the author would show a little more respect for the people paying the money.

This book may have value as a survey resource, but isn't for the reader who wants to implement the algorithms. Its bibliography is informative, but not a major asset. Indices of current products and web sites nearly guarantee early obsolescence. Look this over thoroughly before you commit your time and money to it.
