Introduction to Data Mining

Here's a quick guide to getting started in data mining.

You may also want to look at Data Mining Books and Data Mining and Warehousing

Introduction

Data mining (also known as Knowledge Discovery in Databases - KDD) has been defined as "The nontrivial extraction of implicit, previously unknown, and potentially useful information from data"[1] It uses machine learning, statistical and visualization techniques to discover and present knowledge in a form which is easily comprehensible to humans.

Applications of Data Mining

Some example application areas are listed under Applications Of Data Mining

A resourceful blog on data mining specially for novices with a good and easy to understand introduction to data mining and good examples of data mining applications to solve real life problems.

Net accessible resources

Two main websites provide information on data mining (including pointers to other relevant sites). They are

The Data Mine The Data Mining Wiki - http://www.The-Data-Mine.com

Featuring articles on data mining, a database of software and information on Data Mining Organizations. It attempts to provide links to as much of the available data mining information on the net as is possible. Contributions to this effort are very welcome.

Knowledge Discovery Mine - http://www.kdnuggets.com

The Knowledge Discovery Mine has the KDD FAQ, a comprehensive catalog of tools for discovery in data, as well as back issues of the KDD-Nugget mailing list.

Online Introductions

Journals

Business and Industry Publications

Academic Journals

Journals relevant to Data Mining

Introductory Articles

[1] W. Frawley and G. Piatetsky-Shapiro and C. Matheus, Knowledge Discovery in Databases: An Overview. AI Magazine, Fall 1992, pgs 213-228

[2] "State of The Art" in Byte Magazine October 1995, contains three articles on Data Mining and Data Warehousing - http://www.byte.com/art/9510/sec8/sec8.htm bissell carpet steam cleaner

[3] W. H. Inmon and S. Osterfelt, Understanding Data Pattern Processing QED Techincal Publishing Group, 1991, Wellesley, MA. I haven't see this book myself, but I saw Gregory Piatetsky Shapiro? describe it as "a business-oriented, nontechnical book" Other Bissell Products

[4] Cheryl Gerber, Excavate Your Data. Datamation 42(9), May 1996. "Datamining could be your No. 1 strategic weapon--and source of profit--in dissecting archival information. But with its roots in machine learning, this esoteric technology takes some time to master." - http://www.datamation.com

[5] Usama M. Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth, and Ramasamy Uthurusamy, Advances in Knowledge Discovery and Data Mining. Published by the AAAI Press / The MIT Press - http://aaai.org/Press/Books/Fayyad/fayyad.html

[6] Herb Edelstein, Data Mining: Exploiting the Hidden Trends in Your Data. DB2 Online Magazine. bissell proheat suncast db5000 deck escali digital scale

"This article, adapted from the Two Crows report "Data Mining: Products, Applications, Technology," introduces you to data mining technology and explains how IBM's Intelligent Miner can help you exploit hidden trends in your data." - http://www.db2mag.com

Larger Bibliography

See: Data Mining Bibliographies

See also:

Log In or Register to see Topic History, Edit or Add New Pages
Topic revision: r20 - 24 Jun 2009 - 15:03:21 - Pro Lan

 
This Wiki powered by TWiki This Wiki is powered by the TWiki collaboration platformCollection Copyright © Andy Pryke.
Ideas, requests, problems regarding The Data Mine? Send feedback