CSCE 5380 Data Mining

Instructor: Dr. Yan Huang             Fall 2010


 

 

Announcements | Syllabus | Class Notes and Schedule | Assignments | Books | Links

 

 

Announcements

 

  • 11/28, the parameters of DBScan have been provided in project 2. The due date of project 2 is changed to Dec. 02.
  • 11/11, assignment 3 has been posted.
  • 10/12, this is a small correction of project 1 (Naïve Bayes Simple Classifier -> Naïve Bayes). Please download it again.
  • 09/21, please go to F218 for a lab session for the Oct. 05 class.
  • 09/21, project 1 and project 2 have been posted.
  • 08/30, assignment 1 and assignment 2 have been posted.
  • 08/25, Welcome to the data mining class!

 

 

 

Announcements | Syllabus | Class Notes and Schedule | Assignments | Books | Links

 

 

Syllabus [pdf]

Instructor:

Dr. Yan Huang

Office:

Discovery Park, F251, tel: 940-369-8353

Email:

huangyan at cs.unt.edu

Class hours:

TTh 11:00am -12:20pm, Discovery Park B192

Office hours:

TTH 9:00am- 10:00am

 

 

Teaching assistant:

Indrakanti, Saratchandra

Office:

F205

Email:

SaratchandraIndrakanti@my.unt.edu

Office hours:

Fridays 12:00pm to 4:00 p.m.

 

 

 

 

Course description:

This course will provide a broad and rapid introduction to the field of data mining. We will provide a general introduction to main data mining tasks, e.g. classification, clustering, association rules, graph mining, sequential pattern mining, and outlier detection as well as some of the latest developments, e.g. mining spatial data and web data.

 

 

 

 

Announcements | Syllabus | Class Notes and Schedule | Assignments | Books | Links

 

 

Class Notes

 

·       Notes from the book authors website

 

·       Exam I, Sep. 30, in-class.

·       Exam II, Nov. 09, in-class.

·       Final Exam, Dec. 16, 10:30am-12:30pm.

·       Project presentation: pre-final week.

 

 

Announcements | Syllabus | Class Notes and Schedule | Assignments | Books | Links

 

 

Assignments

1.     Assignment 1, due Sep. 23.

2.     Assignment 2, due Oct. 26.

3.     Assignment 3, due Dec. 02.

4.     Project 1, due Oct. 21.

5.     Project 2, due Dec. 02.

 

 

Announcements | Syllabus | Class Notes and Schedule | Assignments | Books | Links

 

 

Books

Required textbook

Introduction to Data Mining

Pang-Ning Tan, Michigan State University
Michael Steinbach, University of Minnesota
Vipin Kumar, University of Minnesota

ISBN: 0-321-32136-7
Publisher: Addison-Wesley

Buy this book (new) from Amazon.
Compare prices (new or used) at BestBookBuys

 

Recommended readings
  • J. Han and M. Kamber (2000) Data Mining: Concepts and Techniques , Morgan Kaufmann
  • U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, R. Uthurusamy, Advances in Knowledge Discovery and Data Mining, The MIT Press, 1996
  • U. Fayyad, G. Grinstein, and A. Wierse, Information Visualization in Data Mining and Knowledge Discovery, Morgan Kaufmann, 2001
  • D. J. Hand, H. Mannila, and P. Smyth, Principles of Data Mining, MIT Press, 2001.
  • T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer-Verlag, 2001
  • T. M. Mitchell, Machine Learning, McGraw Hill, 1997.
  • S. M. Weiss and N. Indurkhya, Predictive Data Mining, Morgan Kaufmann, 1998
  • H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations, Morgan Kaufmann, 2001






 

Announcements | Syllabus | Class Notes and Schedule | Assignments | Books | Links

 

 

Links

1.     The book authors website

2.     UCI Machine Learning Data Repository 

3.     Weka

4.     KDD CUP