CSCE 5380 Data Mining
Instructor: Dr. Yan Huang
Fall 2010
Announcements
- 11/28: The DBSCAN parameters have been added to Project 2. The Project 2 due date has been changed to Dec. 02.
- 11/11: Assignment 3 has been posted.
- 10/12: A small correction has been made to Project 1 (Naïve Bayes Simple Classifier -> Naïve Bayes). Please download it again.
- 09/21: For the Oct. 05 class, please go to F218 for a lab session.
- 09/21: Project 1 and Project 2 have been posted.
- 08/30: Assignment 1 and Assignment 2 have been posted.
- 08/25: Welcome to the data mining class!
Syllabus [pdf]
Instructor: Dr. Yan Huang
Office: Discovery Park, F251, tel: 940-369-8353
Email: huangyan ‘at’ cs.unt.edu
Class hours: TTh 11:00am-12:20pm, Discovery Park B192
Office hours: TTh 9:00am-10:00am

Teaching assistant: Indrakanti, Saratchandra
Office: F205
Email: SaratchandraIndrakanti@my.unt.edu
Office hours: Fridays 12:00pm-4:00pm

Course description:
This course provides a broad and rapid introduction to the field of data mining. We will give a general introduction to the main data mining tasks (e.g., classification, clustering, association rules, graph mining, sequential pattern mining, and outlier detection) as well as some of the latest developments, such as mining spatial data and web data.

Class Notes
· Notes from the book authors’ website
· Exam I, Sep. 30, in-class.
· Exam II, Nov. 09, in-class.
· Final Exam, Dec. 16, 10:30am-12:30pm.
· Project presentation: pre-final week.

Assignments
1. Assignment 1, due Sep. 23.
2. Assignment 2, due Oct. 26.
3. Assignment 3, due Dec. 02.
4. Project 1, due Oct. 21.
5. Project 2, due Dec. 02.

Books
Required textbook
Introduction to Data Mining
Pang-Ning Tan, Michigan State University
Michael Steinbach, University of Minnesota
Vipin Kumar, University of Minnesota
ISBN: 0-321-32136-7
Publisher: Addison-Wesley
Buy this book (new) from Amazon.
Compare prices (new or used) at BestBookBuys.
Recommended readings
- J. Han and M. Kamber, Data Mining: Concepts and Techniques, Morgan Kaufmann, 2000.
- U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, Advances in Knowledge Discovery and Data Mining, The MIT Press, 1996.
- U. Fayyad, G. Grinstein, and A. Wierse, Information Visualization in Data Mining and Knowledge Discovery, Morgan Kaufmann, 2001.
- D. J. Hand, H. Mannila, and P. Smyth, Principles of Data Mining, MIT Press, 2001.
- T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer-Verlag, 2001.
- T. M. Mitchell, Machine Learning, McGraw Hill, 1997.
- S. M. Weiss and N. Indurkhya, Predictive Data Mining, Morgan Kaufmann, 1998.
- I. H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations, Morgan Kaufmann, 2001.

Links
1. The book authors’ website
2. UCI Machine Learning Data Repository
3. Weka
4. KDD CUP