CSE 591 DATA MINING
(August 21 - December 5, 2000, Fall)
I hear, I forget; I see, I remember; I do, I understand. - Proverb
Your suggestions are most welcome. Please send email to hliu@asu.edu
OUTLINE
-
Data Mining and Its Impact, 1
-
Classification Methods (ID3, Nearest Neighbor, NBC, Neural Networks), 3
-
Performance Evaluation, (Measures, Comparison), 2
-
Data Preprocessing (Feature Selection, Discretization, Sampling), 3
-
Clustering Methods (K-Means/EM, COBWEB, BIRCH), 2
-
Association Rules (APRIORI, Parallel, Multi-Level), 3
-
Data Warehousing (Schema, Data Cube, Data Marting), 2
-
Semi-Structured Data (XML, RDF), 2
-
Web Mining (Search, Mining), 2
-
Applications (Customer Retention, Motorola
Case Study (invited talk by Mike Gardner on Nov 1, 2000)), 2, 22
Readings (There is a 10% credit for class
participation)
Notes
ASSIGNMENT
-
Your Ideas about Data Mining (5% bonus)
-
Paper Presentation Guidelines and Schedule
(20%)
-
Project Proposal Presentation (5%)
-
Proposal due on 9/18/00
-
Proposal Presentation on 9/20/00
-
Every student should prepare no more than 3 slides and will make a 5-minute
presentation
-
Project Presentations on 11/22, 11/27, 11/29,
and 12/4/00 (10%)
-
Deadline for your email submission of the links to your presentation
slides: 11/20 Monday, 5:00pm
-
The presentation sequence will be first-submit-last-present as we
did in the proposal presentations
-
Each student will have 15 minutes for presentation and 3 minutes for Q&A
-
If anyone wishes to present earlier (on 11/20), we can try to arrange 2
presentations.
-
If necessary, we may arrange an extra presentation slot on 12/1 Friday
as we have 18 students.
PROJECT
You're welcome to discuss with the instructor about your project ideas.
-
Categories of projects
-
Self-proposing: solving a suitable problem from design to implementation
-
Establishing a mining environment for the course: installation and maintenance
-
Survey of journal quality on a sub-field related to data mining
-
Challenging problems and applications of data mining: identification and
possible solutions
-
Report and/or Demo (20%)
-
Findings, Results, New Problems
-
Deadline: 12/4 Monday, 5:00pm
-
About the report
-
The length (or number of pages) is not an issue. A concise technical
report is preferred.
-
It is a concise and self-contained write-up such that another student
can read it and carry out the work.
-
It should at a minimum include (1) the description of your project (2)
the technical details (3) significance, usefulness, or impact (4)
findings or results (5) future work.
-
For an implementation related project, you need to include a brief manual
of how-to-use/development.
-
You should try to convey all your efforts on the project in the report
in a simple manner.
QUIZ or EXAM
-
There are about 7 quizzes and no exams (35%).
-
If you're a student taking the class, please sign in so you can be informed
of anything related to homework and projects in time.
LINKS
-
KDNuggets - a comprehensive site
for many resources of KDD
-
FAQ's
Prepared by Huan Liu on
July 22, 2000
Last updated: November 20, 2000