CSE 572 DATA MINING
(January 16 – May 1, Spring 2007)
I hear, I forget; I see, I remember; I do, I understand. - Proverb
We will work together to hear, see, and do in this class in many forms,
including lectures, invited talks, discussions, a research paper reading
assignment, a project, and presentations, in addition to homework, quizzes, and
exam(s). We will create opportunities to learn from each other. If you
find anything interesting about data mining, please share it with us all. The
ultimate goal is to provide an environment that unleashes students'
creativity in producing new work with impact.
We want to strike a balance between exploiting and exploring in learning with
respect to passive and active learning.
Your suggestions are most welcome. Please send email to huan.liu at asu.edu
OUTLINE
- Introduction to Data Mining
- Classification Methods (ensemble methods, SVMs, skewed data, cost-sensitive classification)
- A Brief Review of Probability and Entropy
- Performance Evaluation (Measures, Comparison between two algorithms)
- Data, Data Preparation, and Data Preprocessing (feature selection, discretization, sampling, instance selection)
- Clustering Methods (subspace clustering, CLIQUE)
- Association Rules
- Current Challenges
- The Shark Toothed Elephant – an invited talk by Bill Rose, VP GIS of Avnet
- USuggest - an invited talk on data mining in a Web application
- Some Thought-Provoking Applications (steganography and steganalysis, streaming data extraction, gene selection)
ASSIGNMENTS (To be updated)
- Please check Assignments at myASU.
- There is extra credit (up to 10%) for class participation and discussion.
- Three to five homework assignments.
- Research paper reading assignment and selected presentations (we will discuss more in class).
- Everyone will need to work on a course project.
- A theme-based project for all in class.
- Project final report and demo.
More on Paper Reading Assignment, Project and its Due Dates
EXAMS
- Two exams: the first (mid-term) exam is on Monday, March 5.
LINKS
Prepared by Huan Liu on January 9, 2007
Last updated: April 17, 2007