PAPER READING
Research papers of data mining in the follow categories will be
studied in-depth by individual students (including the data mining
aspects of
Bioinformatics):
- Data and application security
- Data mining and privacy
- Streaming data extraction and mining
- Dealing with large data either row-wise or column-wise
- Overfitting data in classification (AdaBoost)
- Association rules (FP-trees, Efficient implementation - e.g.,
MAFIA)
- Mining structured data
- Adversarial classification
- Semi-supervised learning
- Clustering data in forms of scenarios
Where to start (to be revised):
- Conferences such as AAAI, ICML, IJCAI, SIGKDD, ICDE, SIGMOD,
VLDB, ICDM, PAKDD, PKDD, ECML
- You can use websites such as KDNggets, Citeseer
- Journals such as AIJ, MLJ, DMKD, JAIR, JMLR, IEEE TKDE,
Bioinformatics, Genome Research, Nature, Science
What are needed for submission?
- A report includes summary, critique, and suggestions
- A set of powerpoint slides that explains the technical details
in-depth (aiming at a 20 minutes presentation - roughly 15- 20 slides)
- Some selected reports will be presented in class (we don't know
yet who will present - everyone should be prepared to present if he or
she is chosen to present)
- Late penalty (as discussed in class) applies after the deadline
How are presentation slides evaluated?
Each presentation is evaluated in five categories:
time control, organization, clarity,
preparation, and discussion.
Notes about presentation:
- Everyone is expected to read the listed papers to be presented.
- Presentation duration: 25 minutes including 5 minutes for
discussion. Three
questions with solutions suitable for a quiz and in-class
discussion should be submitted to the instructor before the
presentation with a hard copy of
the presentation slides.
PROJECTS
- Categories of projects
- Solving a suitable problem from design to
implementation
- Working on a specialized algorithm to make it accessible
publicly
- Challenging problems and applications of data mining:
identification and possible solutions
- Project proposal (no more than 3 pages), you should make clear
the following:
1. what you're going to do
2. why it is a useful project
3. how the project should be
evaluated in your view
4. what will be the values of
your project when it's done
The proposal is basically a
rough outline of your project report.
A good beginning is half the
success.
- Applications (as discussed in class)
- Data repositories
- Organizations (NIH, TGen, ...)
- Submission and deadlines: In all submissions, a copy is
required and please email the
links to your presentation slides using digital drop box at myASU
before the deadline. We will host all slides at myASU for student access.You
can keep revising your softcopy after the copy and the link are
submitted.
- Project Proposal: due on 2/28/2005.
- Progress Report: due on 3/28/2005.
Be brief. It is about the progress you have made and difficulties
encountered
with key references.
- Presentation Slides: due on 4/22/2005, (for
about 15 minutes presentation in powerpoint, including title,
problem statement, approach, and key results). Everybody should be
prepared to present in class.
- Final Report due on 5/3/05 by 5:00pm in hard
copie (be concise and self contained) everything about your
project with references.
You can submit it earlier than the deadline.
- Project Presentations
- The purpose of project presentation is to share the projects
among
us so that we know what others are doing. The project may be related to
what you are doing or are interested in.
- The slides should be accessible via Web. Please include the
URL of
your slides on the cover page.
- Participation and presenting projects in class is a key
element for
class participation.
- Final Report and/or Demo
- Please include Findings, Results, New Problems
- The length (or number of pages) is immaterial. In fact,
a concise technical report is most preferred.
- It is a concise and self-contained write-up such that
another
student can read it and repeat the work.
- It should at a minimum include (1) the description of
your project (2) the technical details (3) significance,
usefulness, or impact (4) findings or results (5) future work, (6)
important references.
- For an implementation related project, you need to
include a brief manual of how-to-use/development.
- You should try to convey all your efforts on the project
in the report in a simple manner.