PAPER READING

Research papers of data mining in the follow categories will be studied in-depth by individual students (including the data mining aspects of Bioinformatics):
  1. Data and application security
  2. Data mining and privacy
  3. Streaming data extraction and mining
  4. Dealing with large data either row-wise or column-wise
  5. Overfitting data in classification (AdaBoost)
  6. Association rules (FP-trees, Efficient implementation - e.g., MAFIA)
  7. Mining structured data
  8. Adversarial classification
  9. Semi-supervised learning
  10. Clustering data in forms of scenarios
Where to start (to be revised):
What are needed for submission?
How are presentation slides evaluated?

    Each presentation is evaluated in five categories: time control, organization, clarity, preparation, and discussion.

Notes about presentation:

PROJECTS

        1. what you're going to do
        2. why it is a useful project
        3. how the project should be evaluated in your view
        4. what will be the values of your project when it's done

        The proposal is basically a rough outline of your project report.

        A good beginning is half the success.

    1. Applications (as discussed in class)
    2. Data repositories
    3. Organizations (NIH, TGen, ...)
    You can submit it earlier than the deadline.