Data Mining Research Group – Weekly Presentations

Summer 2005

S. No

Presenter

Date

Paper Title

Paper Author(s)

Journal/Conference

1.

Lance Parsons

06-03-2005

Multiclass classification of microarray data with repeated measurements: application to cancer

Yeung, Ka Yee and Bumgarner, Roger E

Genome Biol. 2003;4(12):R83

2.

Alan Zheng Zhao

06-10-2005

Overfitting in Making Comparisons Between variable Selection Methods

Reunanen, Juha

Journal of Machine Learning Research 3 (2003) 1371-1382

3.

Alan Zheng Zhao

 

Steven Day (cancelled)

06-17-2005

Margin based feature selection - theory and algorithms

Ran Gilad-Bachrachy, Amir Navotz, and Naftali Tishby

ICML 2004

4.

Nitin Agarwal

06-24-2005

Web page categorization without the web page

MinYen Kan

WWW 2004

Somnath Shahapurkar

Comparison of Self-Organizing Map with K-Means Hierarchical Clustering for Bioinformatics Applications

Somnath S. Shahapurkar, Malur K. Sundareshan

ICJNN 04

5.

Niyati Parikh

07-01-2005

Data mining in wireless sensor networks based on artificial neural-networks algorithms (Slides)

Andrea Kulakov and Danco Davcev

SDM 2005 – Workshop on Datamining in Sensor Networks

6.

Sai Moturu

07-08-2005

?

?

?

7.

Lei Yu

Everyone

07-15-2005

Review Lei’s Defense

Late summer project updates from all group members

--

--

8.

 

07-22-2005

Discuss ICDM paper reviews

--

IEEE ICDM 05

9.

Ashutosh Tiwari

07-29-2005

A Framework for Projected Clustering of High Dimensional Data Streams (Slides)

Charu C. Aggarwal, Jiawei Han, Kianyong Want, Philip S. Yu

VLDB 2004

10.

Alan (Zheng) Zhao

08-05-2005

Intro to Cox Survival Models

--

--

11.

Niyati Parikh

08-12-2005

Update on current research

--

--

12.

Somnath Shahapurkar

08-19-2005

Update on current thesis topic research

--

--

Spring 2005

S. No

Presenter

Date

Paper Title

Paper Author(s)

Conference

1.

Nitin Agarwal

02-04-2005

Subspace Selection for Clustering High-Dimensional Data

Christian Baumgartner and Claudia Plant and Karin Kailing and Hans-Peter Kriegel and Peer Kroger

In proceedings of 4th IEEE International Conference on Data Mining (ICDM'04)

2.

Niyati Parikh

03-25-2005

An Iterative Method for Multi-class Cost-Sensitive Learning

Naoki Abe and bianca Zadrozny and John Langford

 KDD '04

3.

Alan Zheng Zhao

04-01-2005

 Learning with Local and Global Consistency (Slides)

D. Zhou, O. Bousquet, T.N. Lal, J.Weston and B.Scholokopf  

 NIPS '04

4.

Lei Tang

04-15-2005

 Support Vector Classification with Input Data Uncertainty (Slides)

 Jinbo Bi and Tong Zhang

 NIPS '04

5.

Surendra Singhi

04-29-2005

Choosing Between Two Learning Algorithms Based on Calibrated Tests (Slides)

Remco R. Bouckaert

 ICML '03

6.

Sai Moturu

05-06-2005

 Evaluation and Optimization of Clustering in Gene Expression and Data Analysis (Slides)

 A. Fazel Famili, Ganming Liu and Ziying Liu

 Bioinformatics

7.

Lance Parsons

05-13-2005

Single Nucleotide Polymorphism (Slides)

 

 

8.

Ashutosh Tiwari

05-20-2005

Clustering Distributed Data Streams in Peer-to-Peer Environments

 

 

9.

Alan Zheng Zhao

05-26-2005

Attribute dependencies, understandability and split selection in tree based models

 Marko Robnik- Sikonja and Igor Kononenk

 ICML’99

Fall 2004 (Group Members)

S. No

Presenter

Date

Paper Title

Paper Author(s)

Conference

1.

Lance Parsons

09-03-2004

Mining Coherent Gene Clusters from Gene-Sample-Time Microarray Data

(Slides)

Daxin Jiang, Jian Pei, Murali Ramanathan, Chun Tang, Aidong Zhang

KDD-2004 Conference

2.

Lei Yu

 

A Pitfall and Solution in Multi-Class Feature Selection for Text
Classification

George Forman

ICML-2004 Conference

3.

Ehtesham

 

A Framework for Ontology-Driven Subspace Clustering

Jinze Liu, Wei Wang, and Jiong Yang

SIGKDD-2004 Conference

4.

Steve

 

CrossMine: Efficient Classification Across Multiple Database Relations

Xiaoxin Yin, Jiawei Han, Jiong Yang, and Philip yu

ICDE-2004 Conference

5.

Nitin

 

Web Usage Mining Based on Probabilistic Latent Semantic Analysis

Xin Jin, Yanzan Zhou, and Bamshad Mobasher

SIGKDD-2004 Conference

6.

Magdiel

09-17-2004

Clustering Time Series from ARMA Models with Clipped Data (Slides)

A. J. Bagnall and G. J. Janacek

SIGKDD-2004 Conference

7.

Alan

 

FARMER: Finding Interesting Rule Groups in Microarray Datasets

Gao Cong, Anthony K. H. Tung, Xin Xu, Feng Pan, and Jiong Yang

SIGMOD-2004 Conference

8.

Lei Tang

 

Ensemble selection from library of models

Rich Caruana, Alexandru Niculescu-Mizil, Geoff Crew, and Alex Ksikes

ICML-2004 Conference

9.

Surendra

 

 

 

 

10.

Magdiel

10-08-2004

A Wavelet-Based Anytime Algorithm for K-Means Clustering of Time Series

(Slides)

M. Vlachos, J.Lin, E. Keogh, D. Gunopulos

SIAM-2003 Conference

11.

Ashutosh Tiwari

10-15-2004

A Tutorial on Time-Series Data (Part-1)