Social-Dimension Approach to Classification in Large-Scale Networks
Arizona State University, Computer Science and Engineering, Data Mining and Machine Learning
Overview
Social media often provides social network information of users.
However, the relationship hidden in the connections are
inhomogeneous. Social dimensions are introduced to represent the
heterogeneous interactions people interact with each other (latent
affiliations one actor is involved in). Here, we list some of our
published work as well as the data sets and code used for experiments.
Data
Below are some of the data sets we have used for experiments in [2] and
[3]. All are in matlab format. Each data set has two variables:
network and groups. "network" is a symmetric sparse matrix representing
the interaction between users, and "groups" are the groups subscribed
by users. We use that as the class labels in our work.
- BlogCatalog
data of 10,312 nodes, 333, 983 links, and 39 categories.
- Flickr Data
of 80,513 nodes, 5, 899, 882 links, and 195 categories.
- YouTube Data
of 1, 138, 499 nodes, 2, 990, 443 links and 47 categories.
Code
People
Tutorial
References
Acknowledgements
This project is sponsored by AFOSR-FA95500810132.
Updated on 10/26/2009