Fast Correlation-Based Filter Software for Feature Selection


Arizona State University, Computer Science and Engineering, Data Mining and Machine Learning

Overview

    Feature selection is a preprocessing technique frequently used in data mining and machine learning tasks. It can reduce dimensionality, remove irrelevant data, increase learning accuracy, and improve results comprehensibility.  FCBF is a  fast correlation-based filter algorithm designed for high-dimensional data and has been shown effective in removing both irrelevant features and redundant features.

About FCBF Software

    This software package is prepared in Java. It is provided free of charge to the research community as an academic software package with no commitment in terms of support or maintenance. Data may need to be discretized before running FCBF. Please refer to the paper below. A discretization package is also available on our web site.

Downloads

        The software is freely available for academic use, and can be downloaded here for Linux
        Run "tar xvfz FCBF.tar.gz" to obtain all the files including README and some sample data files.
        README provides the details about how to run, how to prepare the data, and how to read the results.

People

References

Created on 3/15/2004

Updated on 4/14/2004