Online machine learning repository

By | January 5, 2008

The UCI team from the Center for Machine Learning and Intelligent Systems has performed a major update and relaunch of the machine learning repository website; this is a great resource for those specializing in machine learning. The site hosts over 160 data sets for machine learning problems donated by research from around the world.

We have developed a new search portal that allows for faceted browsing of over 160 different data sets, using criteria such as data type, attribute type, number of instances, and number of attributes. The web site allows users to easily find data sets that satisfy particular criteria, such as “show all data sets containing only real-valued attributes and rank by number of instances.” In collaboration with Rexa.info, the site now also provides links to published papers that have cited each data set.

We are always interested in new data set donations. If you have a data set that you would like to make available to the entire machine learning community, please feel free to donate using our automated web form. We thank all the previous donors for making this Repository a success.

Arthur Asuncion and David Newman, University of California, Irvine.

Visit (and bookmark) the UCI Machine Learning Repository.