26 datasets found

Tags: Technology Licenses: Creative Commons CCZero (CC0-1.0);Open Data Commons Public Domain Dedication and Licence (PDDL-1.0);

Filter Results
  • KC1 Software defect prediction

    One of the NASA Metrics Data Program defect data sets. Data from software for storage management for receiving and processing ground data. Data comes from McCabe and Halstead...
  • KC2 Software defect prediction

    One of the NASA Metrics Data Program defect data sets. Data from software for science data processing. Data comes from McCabe and Halstead features extractors of source code....
  • JM1/software defect prediction

    JM1/software defect prediction
  • PC3 Software defect prediction

    One of the NASA Metrics Data Program defect data sets. Data from flight software for earth orbiting satellite. Data comes from McCabe and Halstead features extractors of source...
  • PC4 Software defect prediction

    One of the NASA Metrics Data Program defect data sets. Data from flight software for earth orbiting satellite. Data comes from McCabe and Halstead features extractors of source...
  • GEMLeR

    GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation...
  • MagicTelescope

    The data are MC generated (see below) to simulate registration of high energy gamma particles in a ground-based atmospheric Cherenkov gamma telescope using the imaging...
  • Artificial characters database

    This database has been artificially generated. It describes the structure of the capital letters A, C, D, E, F, G, H, L, P, R, indicated by a number 1-10, in that order...
  • Genuine and forged banknotes

    Dataset about distinguishing genuine and forged banknotes. Data were extracted from images that were taken from genuine and forged banknote-like specimens. For digitization, an...
  • Emotiv EEG Neuroheadset

    All data is from one continuous EEG measurement with the Emotiv EEG Neuroheadset. The duration of the measurement was 117 seconds. The eye state was detected via a camera during...
  • Gas Sensor Array Drift Dataset Data Set

    This archive contains 13910 measurements from 16 chemical sensors utilized in simulations for drift compensation in a discrimination task of 6 gases at various levels of...
  • QSAR biodegradation Data Set

    The QSAR biodegradation dataset was built in the Milano Chemometrics and QSAR Research Group (Università degli Studi Milano – Bicocca, Milano, Italy). The research leading to...
  • Wall-Following Robot Navigation Data Data Set

    The data were collected as the SCITOS G5 robot navigates through the room following the wall in a clockwise direction, for 4 rounds, using 24 ultrasound sensors arranged...
  • Seismic-bumps Data Set

    The data describe the problem of high energy (higher than 10^4 J) seismic bumps forecasting in a coal mine. Data come from two of longwalls located in a Polish coal mine.
  • Semeion

    Semeion Handwritten Digit Data Set, where 1593 handwritten digits from around 80 persons were scanned and documented. The each of the 256 variables V1 - V256 describe one of the...
  • Micro Mass

    MicroMass (pure spectra version) is a dataset to explore machine learning approaches for the identification of microorganisms from mass-spectrometry data.
  • Phishing Websites

    One of the challenges faced by our research was the unavailability of reliable training datasets. In fact this challenge faces any researcher in the field. However, although...
  • Higgs Boson detection data

    Higgs Boson detection data. The data has been produced using Monte Carlo simulations. The first 21 features (columns 2-22) are kinematic properties measured by the particle...
  • Quickbird imagery

    High-resolution Remote Sensing data set (Quickbird). Small number of training samples of diseased trees, large number for other land cover. Testing data set from stratified...
  • Isolated Letter Speech Recognition

    ISOLET (Isolated Letter Speech Recognition) dataset was generated as follows: 150 subjects spoke the name of each letter of the alphabet twice. Hence, there are 52 training...
You can also access this registry using the API (see API Docs).