HEPMASS-IMB is a benchmark dataset for signal-background classification in High-Energy Physics (HEP). It is derived from HEPMASS (Baldi et al.) by imbalancing it in two ways: over the class labels and over the mass labels.

  • It has 27 feature columns (named f0 to f26) and a 28th feature, the mass (named mass).
  • The 27 features are already normalized to have approximately zero mean and unit variance.
  • The mass feature has five unique values: 500, 750, 1000, 1250, and 1500.
  • There are two class labels: 1 (signal) and 0 (background).
  • The dataset describes the decay of a hypothetical particle: X → tt̄ → W⁺b W⁻b̄.

Further details about the original dataset are available here, and our modifications are described in our paper.

NOTE:

  • The files provided here contain only the training set, since that is the part that differs from the original HEPMASS.
  • The label column has been renamed from "# label" to "type".
  • There are two new columns: name and weight.
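
As a rough illustration of the layout described above, the following sketch checks a CSV header against the listed columns and counts events per (type, mass) pair, which makes both imbalances visible. It assumes the files are comma-separated with a header row; the helper names check_header and count_by_class_and_mass are hypothetical, and the small in-memory sample is made up and only stands in for a real file.

```python
import csv
import io
from collections import Counter

# Column names taken from the description above: f0..f26, mass, type, name, weight.
FEATURES = [f"f{i}" for i in range(27)]
EXPECTED = FEATURES + ["mass", "type", "name", "weight"]


def check_header(fieldnames):
    """Return the expected columns that are missing from a CSV header."""
    return [c for c in EXPECTED if c not in fieldnames]


def count_by_class_and_mass(rows):
    """Count events per (type, mass) pair to expose both imbalances."""
    counts = Counter()
    for row in rows:
        # int(float(...)) accepts both "500" and "500.0" style values.
        key = (int(float(row["type"])), int(float(row["mass"])))
        counts[key] += 1
    return counts


def fake_row(label, mass):
    """Build one synthetic CSV line (all feature values are placeholders)."""
    return ",".join(["0.0"] * 27 + [str(mass), str(label), "x", "1.0"])


# Synthetic sample standing in for a real training file (assumption: CSV with header).
sample = "\n".join(
    [",".join(EXPECTED), fake_row(1, 500), fake_row(0, 500),
     fake_row(0, 500), fake_row(0, 1500)]
)

reader = csv.DictReader(io.StringIO(sample))
missing = check_header(reader.fieldnames)
counts = count_by_class_and_mass(reader)

for (label, mass), n in sorted(counts.items()):
    print(f"type={label} mass={mass}: {n} events")
```

To apply this to a real file, replace the io.StringIO(sample) object with an opened file handle; a non-empty missing list would indicate that the file does not follow the column layout described here.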

...

Dataset for Data Quality Monitoring (DQM) of Drift Tube Chambers. The dataset includes a reference sample and smaller data samples characterized by anomalous effects. Plots for data visualization are provided.