Skip to main content
  1. Projects/

Higgs Boson ML Challenge

Abantika Bhuti
Author
Abantika Bhuti
Table of Contents

This article is being updated

Introduction
#

The Higgs Machine Learning Challenge was an open data analysis competition that took place between May and September 2014. Samples of simulated data from the ATLAS Experiment at the LHC corresponding to signal events with Higgs bosons decaying to \(\tau^+\tau^-\) together with background events were made available to the public through the website of the data science organization Kaggle. Participants attempted to identify the search region in a space of 30 kinematic variables that would maximize the expected discovery significance of the signal process.1

Evaluation
#

The evaluation metric is the approximate median significance (AMS):

$$\text{AMS} = \sqrt{2\left[(s + b + b_r)\ln\left(1 + \frac{s}{b + b_r}\right) - s \right]}$$

where

  • \(s, b\) : unnormalized true positive and false positive rates, respectively,
  • \(b_r=10\) is the constant regularization term,
  • \(ln\) is the natural log.

Links #

Links
  • Detailed overview of my project: Github Repo
  • See the official challenge and datasets as hosted on Kaggle.
  • For more information on the statistical model and the derivation of the metric, see the official technical documentation.
  • The extended version of the dataset can be found here.

  1. C Adam-Bourdarios et al 2015 J. Phys.: Conf. Ser. 664 072015 [link] ↩︎