Loading…

A method for approximating optimal statistical significances with machine-learned likelihoods

Machine-learning techniques have become fundamental in high-energy physics and, for new physics searches, it is crucial to know their performance in terms of experimental sensitivity, understood as the statistical significance of the signal-plus-background hypothesis over the background-only one. We...

Full description

Saved in:
Bibliographic Details
Published in:The European physical journal. C, Particles and fields Particles and fields, 2022-11, Vol.82 (11), p.993-14, Article 993
Main Authors: Arganda, Ernesto, Marcano, Xabier, Lozano, Víctor Martín, Medina, Anibal D., Perez, Andres D., Szewc, Manuel, Szynkman, Alejandro
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Machine-learning techniques have become fundamental in high-energy physics and, for new physics searches, it is crucial to know their performance in terms of experimental sensitivity, understood as the statistical significance of the signal-plus-background hypothesis over the background-only one. We present here a simple method that combines the power of current machine-learning techniques to face high-dimensional data with the likelihood-based inference tests used in traditional analyses, which allows us to estimate the sensitivity for both discovery and exclusion limits through a single parameter of interest, the signal strength. Based on supervised learning techniques, it can perform well also with high-dimensional data, when traditional techniques cannot. We apply the method to a toy model first, so we can explore its potential, and then to a LHC study of new physics particles in dijet final states. Considering as the optimal statistical significance the one we would obtain if the true generative functions were known, we show that our method provides a better approximation than the usual naive counting experimental results.
ISSN:1434-6052
1434-6044
1434-6052
DOI:10.1140/epjc/s10052-022-10944-3