Weighted Classification Cascades for Optimizing Discovery Significance   in the HiggsML Challenge

Lester Mackey; Jordan Bryan; Man Yue Mo

arXiv:1409.2655·stat.ML·September 11, 2015·2 cites

Weighted Classification Cascades for Optimizing Discovery Significance in the HiggsML Challenge

Lester Mackey, Jordan Bryan, Man Yue Mo

PDF

Open Access

TL;DR

This paper presents a new iterative method for optimizing discovery significance in high energy physics, using weighted classification and convex duality, validated on the HiggsML challenge data.

Contribution

It introduces a novel minorization-maximization algorithm that links weighted classification error improvements to discovery significance enhancement.

Findings

01

Effective optimization of discovery significance demonstrated

02

Algorithm outperforms baseline methods in HiggsML challenge

03

Theoretical guarantees connect classification error reduction to significance increase

Abstract

We introduce a minorization-maximization approach to optimizing common measures of discovery significance in high energy physics. The approach alternates between solving a weighted binary classification problem and updating class weights in a simple, closed-form manner. Moreover, an argument based on convex duality shows that an improvement in weighted classification error on any round yields a commensurate improvement in discovery significance. We complement our derivation with experimental results from the 2014 Higgs boson machine learning challenge.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParticle physics theoretical and experimental studies · Computational Physics and Python Applications · Particle Detector Development and Performance