Reducing Systematic Bias in Machine Learning Applications to Signal Extraction in High-Energy Nuclear Physics
Yan Wang, Rangrong Ma, Kaifeng Shen, Zebo Tang, Wangmei Zha

TL;DR
This paper presents two methods to reduce systematic bias caused by simulation-reality discrepancies in machine learning applications for high-energy nuclear physics data analysis, demonstrated through J/ψ yield measurements.
Contribution
It introduces cumulative distribution function mapping and shift-and-scale techniques to align simulated data with real data, improving ML performance and reducing biases.
Findings
Methods effectively reduce systematic biases in signal extraction.
Application to STAR experiment data demonstrates improved accuracy.
Techniques are adaptable to various high-energy physics analyses.
Abstract
Machine learning techniques are increasingly being applied in high-energy nuclear physics data analysis thanks to their outstanding performance. One key challenge in such applications is the construction of training samples that can accurately represent real data. Training samples are typically generated through detector simulations, but discrepancies between simulated and real data can lead to degradation in machine learning performance and systematic biases in the results. This paper introduces two methods: i) cumulative distribution function mapping and ii) shift-and-scale, to align simulated signals with real data, which can aid in eliminating the aforementioned issues. We use the J/ yield measurement in 200 GeV Ru+Ru and Zr+Zr collisions with the STAR experiment as an example to demonstrate the application and effectiveness of the proposed methods.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHigh-Energy Particle Collisions Research · Nuclear physics research studies · Particle physics theoretical and experimental studies
