
TL;DR
This paper introduces a method for analyzing the compatibility of data subsets with the overall dataset and identifying which parameters are influenced by specific data groups, demonstrated through parton distribution function studies.
Contribution
A novel method for data subset compatibility analysis and parameter attribution in large-scale data fitting, applied to parton distribution functions.
Findings
Effective identification of data subset compatibility.
Clear attribution of parameters to specific data groups.
Enhanced understanding of data influence in global fits.
Abstract
The analysis of data sometimes requires fitting many free parameters in a theory to a large number of data points. Questions naturally arise about the compatibility of specific subsets of the data, such as those from a particular experiment or those based on a particular technique, with the rest of the data. Questions also arise about which theory parameters are determined by specific subsets of the data. I present a method to answer both of these kinds of questions. The method is illustrated by applications to recent work on measuring parton distribution functions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
