CritiQ: Mining Data Quality Criteria from Human Preferences
Honglin Guo, Kai Lv, Qipeng Guo, Tianyi Liang, Zhiheng Xi, Demin Song, Qiuyinzhe Zhang, Yu Sun, Kai Chen, Xipeng Qiu, Tao Gui

TL;DR
CritiQ is a novel data selection method that automatically mines human preference-based quality criteria, enabling efficient and interpretable data filtering for language models with minimal human annotation.
Contribution
It introduces CritiQ Flow, a system that evolves quality criteria from limited human preferences and leverages a knowledge base for improved data selection in language model training.
Findings
Achieves high accuracy in data quality assessment across multiple domains.
Improves downstream task performance of Llama 3.1 models with selected data.
Demonstrates the interpretability and reusability of verbal quality criteria.
Abstract
Language model heavily depends on high-quality data for optimal performance. Existing approaches rely on manually designed heuristics, the perplexity of existing models, training classifiers, or careful prompt engineering, which require significant expert experience and human annotation effort while introduce biases. We introduce CritiQ, a novel data selection method that automatically mines criteria from human preferences for data quality with only ~30 human-annotated pairs and performs efficient data selection. The main component, CritiQ Flow, employs a manager agent to evolve quality criteria and worker agents to make pairwise judgments. We build a knowledge base that extracts quality criteria from previous work to boost CritiQ Flow. Compared to perplexity- and classifier- based methods, verbal criteria are more interpretable and possess reusable value. After deriving the criteria,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsData Quality and Management · Mobile Crowdsensing and Crowdsourcing · Topic Modeling
