TL;DR
AdaQE-CG is a novel framework that enhances web-scale generative AI documentation by dynamically extracting and completing information, significantly improving quality and reproducibility over existing methods.
Contribution
It introduces adaptive query expansion and cross-card knowledge transfer techniques, along with a new benchmark for evaluating GAI documentation quality.
Findings
Outperforms existing approaches in multiple quality metrics.
Exceeds human-authored data cards in quality.
Approaches human-level quality for model cards.
Abstract
Transparent and standardized documentation is essential for building trustworthy generative AI (GAI) systems. However, existing automated methods for generating model and data cards still face three major challenges: (i) static templates, as most systems rely on fixed query templates that cannot adapt to diverse paper structures or evolving documentation requirements; (ii) information scarcity, since web-scale repositories such as Hugging Face often contain incomplete or inconsistent metadata, leading to missing or noisy information; and (iii) lack of benchmarks, as the absence of standardized datasets and evaluation protocols hinders fair and reproducible assessment of documentation quality. To address these limitations, we propose AdaQE-CG, an Adaptive Query Expansion for Card Generation framework that combines dynamic information extraction with cross-card knowledge transfer. Its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
