CIDRe: A Reference-Free Multi-Aspect Criterion for Code Comment Quality Measurement
Maria Dziuba, Valentin Malykh

TL;DR
CIDRe introduces a comprehensive, reference-free metric for assessing code comment quality across multiple aspects, improving dataset curation and model performance.
Contribution
It proposes a novel multi-aspect, language-agnostic criterion for code comment quality measurement that outperforms existing metrics.
Findings
CIDRe achieves better correlation with human judgment.
Models trained on CIDRe-filtered data outperform those trained on unfiltered data.
CIDRe improves cross-entropy evaluation metrics.
Abstract
Effective generation of structured code comments requires robust quality metrics for dataset curation, yet existing approaches (SIDE, MIDQ, STASIS) suffer from limited code-comment analysis. We propose CIDRe, a language-agnostic reference-free quality criterion combining four synergistic aspects: (1) relevance (code-comment semantic alignment), (2) informativeness (functional coverage), (3) completeness (presence of all structure sections), and (4) description length (detail sufficiency). We validate our criterion on a manually annotated dataset. Experiments demonstrate CIDRe's superiority over existing metrics, achieving improvement in cross-entropy evaluation. When applied to filter comments, the models finetuned on CIDRe-filtered data show statistically significant quality gains in GPT-4o-mini assessments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Application Security Vulnerabilities · Natural Language Processing Techniques · Software Engineering Research
