Fine-tuning with HED-IT: The impact of human post-editing for dialogical language models
Daniela Occhipinti, Michele Marchi, Irene Mondella, Huiyuan Lai,, Felice Dell'Orletta, Malvina Nissim, Marco Guerini

TL;DR
This paper investigates how human post-editing of machine-generated dialogues affects the quality of fine-tuned language models, revealing that data quality impacts smaller models more significantly than larger ones.
Contribution
It introduces HED-IT, a large dataset of machine-generated dialogues with human post-edits, and analyzes the effects of data quality on models of different sizes.
Findings
Post-edited dialogues are perceived as higher quality.
Models trained on post-edited data produce different outputs.
Larger models are less affected by data quality variations.
Abstract
Automatic methods for generating and gathering linguistic data have proven effective for fine-tuning Language Models (LMs) in languages less resourced than English. Still, while there has been emphasis on data quantity, less attention has been given to its quality. In this work, we investigate the impact of human intervention on machine-generated data when fine-tuning dialogical models. In particular, we study (1) whether post-edited dialogues exhibit higher perceived quality compared to the originals that were automatically generated; (2) whether fine-tuning with post-edited dialogues results in noticeable differences in the generated outputs; and (3) whether post-edited dialogues influence the outcomes when considering the parameter size of the LMs. To this end we created HED-IT, a large-scale dataset where machine-generated dialogues are paired with the version post-edited by humans.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Topic Modeling
