NUBIA: NeUral Based Interchangeability Assessor for Text Generation

Hassan Kane; Muhammed Yusuf Kocyigit; Ali Abdalla; Pelkins Ajanoh,; Mohamed Coulibali

arXiv:2004.14667·cs.CL·May 4, 2020·36 cites

NUBIA: NeUral Based Interchangeability Assessor for Text Generation

Hassan Kane, Muhammed Yusuf Kocyigit, Ali Abdalla, Pelkins Ajanoh,, Mohamed Coulibali

PDF

Open Access 1 Repo

TL;DR

NUBIA is a neural-based framework for automatic text generation evaluation that outperforms existing metrics in correlation with human judgment across multiple tasks, offering a modular and explainable approach.

Contribution

The paper introduces NUBIA, a novel neural-based methodology for automatic evaluation of text generation, emphasizing modularity, explainability, and continuous improvement.

Findings

01

Outperforms current metrics in machine translation and summarization evaluation

02

Matches or exceeds state-of-the-art correlation with human judgments

03

Demonstrates modularity and explainability in evaluation metrics

Abstract

We present NUBIA, a methodology to build automatic evaluation metrics for text generation using only machine learning models as core components. A typical NUBIA model is composed of three modules: a neural feature extractor, an aggregator and a calibrator. We demonstrate an implementation of NUBIA which outperforms metrics currently used to evaluate machine translation, summaries and slightly exceeds/matches state of the art metrics on correlation with human judgement on the WMT segment-level Direct Assessment task, sentence-level ranking and image captioning evaluation. The model implemented is modular, explainable and set to continuously improve over time.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wl-research/nubia
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications