Dataset vs Reality: Understanding Model Performance from the Perspective   of Information Need

Mengying Yu; Aixin Sun

arXiv:2212.02726·cs.IR·August 22, 2023

Dataset vs Reality: Understanding Model Performance from the Perspective of Information Need

Mengying Yu, Aixin Sun

PDF

Open Access

TL;DR

This paper emphasizes the importance of understanding the specific information need behind datasets, arguing that models trained on datasets with different underlying information needs may not perform well on real-world problems, and advocates for considering this perspective in dataset creation and model evaluation.

Contribution

The paper introduces the concept of information need as a key factor in dataset design and model performance, using QA and IC tasks as case studies to highlight differences in datasets and their implications.

Findings

01

Datasets for different tasks reflect distinct information needs.

02

Differences in dataset creation processes impact model performance.

03

Considering information need can improve dataset relevance and model applicability.

Abstract

Deep learning technologies have brought us many models that outperform human beings on a few benchmarks. An interesting question is: can these models well solve real-world problems with similar settings (e.g., identical input/output) to the benchmark datasets? We argue that a model is trained to answer the same information need for which the training dataset is created. Although some datasets may share high structural similarities, e.g., question-answer pairs for the question answering (QA) task and image-caption pairs for the image captioning (IC) task, they may represent different research tasks aiming for answering different information needs. To support our argument, we use the QA task and IC task as two case studies and compare their widely used benchmark datasets. From the perspective of information need in the context of information retrieval, we show the differences in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning