Diagnosing Medical Datasets with Training Dynamics

Laura Wenderoth

arXiv:2411.01653·cs.LG·November 5, 2024

Diagnosing Medical Datasets with Training Dynamics

Laura Wenderoth

PDF

Open Access 1 Repo

TL;DR

This paper investigates the use of training dynamics via Data Maps to evaluate medical datasets, finding that the framework is not suitable for the unique challenges of medical question answering.

Contribution

It assesses the transferability of Data Maps for dataset diagnosis in the medical domain, revealing limitations in medical question answering tasks.

Findings

01

Data Maps framework is unsuitable for medical datasets

02

Medical question answering requires specialized data evaluation methods

03

Training dynamics may not generalize across domains

Abstract

This study explores the potential of using training dynamics as an automated alternative to human annotation for evaluating the quality of training data. The framework used is Data Maps, which classifies data points into categories such as easy-to-learn, hard-to-learn, and ambiguous (Swayamdipta et al., 2020). Swayamdipta et al. (2020) highlight that difficult-to-learn examples often contain errors, and ambiguous cases significantly impact model training. To confirm the reliability of these findings, we replicated the experiments using a challenging dataset, with a focus on medical question answering. In addition to text comprehension, this field requires the acquisition of detailed medical knowledge, which further complicates the task. A comprehensive evaluation was conducted to assess the feasibility and transferability of the Data Maps framework to the medical domain. The evaluation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

laurawenderoth/training-dynamics
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare

MethodsFocus