Semi-Supervised Learning for Sparsely-Labeled Sequential Data:   Application to Healthcare Video Processing

Florian Dubost; Erin Hong; Nandita Bhaskhar; Siyi Tang; Daniel Rubin,; Christopher Lee-Messer

arXiv:2011.14101·cs.CV·October 5, 2022

Semi-Supervised Learning for Sparsely-Labeled Sequential Data: Application to Healthcare Video Processing

Florian Dubost, Erin Hong, Nandita Bhaskhar, Siyi Tang, Daniel Rubin,, Christopher Lee-Messer

PDF

Open Access 1 Repo

TL;DR

This paper introduces a semi-supervised learning approach for event detection in sequential data with sparse labels, demonstrating improved performance on video datasets and a real-world epilepsy application.

Contribution

The paper proposes a novel semi-supervised training strategy that leverages noisy end time estimates to enhance event detection in sparsely labeled sequential data.

Findings

01

Neural networks improve detection with more data despite label noise.

02

Risk-tolerant strategy outperforms conservative estimates by up to 30 points in mean average precision.

03

Method achieves comparable performance to fully supervised models in epilepsy video analysis.

Abstract

Labeled data is a critical resource for training and evaluating machine learning models. However, many real-life datasets are only partially labeled. We propose a semi-supervised machine learning training strategy to improve event detection performance on sequential data, such as video recordings, when only sparse labels are available, such as event start times without their corresponding end times. Our method uses noisy guesses of the events' end times to train event detection models. Depending on how conservative these guesses are, mislabeled samples may be introduced into the training set. We further propose a mathematical model for explaining and estimating the evolution of the classification performance for increasingly noisier end time estimates. We show that neural networks can improve their detection performance by leveraging more training data with less conservative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fpgdubost/CIFAR-10-Sparsely-Labeled-Sequential-Data
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Anomaly Detection Techniques and Applications