# Threshold-based exploitation of noisy label in black-box unsupervised domain adaptation

**Authors:** Huiwen Xu, Jaeri Lee, U Kang, Lei Chu, Lei Chu, Lei Chu

PMC · DOI: 10.1371/journal.pone.0321987 · PLOS One · 2025-05-12

## TL;DR

This paper introduces a method to improve model performance when adapting a pre-trained model to a new domain without access to the original model's data or parameters.

## Contribution

The novel TEN method uses thresholding and knowledge distillation to handle noisy labels in black-box unsupervised domain adaptation.

## Key findings

- TEN improves model accuracy by up to 9.49% compared to existing methods.
- Thresholding helps distinguish clean from noisy labels, preserving useful information from the source model.

## Abstract

How can we perform unsupervised domain adaptation when transferring a black-box source model to a target domain? Black-box Unsupervised Domain Adaptation focuses on transferring the labels derived from a pre-trained black-box source model to an unlabeled target domain. The problem setting is motivated by privacy concerns associated with accessing and utilizing source data or source model parameters. Recent studies typically train the target model by mimicking the labels derived from the black-box source model, which often contain noise due to domain gaps between the source and the target. Directly exploiting such noisy labels or disregarding them may lead to a decrease in the model’s performance. We propose Threshold-Based Exploitation of Noisy Predictions (TEN), a method to accurately learn the target model with noisy labels in Black-box Unsupervised Domain Adaptation. To ensure the preservation of information from the black-box source model, we employ a threshold-based approach to distinguish between clean labels and noisy labels, thereby allowing the transfer of high-confidence knowledge from both labels. We utilize a flexible thresholding approach to adjust the threshold for each class, thereby obtaining an adequate amount of clean data for hard-to-learn classes. We also exploit knowledge distillation for clean data and negative learning for noisy labels to extract high-confidence information. Extensive experiments show that TEN outperforms baselines with an accuracy improvement of up to 9.49%.

## Full-text entities

- **Diseases:** TEN (MESH:D019292)
- **Chemicals:** Chu (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12068613/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12068613/full.md

## References

25 references — full list in the complete paper: https://tomesphere.com/paper/PMC12068613/full.md

---
Source: https://tomesphere.com/paper/PMC12068613