# Clustering-based Source-aware Assessment of True Robustness for Learning   Models

**Authors:** Ozsel Kilinc, Ismail Uysal

arXiv: 1704.00158 · 2017-04-04

## TL;DR

This paper proposes a source-aware clustering framework to accurately assess the true robustness of learning models, addressing limitations of traditional validation methods especially when source labels are unavailable.

## Contribution

It introduces a novel validation framework with source-inclusive and source-exclusive partitions, along with a robustness metric based on source-aware bounds, applicable even without source labels.

## Key findings

- Challenging training scenarios can be constructed on MNIST using the framework.
- The approach enables rigorous model comparison and dataset adequacy evaluation.
- It helps identify less robust classes and optimize data collection efforts.

## Abstract

We introduce a novel validation framework to measure the true robustness of learning models for real-world applications by creating source-inclusive and source-exclusive partitions in a dataset via clustering. We develop a robustness metric derived from source-aware lower and upper bounds of model accuracy even when data source labels are not readily available. We clearly demonstrate that even on a well-explored dataset like MNIST, challenging training scenarios can be constructed under the proposed assessment framework for two separate yet equally important applications: i) more rigorous learning model comparison and ii) dataset adequacy evaluation. In addition, our findings not only promise a more complete identification of trade-offs between model complexity, accuracy and robustness but can also help researchers optimize their efforts in data collection by identifying the less robust and more challenging class labels.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1704.00158/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/1704.00158/full.md

## References

12 references — full list in the complete paper: https://tomesphere.com/paper/1704.00158/full.md

---
Source: https://tomesphere.com/paper/1704.00158