Benchmarking Popular Classification Models' Robustness to Random and   Targeted Corruptions

Utkarsh Desai; Srikanth Tamilselvam; Jassimran Kaur; Senthil Mani,; Shreya Khare

arXiv:2002.00754·cs.CL·February 4, 2020·1 cites

Benchmarking Popular Classification Models' Robustness to Random and Targeted Corruptions

Utkarsh Desai, Srikanth Tamilselvam, Jassimran Kaur, Senthil Mani,, Shreya Khare

PDF

Open Access 1 Repo

TL;DR

This paper extends benchmark datasets with natural corruptions like spelling errors and noise, and compares model robustness to these corruptions using LIME, revealing vulnerabilities especially under targeted attacks.

Contribution

It introduces publicly available corrupted datasets for robustness testing and analyzes model vulnerabilities with targeted versus random corruptions.

Findings

01

Targeted corruptions expose more vulnerabilities than random ones.

02

Models are vulnerable to natural corruptions like spelling errors and noise.

03

Extended datasets improve robustness evaluation.

Abstract

Text classification models, especially neural networks based models, have reached very high accuracy on many popular benchmark datasets. Yet, such models when deployed in real world applications, tend to perform badly. The primary reason is that these models are not tested against sufficient real world natural data. Based on the application users, the vocabulary and the style of the model's input may greatly vary. This emphasizes the need for a model agnostic test dataset, which consists of various corruptions that are natural to appear in the wild. Models trained and tested on such benchmark datasets, will be more robust against real world data. However, such data sets are not easily available. In this work, we address this problem, by extending the benchmark datasets along naturally occurring corruptions such as Spelling Errors, Text Noise and Synonyms and making them publicly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

constraint-solvers/benchmark-corruptions
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImbalanced Data Classification Techniques · Benford’s Law and Fraud Detection · Misinformation and Its Impacts

MethodsTest