Bringing Balance to Hand Shape Classification: Mitigating Data Imbalance Through Generative Models

Gaston Gustavo Rios; Pedro Dal Bianco; Franco Ronchetti; Facundo Quiroga; Oscar Stanchi; Santiago Ponte Ah\'on; Waldo Hasperu\'e

arXiv:2507.17008·cs.CV·August 21, 2025

Bringing Balance to Hand Shape Classification: Mitigating Data Imbalance Through Generative Models

Gaston Gustavo Rios, Pedro Dal Bianco, Franco Ronchetti, Facundo Quiroga, Oscar Stanchi, Santiago Ponte Ah\'on, Waldo Hasperu\'e

PDF

TL;DR

This paper enhances sign language handshape classification by using GAN-based data augmentation to address dataset imbalance, leading to improved accuracy and generalization across datasets.

Contribution

It introduces a novel data augmentation approach using ReACGAN and SPADE GANs to improve handshape classification accuracy on unbalanced datasets.

Findings

01

Achieved a 5% accuracy improvement on the RWTH dataset.

02

Demonstrated cross-dataset generalization with pose-based generation.

03

Outperformed previous methods in handling data imbalance.

Abstract

Most sign language handshape datasets are severely limited and unbalanced, posing significant challenges to effective model training. In this paper, we explore the effectiveness of augmenting the training data of a handshape classifier by generating synthetic data. We use an EfficientNet classifier trained on the RWTH German sign language handshape dataset, which is small and heavily unbalanced, applying different strategies to combine generated and real images. We compare two Generative Adversarial Networks (GAN) architectures for data generation: ReACGAN, which uses label information to condition the data generation process through an auxiliary classifier, and SPADE, which utilizes spatially-adaptive normalization to condition the generation on pose information. ReACGAN allows for the generation of realistic images that align with specific handshape labels, while SPADE focuses on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.