DRIFT: Data Reduction via Informative Feature Transformation- Generalization Begins Before Deep Learning starts

Ben Keslaki

arXiv:2506.19734·cs.LG·June 25, 2025

DRIFT: Data Reduction via Informative Feature Transformation- Generalization Begins Before Deep Learning starts

Ben Keslaki

PDF

Open Access

TL;DR

DRIFT introduces a physics-inspired preprocessing method that transforms input data into a compact, informative feature set, improving deep learning training stability and generalization by emphasizing essential patterns before learning begins.

Contribution

This paper presents DRIFT, a novel data reduction technique based on vibrational analysis, which enhances deep learning performance by preprocessing data into a physically grounded, low-dimensional feature space.

Findings

01

DRIFT achieves competitive accuracy with significantly fewer input features.

02

It improves training stability and robustness against overfitting.

03

DRIFT shows minimal sensitivity to batch size, architecture, and resolution.

Abstract

Modern deep learning architectures excel at optimization, but only after the data has entered the network. The true bottleneck lies in preparing the right input: minimal, salient, and structured in a way that reflects the essential patterns of the data. We propose DRIFT (Data Reduction via Informative Feature Transformation), a novel preprocessing technique inspired by vibrational analysis in physical systems, to identify and extract the most resonant modes of input data prior to training. Unlike traditional models that attempt to learn amidst both signal and noise, DRIFT mimics physics perception by emphasizing informative features while discarding irrelevant elements. The result is a more compact and interpretable representation that enhances training stability and generalization performance. In DRIFT, images are projected onto a low-dimensional basis formed by spatial vibration mode…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and Data Classification · Generative Adversarial Networks and Image Synthesis