Improving text classification with vectors of reduced precision

Krzysztof Wr\'obel; Maciej Wielgosz; Marcin Pietro\'n; Micha{\l}; Karwatowski; Aleksander Smywi\'nski-Pohl

arXiv:1706.06363·cs.CL·June 21, 2017·1 cites

Improving text classification with vectors of reduced precision

Krzysztof Wr\'obel, Maciej Wielgosz, Marcin Pietro\'n, Micha{\l}, Karwatowski, Aleksander Smywi\'nski-Pohl

PDF

Open Access 1 Repo

TL;DR

This study investigates how reducing the precision of vector representations in text classification affects accuracy, finding that lower precision can improve performance and reduce computational resources.

Contribution

It demonstrates that floating-point precision reduction in text classification vectors can enhance accuracy and efficiency, a novel insight for hardware-optimized NLP applications.

Findings

01

Precision reduction improves classification accuracy in most cases

02

Reducing from 64 to 4 bits yields optimal results

03

Lower precision decreases memory and computation requirements

Abstract

This paper presents the analysis of the impact of a floating-point number precision reduction on the quality of text classification. The precision reduction of the vectors representing the data (e.g. TF-IDF representation in our case) allows for a decrease of computing time and memory footprint on dedicated hardware platforms. The impact of precision reduction on the classification quality was performed on 5 corpora, using 4 different classifiers. Also, dimensionality reduction was taken into account. Results indicate that the precision reduction improves classification accuracy for most cases (up to 25% of error reduction). In general, the reduction from 64 to 4 bits gives the best scores and ensures that the results will not be worse than with the full floating-point representation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kwrobel-nlp/precision-reduction
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Algorithms and Data Compression · Evolutionary Algorithms and Applications