Improving text classification with vectors of reduced precision
Krzysztof Wr\'obel, Maciej Wielgosz, Marcin Pietro\'n, Micha{\l}, Karwatowski, Aleksander Smywi\'nski-Pohl

TL;DR
This study investigates how reducing the precision of vector representations in text classification affects accuracy, finding that lower precision can improve performance and reduce computational resources.
Contribution
It demonstrates that floating-point precision reduction in text classification vectors can enhance accuracy and efficiency, a novel insight for hardware-optimized NLP applications.
Findings
Precision reduction improves classification accuracy in most cases
Reducing from 64 to 4 bits yields optimal results
Lower precision decreases memory and computation requirements
Abstract
This paper presents the analysis of the impact of a floating-point number precision reduction on the quality of text classification. The precision reduction of the vectors representing the data (e.g. TF-IDF representation in our case) allows for a decrease of computing time and memory footprint on dedicated hardware platforms. The impact of precision reduction on the classification quality was performed on 5 corpora, using 4 different classifiers. Also, dimensionality reduction was taken into account. Results indicate that the precision reduction improves classification accuracy for most cases (up to 25% of error reduction). In general, the reduction from 64 to 4 bits gives the best scores and ensures that the results will not be worse than with the full floating-point representation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Algorithms and Data Compression · Evolutionary Algorithms and Applications
