# Normalyzing Numeronyms -- A NLP approach

**Authors:** Avishek Garain, Sainik Kumar Mahata, Subhabrata Dutta

arXiv: 1907.13356 · 2019-11-13

## TL;DR

This paper introduces a two-step NLP method utilizing Levenshtein distance and Cosine Similarity to normalize numeronyms, improving human understanding across Bengali and English with over 70% accuracy.

## Contribution

It presents a novel combination of Levenshtein distance and Cosine Similarity for numeronym normalization in multiple languages.

## Key findings

- Achieves 71% accuracy in Bengali
- Achieves 72% accuracy in English
- Effective cross-lingual numeronym normalization

## Abstract

This paper presents a method to apply Natural Language Processing for normalizing numeronyms to make them understandable by humans. We approach the problem through a two-step mechanism. We make use of the state of the art Levenshtein distance of words. We then apply Cosine Similarity for selection of the normalized text and reach greater accuracy in solving the problem. Our approach garners accuracy figures of 71\% and 72\% for Bengali and English language, respectively.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.13356/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/1907.13356/full.md

---
Source: https://tomesphere.com/paper/1907.13356