# An arithmetic method algorithm optimizing k-nearest neighbors compared to regression algorithms and evaluated on real world data sources

**Authors:** Theodoros Anagnostopoulos, Evanthia Zervoudi, Christos Anagnostopoulos, Apostolos Christopoulos, Bogdan Wierzbinski

PMC · DOI: 10.1038/s41598-025-33966-9 · 2026-01-07

## TL;DR

This paper introduces an optimized k-NN regression algorithm using an arithmetic method, showing it performs as well or better than traditional methods on real-world data.

## Contribution

A novel Arithmetic Method Regression (AMR) algorithm is proposed as an optimized version of k-NN.

## Key findings

- The AMR algorithm performs comparably to or better than other regression algorithms on real-world data.
- AMR outperforms the traditional k-NN in most cases.
- The introduced arithmetic method enhances the efficiency of the k-NN algorithm.

## Abstract

Linear regression analysis focuses on predicting a numeric regressand value based on certain regressor values. In this context, k-Nearest Neighbors (k-NN) is a common non-parametric regression algorithm, which achieves efficient performance when compared with other algorithms in literature. In this research effort an optimization of the k-NN algorithm is proposed by exploiting the potentiality of an introduced arithmetic method, which can provide solutions for linear equations involving an arbitrary number of real variables. Specifically, an Arithmetic Method Algorithm (AMA) is adopted to assess the efficiency of the introduced arithmetic method, while an Arithmetic Method Regression (AMR) algorithm is proposed as an optimization of k-NN adopting the potentiality of AMA. Such algorithm is compared with other regression algorithms, according to an introduced optimal inference decision rule, and evaluated on certain real world data sources, which are publicly available. Results are promising since the proposed AMR algorithm has comparable performance with the other algorithms, while in most cases it achieves better performance than the k-NN. The output results indicate that introduced AMR is an optimization of k-NN.

The online version contains supplementary material available at 10.1038/s41598-025-33966-9.

## Full-text entities

- **Genes:** DHFR (dihydrofolate reductase) [NCBI Gene 1719] {aka DHFR1, DYR}, MFSD11 (major facilitator superfamily domain containing 11) [NCBI Gene 79157] {aka ET}, INS (insulin) [NCBI Gene 3630] {aka IDDM, IDDM1, IDDM2, ILPR, IRDN, MODY10}, ACKR5 (atypical chemokine receptor 5) [NCBI Gene 11318] {aka 7TMR, ADMR, AM-R, AMR, G10D, GPR182}
- **Diseases:** stroke (MESH:D020521), infection (MESH:D007239), PBC (MESH:D008105), squamous carcinoma in the mouth and throat (MESH:D000077195), tumor (MESH:D009369), ovarian cancer (MESH:D010051), Lung Cancer (MESH:D008175), asthma (MESH:D001249), base deficit (MESH:D019292), metastases (MESH:D009362), depression (MESH:D003866), spinal cord injury (MESH:D013119), insulin-dependent diabetes mellitus (MESH:D003922), birth defect (MESH:D000014), heart attack (MESH:D009203), breast mass (MESH:D061325), AMA (MESH:D007859), ET (MESH:D000377), heart diseases (MESH:D006331), breast cancer (MESH:D001943), Parkinson's disease (MESH:D010300)
- **Chemicals:** glucose (MESH:D005947), AMA (-), trimethoprim (MESH:D014295), lithium (MESH:D008094), D-penicillamine (MESH:D010396), pyrimidines (MESH:D011743), cholesterol (MESH:D002784), blood glucose (MESH:D001786), oil (MESH:D009821)
- **Species:** Nicotiana tabacum (American tobacco, species) [taxon 4097], Homo sapiens (human, species) [taxon 9606]

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12852706/full.md

---
Source: https://tomesphere.com/paper/PMC12852706