# Unbiased machine learning-assisted approach for conditional discretization of human performances

**Authors:** Thepparit Banditwattanawong, Masawee Masdisornchote

PMC · DOI: 10.7717/peerj-cs.2804 · 2025-04-21

## TL;DR

This paper introduces new machine learning methods to fairly categorize human performance rankings under specific conditions.

## Contribution

Proposes four novel methods for conditional performance discretization using machine learning and a heuristic approach.

## Key findings

- Machine-learning-based methods outperformed the heuristic approach in most datasets.
- The heuristic method showed strong performance on a specific dataset with high conditional unbiasedness.
- The multi-modal approach effectively combines methods for better conditional discretization.

## Abstract

Performance discretization maps numerical performance values to ordinal categories or performance ranking labels. Norm-referenced performance discretization is extensively applied in human performance evaluation such as grading academic achievements and determining salary increases for employees. These tasks stipulate a common condition that certain performance ranking labels might have no associated performance values and are referred to as conditional discretization. Currently, the only statistical method available for norm-referenced performance discretization is Z score, which merely addresses partial conditions. To achieve a fully conditionally norm-referenced performance discretization, this article proposes four novel approaches enlisting a multi-modal technique that incorporates unsupervised machine-learning algorithms and a heuristic method as well as a novel decision function ensuring conditional unbiasedness. The machine-learning-based methods demonstrate superiority over the heuristic one across most testing data sets, achieving a conditional unbiasedness degree ranging from 0.11 to 0.82. On the other hand, the heuristic method notably outperforms for a specific data set, exhibiting a conditional unbiasedness degree up to 0.76. Leveraging the strengths of these constituent methods enable the effectiveness of the proposed multi-modal approach for conditionally norm-referenced performance discretization.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

50 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12190627/full.md

---
Source: https://tomesphere.com/paper/PMC12190627