# Establishing language and ethnic equivalence for health-related quality of life item banks and testing their efficiency via computerised adaptive testing simulations

**Authors:** Yu Heng Kwan, Eva Fenwick, Ying Ying Leung, Warren Fong, Ting Hui Woon, Ling Xiang, Ecosse Lamoureux, Julian Thumboo

PMC · DOI: 10.1371/journal.pone.0298141 · 2024-02-23

## TL;DR

This study finds that the SHAWS health-related quality of life item banks show minimal bias across languages and ethnicities after remedial efforts, and that computerised adaptive testing simulations can assess quality of life efficiently with these banks.

## Contribution

The study establishes language and ethnic equivalence of health-related quality of life item banks and evaluates their efficiency via computerised adaptive testing simulations.

## Key findings

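- Rasch analysis identified differential item functioning (DIF) in 4 Physical Functioning (PF), 9 Positive Mindset (PM) and 2 Social Relationship (SR) items; after remedial efforts, the SHAWS item banks showed minimal DIF for language and ethnicity.
- Computerised adaptive testing simulations showed high efficiency, especially at moderate precision (SEM 0.387), where the mean number of items administered was 5.1, 13.0 and 8.0 for the PF, PM and SR item banks, respectively.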

## Abstract

We aimed to (1) establish linguistic and ethnic equivalence (i.e. lack of bias) for the items in the English and Chinese versions of the Singapore Health and Well Being (SHAWS) Physical Functioning (PF), Positive Mindset (PM) and Social Relationship (SR) item banks (IBs); and (2) evaluate the preliminary efficiency of these IBs using Computer Adaptive Testing (CAT) simulations.

In this cross-sectional study, 671, 670, and 672 subjects answered 55, 48 and 30 items of the PF, PM, and SR IBs, respectively. Rasch analysis was conducted to assess each IB’s psychometric properties, particularly the presence of differential item functioning (DIF) for language and ethnicity. A set of performance criteria for removing items that displayed notable DIF was employed. CAT simulations determined the mean number of items administered at high, moderate, and moderate-low measurement precision (stopping rules: SEM 0.300, 0.387, and 0.521, respectively).
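The stopping rule described above can be illustrated with a minimal CAT simulation sketch. It is not the authors' procedure: the abstract does not describe the item format, item selection rule, or ability estimator, so this sketch assumes a dichotomous Rasch model, selection of the item whose difficulty is closest to the current estimate, and an expected a posteriori (EAP) estimate under a standard normal prior. The bank in the demo is hypothetical, and the names `simulate_cat`, `posterior_mean_sd` and `GRID` are illustrative only.

```python
import math
import random

GRID = [g / 10.0 for g in range(-40, 41)]  # theta grid in logits (assumption)


def rasch_prob(theta, b):
    """Probability of endorsing a dichotomous Rasch item of difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def posterior_mean_sd(log_post):
    """EAP estimate and posterior SD (used here as the SEM) over GRID."""
    top = max(log_post)  # subtract the maximum for numerical stability
    w = [math.exp(v - top) for v in log_post]
    total = sum(w)
    mean = sum(t * wi for t, wi in zip(GRID, w)) / total
    var = sum((t - mean) ** 2 * wi for t, wi in zip(GRID, w)) / total
    return mean, math.sqrt(var)


def simulate_cat(true_theta, bank, sem_stop, rng):
    """Run one simulated CAT; return (items used, estimate, final SEM)."""
    log_post = [-0.5 * t * t for t in GRID]  # standard normal prior (assumption)
    remaining = list(bank)
    theta_hat, sem = posterior_mean_sd(log_post)
    used = 0
    while remaining and sem > sem_stop:
        # Under the Rasch model the most informative item is the one whose
        # difficulty lies closest to the current ability estimate.
        b = min(remaining, key=lambda d: abs(d - theta_hat))
        remaining.remove(b)
        x = rng.random() < rasch_prob(true_theta, b)  # simulated response
        for i, t in enumerate(GRID):
            p = rasch_prob(t, b)
            log_post[i] += math.log(p if x else 1.0 - p)
        theta_hat, sem = posterior_mean_sd(log_post)
        used += 1
    return used, theta_hat, sem


if __name__ == "__main__":
    rng = random.Random(2024)
    bank = [-3.0 + 6.0 * i / 199 for i in range(200)]  # hypothetical item bank
    thetas = [rng.gauss(0.0, 1.0) for _ in range(100)]  # simulated respondents
    for sem_stop in (0.300, 0.387, 0.521):
        counts = [simulate_cat(t, bank, sem_stop, rng)[0] for t in thetas]
        print(sem_stop, sum(counts) / len(counts))
```

Because dichotomous items carry less information than multi-category items, the item counts this sketch prints are not comparable to those reported in the study; it only shows how a stricter SEM threshold lengthens the test.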

Approximately half of subjects were >50 years old (40.9% PF, 42.1% PM, 41.4% SR), Chinese (50.7% PF, 51.0% PM, 50.6% SR) and female (50.0% PF, 49.4% PM, 52.8% SR). Rasch analysis revealed 4 items with DIF for the PF IB, 9 items with DIF for the PM IB and 2 items with DIF for the SR IB. In CAT simulations, the mean number of items administered for the PF, PM and SR IBs, respectively, was 8.5, 21.6 and 14.5 at SEM 0.300; 5.1, 13.0 and 8.0 at SEM 0.387; and 3.1, 5.3 and 4.1 at SEM 0.521.
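To put these counts in perspective, the short calculation below expresses each mean CAT length as a share of the items answered in the study (55, 48 and 30). Two assumptions are worth flagging: the abstract does not state the final bank sizes after DIF-related item removal, so the denominators may be slightly too large, and the reliability figure uses the common approximation reliability ≈ 1 − SEM² for a latent scale with unit variance, which the abstract does not itself report. The names `share_administered` and `approx_reliability` are illustrative.

```python
# Items answered per item bank in the study (from the abstract).
ITEMS_ANSWERED = {"PF": 55, "PM": 48, "SR": 30}

# Mean number of items administered in CAT simulations, by SEM stopping rule.
MEAN_CAT_ITEMS = {
    0.300: {"PF": 8.5, "PM": 21.6, "SR": 14.5},
    0.387: {"PF": 5.1, "PM": 13.0, "SR": 8.0},
    0.521: {"PF": 3.1, "PM": 5.3, "SR": 4.1},
}


def share_administered(sem_stop, bank):
    """Mean CAT length as a fraction of the items answered for that bank."""
    return MEAN_CAT_ITEMS[sem_stop][bank] / ITEMS_ANSWERED[bank]


def approx_reliability(sem):
    """Approximate reliability, assuming a unit-variance latent scale."""
    return 1.0 - sem ** 2


if __name__ == "__main__":
    for sem_stop, by_bank in MEAN_CAT_ITEMS.items():
        shares = {b: round(share_administered(sem_stop, b), 3) for b in by_bank}
        print(sem_stop, round(approx_reliability(sem_stop), 2), shares)
```

At SEM 0.387, for example, the PF simulation used on average fewer than one in ten of the 55 items answered, whereas the PM and SR simulations used roughly a quarter of theirs.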

The PF, PM and SR IBs for measuring health-related quality of life showed minimal DIF for language and ethnicity after remedial efforts. CAT simulations demonstrated that these IBs were efficient, especially when the stopping rule was set at moderate precision, supporting the implementation of the SHAWS IBs in routine clinical care.

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC10890744/full.md

---
Source: https://tomesphere.com/paper/PMC10890744