# Improving olfactory assessment: an item response theory analysis of the American English version of the Sniffin’ sticks identification subtest

**Authors:** Eva Tolomeo, Leognano Ceraudo, Ryann Kolb, Pamela H. Dalton, Marco Tullio Liuzza, Valentina Parma

PMC · DOI: 10.3389/fpsyg.2026.1661164 · Frontiers in Psychology · 2026-01-29

## TL;DR

This study uses Item Response Theory to evaluate and improve the reliability of the Sniffin’ Sticks identification subtest for olfactory assessment.

## Contribution

The study applies IRT to identify low-performing items in the SSET identification subtest, offering a novel approach to refine olfactory testing.

## Key findings

- The 2PL model provided a better fit than the 1PL model for analyzing the SSET identification subtest.
- Three items (leather, turpentine, and pineapple) showed low discrimination, indicating limited effectiveness.
- No sex-related bias was found in item responses based on the DIF analysis.

## Abstract

The Sniffin’ Sticks Extended Test (SSET) is one of the most widely used tools for assessing olfactory function in research and clinical settings. Despite its broad application, a detailed psychometric evaluation of its items, including those within the identification subtest, remains limited. This study aimed to evaluate the reliability, validity, and item-level functioning of the SSET identification subtest using Item Response Theory (IRT), to identify potential weaknesses and propose possible areas for improvement.

The study included 397 US-based participants (60.5% female; mean age 44.61 ± SD = 18.17 [45 ± 18]) who completed the American English version of the identification subtest of the SSET. IRT analyses were conducted using both a one-parameter (1PL) and a two-parameter (2PL) logistic model to estimate item difficulty and discrimination. A Differential Item Functioning (DIF) analysis was also performed to investigate potential sex-related biases in item responses.

Model comparison indicated that the 2PL model provided a better fit than the 1PL model. The 2PL analysis revealed that three items (i.e., leather, turpentine, and pineapple) exhibited low discrimination parameters, suggesting limited utility in distinguishing among different levels of olfactory ability. The DIF analysis found no evidence of differential item performance between male and female participants.

These findings support the use of IRT to identify poorly performing items, enabling the refinement of the SSET, to enhance its precision and reliability across populations. Future research should explore item revisions and extend psychometric evaluations to other subtests and samples.

## Full-text entities

- **Diseases:** OI (OMIM:613848), loss of consciousness (MESH:D014474), Parkinson's disease (MESH:D010300), Alzheimer's disease (MESH:D000544), depressive disorder (MESH:D003866), allergic or infectious rhinitis (MESH:D065631), head injuries (MESH:D006259), hyposmia (MESH:D000086582), Impairments in odor (MESH:D000089083), sinus infections (MESH:D012852), anosmia (MESH:D000857), DIF (MESH:D005547), schizophrenia (MESH:D012559), neurological or psychiatric disorders (MESH:D001523)
- **Chemicals:** turpentine (MESH:D014425)
- **Species:** Homo sapiens (human, species) [taxon 9606], Glycyrrhiza (licorice, genus) [taxon 46347], Allium sativum (garlic, species) [taxon 4682], Mentha x piperita (peppermint, species) [taxon 34256], Ananas comosus (pineapple, species) [taxon 4615]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12863060/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12863060/full.md

## References

47 references — full list in the complete paper: https://tomesphere.com/paper/PMC12863060/full.md

---
Source: https://tomesphere.com/paper/PMC12863060