An Unsupervised Natural Language Processing Pipeline for Assessing Referral Appropriateness

Vittorio Torri; Annamaria Bottelli; Michele Ercolanoni; Olivia Leoni; Francesca Ieva

arXiv:2501.14701·cs.CL·September 5, 2025

An Unsupervised Natural Language Processing Pipeline for Assessing Referral Appropriateness

Vittorio Torri, Annamaria Bottelli, Michele Ercolanoni, Olivia Leoni, Francesca Ieva

PDF

Open Access

TL;DR

This paper introduces an unsupervised NLP pipeline that accurately assesses the appropriateness of medical referrals from free text, aiding healthcare policy and reducing unnecessary procedures.

Contribution

It presents a novel unsupervised NLP method using Transformer embeddings to evaluate referral appropriateness without labeled data, applicable across different examination types.

Findings

01

High precision and recall in identifying referral reasons.

02

Effective detection of inappropriate referrals and regional variation.

03

Supported policy changes in Lombardy healthcare system.

Abstract

Objective: Assessing the appropriateness of diagnostic referrals is critical for improving healthcare efficiency and reducing unnecessary procedures. However, this task becomes challenging when referral reasons are recorded only as free text rather than structured codes, like in the Italian NHS. To address this gap, we propose a fully unsupervised Natural Language Processing (NLP) pipeline capable of extracting and evaluating referral reasons without relying on labelled datasets. Methods: Our pipeline leverages Transformer-based embeddings pre-trained on Italian medical texts to cluster referral reasons and assess their alignment with appropriateness guidelines. It operates in an unsupervised setting and is designed to generalize across different examination types. We analyzed two complete regional datasets from the Lombardy Region (Italy), covering all referrals between 2019 and 2021…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies