# Poster Session II - A294 DEVELOPMENT OF A MACHINE LEARNING MODEL FOR THE USE OF BLOOD BIOMARKERS TO PREDICT IBD PRESENCE AND ACTIVITY

**Authors:** K R Fine, D Mulder, E Lehman, P Briand, J Britton

PMC · DOI: 10.1093/jcag/gwaf042.293 · 2026-02-13

## TL;DR

This study developed a machine learning model using blood biomarkers to predict the presence and activity of inflammatory bowel disease (IBD), offering a non-invasive alternative to colonoscopies.

## Contribution

The novel contribution is a machine learning model that classifies IBD status using routine blood tests, potentially reducing the need for invasive procedures.

## Key findings

- The random forest model achieved 73% accuracy in distinguishing between healthy, active IBD, and remission IBD samples.
- The model showed 80% accuracy in differentiating between IBS and active IBD samples.
- Platelets, hemoglobin, and neutrophils were the most important features for the model's predictions.

## Abstract

Inflammatory bowel disease (IBD) is a lifelong condition involving a complex interaction of immune mediators. The current gold standard evaluation includes invasive testing like colonoscopies. Underserved communities can face significant barriers to accessing colonoscopies and other common imaging procedures. Remote rural and indigenous communities can be far from the nearest hospital that possesses imaging equipment, requiring patients of all ages be transported outside their own communities. While IBD is multivariate, it is known that white blood cells (WBCs) and inflammation-related proteins play a role in IBD pathology, and the levels of these biomarkers can be measured using a blood test. The effects of each individual biomarker and their interactions between different disease states can be tracked and analyzed using machine learning (ML), and by applying the results of this analysis to new patient data it is possible to predict the IBD activity status of these new patients using a simple blood test that can be performed cheaply and in local health centers.

Create an ML model that uses inflammation-associated biomarker levels to differentiate between healthy patients, those with active IBD, and those with IBD in remission, as while as between patients with Inflamatory Bowel Disease and active IBD and diagnose patients into these classes.

1458 bloodwork measurements from 108 IBD patients were included in this analysis. Disease activity was classified by physician global assessment score. A variety of ML and deep learning (DL) models, including random forest, support vector machine, multilayer perceptron, and keras were trained and evaluated for diagnosis and classification.

The optimal model, random forest, achieved an accuracy of 73% and a specificity of 87% when distinguishing between healthy, active IBD, and remission IBD samples. It achevied an accuracy of 80% and a specificity of 88% when distinguishing between IBS and active IBD samples. The features with the highest relative importance to the model were platelets, hemoglobin, and neutrophils.

An ML model can be created to diagnose and track IBD efficiently and non-invasively, presenting a promising proof-of-principle for the ML-facilitated use of routine bloodwork for the diagnosis of complex immunologic conditions. These findings, once validated, could save unnecessary invasive testing and health care costs.

A294 Table 1: Model accuracy differentiating control, active, and remission samples

Funding Agencies: Queen’s University and Kingston General Hospital

## Linked entities

- **Diseases:** inflammatory bowel disease (MONDO:0005265), IBD (MONDO:0005265), IBS (MONDO:0005052)

---
Source: https://tomesphere.com/paper/PMC12901734