Artificial Intelligence for the Diagnosis of Respiratory Diseases in Dogs and Cats: A Systematic Review

Franklin Parrales-Bravo; Janio Jadán-Guerrero; Katherine Medina-Castro; Rosangela Caicedo-Quiroz

PMC · DOI:10.3390/vetsci13020163·February 7, 2026

Artificial Intelligence for the Diagnosis of Respiratory Diseases in Dogs and Cats: A Systematic Review

Franklin Parrales-Bravo, Janio Jadán-Guerrero, Katherine Medina-Castro, Rosangela Caicedo-Quiroz

PDF

Open Access

TL;DR

This paper reviews how AI can help diagnose breathing problems in dogs and cats by analyzing sounds, X-rays, and other data, but notes challenges like limited data sharing.

Contribution

The study systematically evaluates recent AI applications in veterinary respiratory diagnostics and identifies key technical and practical barriers to adoption.

Findings

01

AI models like CNNs and transformers show high accuracy in detecting conditions such as cardiomegaly and BOAS in pets.

02

Data scarcity and lack of standardized datasets hinder broader implementation of AI in veterinary diagnostics.

03

Multimodal approaches combining audio and imaging data offer promising results for respiratory disease detection.

Abstract

Diagnosing breathing problems in dogs and cats is often difficult because traditional methods rely heavily on a veterinarian’s personal judgment and experience. This review examines how artificial intelligence—computer systems that can learn from data—can help to support the detection of these illnesses more reliably. We analyzed 24 recent studies where artificial intelligence (AI) was used in three ways: listening to breathing sounds, reading chest X-rays and scans, and combining different kinds of data like those from sound and movement sensors. The results show that AI can spot serious conditions like heart enlargement and lung diseases with high accuracy. However, wider use is limited by a lack of shared animal health data and real-world testing in clinics. Overall, AI offers great promise to support veterinarians in making quicker, more consistent diagnoses, leading to better care…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Figures1

Click any figure to enlarge with its caption.

Funding3

—Universidad de Guayaquil
—Universidad Bolivariana del Ecuador
—Universidad Tecnológica Indoamérica

Keywords

assistance toolspathologypetschest X-raysinternal medicine

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPhonocardiography and Auscultation Techniques · COVID-19 diagnosis using AI · Veterinary Oncology Research

Full text

1. Introduction

Respiratory diseases are one of the leading causes of veterinarian visits in pets, especially dogs and cats [1], with conditions such as bronchitis, pneumonia, feline asthma, and brachycephalic obstructive airway syndrome (BOAS) representing a significant clinical burden in companion animal practice [2]. Difficulties in interpretation arise from several factors, including the subtle and often overlapping nature of respiratory sounds—such as distinguishing between wheezes, crackles, and stertor—which can indicate different underlying conditions like feline asthma, pneumonia, or Brachycephalic Obstructive Airway Syndrome (BOAS) [3]. Additionally, many respiratory conditions exhibit clinical signs similar to those of diseases affecting other organ systems, particularly cardiovascular disorders; for example, cardiogenic pulmonary edema can mimic radiographic and auscultatory findings of pneumonia, complicating accurate and timely diagnosis without advanced imaging or multimodal assessment [4,5]. These diseases present a significant challenge for diagnosis due to clinical variability and the reliance on professional experience in the interpretation of traditional tests [6].

Detection of these pathologies in a timely manner is essential to prevent disease progression, reduce complications, and improve animal welfare [1]; however, techniques such as clinical auscultation and radiographic interpretation can exhibit high variability between observers, particularly when abnormalities are subtle or in their initial stages [6,7]. For example, studies such as those by Banzato et al. [8] and Dumortier et al. [9] highlight how subjective interpretation of thoracic radiographs—even among experienced clinicians—can lead to inconsistent diagnoses of conditions like alveolar patterns or pulmonary abnormalities. Similarly, Oren et al. [10] note that traditional auscultation-based assessment of respiratory sounds in brachycephalic dogs is heavily dependent on examiner expertise, often resulting in diagnostic inconsistency. This inter-observer variability underscores the critical need for more objective, AI-supported tools to support detection in veterinary respiratory medicine. Given these constraints, there is a pressing need to systematically evaluate the current state of artificial intelligence (AI) applications [11] specifically tailored to veterinary respiratory diagnostics. Although several reviews have explored AI in veterinary medicine in general, few have focused on the early detection of respiratory diseases in dogs and cats—a critical gap given the clinical prevalence and diagnostic complexity of these conditions [1,12]. A focused review can help identify modality-specific advances, benchmark performance, and highlight translational challenges unique to companion animals.

AI has played a significant role as a tool to support medical diagnosis in pets through automated analysis of images and physiological signals [13,14,15]. In human medicine, numerous studies have demonstrated that machine learning and deep learning techniques can achieve performance comparable to or better than that of specialists in the detection of respiratory diseases [16,17,18]. In contrast, in the field of veterinary medicine, the application of these AI technologies is still limited due to the scarcity of large, well-annotated datasets [19,20], the anatomical and pathological diversity between multiple species [14,21], and the significant economic and infrastructural constraints of most veterinary practices [19]. In fact, unlike human healthcare, where data collection is often standardized and supported by major institutions, veterinary data is frequently fragmented between clinics with varying equipment and record-keeping practices [22]. This creates a substantial bottleneck for training robust generalizable models [23]. However, pioneering research is beginning to emerge that focuses on conditions such as canine pulmonary fibrosis or feline asthma, offering a promising glimpse into a future where AI-powered tools could become vital assistants in the veterinary clinic, helping to make earlier and more accurate diagnoses for our animal companions [14,24].

The purpose of this systematic review is to synthesize and critically evaluate the existing literature on artificial intelligence (AI) applications to support the detection of respiratory diseases in dogs and cats, with a specific focus on three diagnostic modalities:

Audio-based approaches (e.g., respiratory sounds and vocalizations);
Image-based methods (e.g., chest radiographs, CT scans);
Multimodal integrations (e.g., combining audio, video, and sensor data). Moreover, given the rapid evolution of deep learning architectures and the marked increase in veterinary AI publications due to COVID-19 pandemic [25], this review focuses on studies published from 2019 onward. This review aims to identify the types of clinical data used, the AI techniques most frequently employed, and the reported diagnostic performance, thereby highlighting the main opportunities and challenges for future research and clinical implementation in veterinary medicine.

This systematic review addresses the following research questions (RQs) for studies grouped by each analytical approach (audio-based, image-based, and multimodal AI):

RQ1: What are the techniques used? What artificial intelligence (AI) and machine learning (ML) techniques—such as convolutional neural networks (CNNs), recurrent neural networks (RNNs), transformers, ensemble methods, or signal processing methods—are employed in audio-based, image-based, and multimodal diagnostic approaches for the detection of respiratory diseases in dogs and cats?
RQ2: What are the key findings? What are the primary diagnostic performance outcomes (e.g., accuracy, sensitivity, specificity, AUC-ROC) reported in studies using each approach? What respiratory conditions (e.g., BOAS, cardiomegaly, alveolar patterns) are most commonly detected, and how reliable are these AI models in veterinary settings?
RQ3: What are the veterinary clinical implications? How can AI-driven tools enhance veterinary practice in terms of diagnostic accuracy, workflow efficiency, detection, remote monitoring, and clinical decision support? What are the potential impacts on animal welfare, treatment outcomes, and veterinary resource allocation?

These questions systematically guide the evaluation of AI applications in veterinary respiratory diagnostics, ensuring a structured analysis of techniques, outcomes, implications, alignment with existing literature, and critical assessment of each diagnostic modality.

The remaining sections of the paper are structured as follows: Section 2 provides a review of related literature, positioning our work within the broader context of AI applications in veterinary medicine. Section 3 details the systematic methodology employed for study selection and data synthesis. Section 4 presents the findings categorized by diagnostic modality—audio-based, image-based, and multimodal AI approaches. Section 5 discusses the implications, challenges, and future directions of AI in veterinary respiratory diagnostics. Section 6 outlines the practical implications for both research and clinical practice. Section 7 acknowledges the limitations of this review and suggests future perspectives. Lastly, Section 8 encompasses concluding remarks, summarizing the potential and current constraints of AI in supporting the detection of respiratory diseases in dogs and cats.

2. Related Work

This section provides a comprehensive overview of existing literature on artificial intelligence in veterinary sciences, with a particular focus on diagnostic applications.

In Table 1, we present a summary of relevant review works on the topic of AI in veterinary sciences conducted in the last 5 years, indicating for each the main goal, findings, and differences in their approach compared to our study, which is narrowly focused on the application of AI for the detection of respiratory diseases specifically in dogs and cats, primarily using chest radiographs and respiratory sounds as data sources.

The studies summarized in Table 1 collectively aim to map, evaluate, and advance the integration of AI across the diverse landscape of veterinary medicine. Their goals range from providing comprehensive overviews of AI’s potential applications—spanning clinical practice, biomedical research, public health, and administration [19,25,31]—to offering practical educational guides for veterinary practitioners [28]. Several reviews focus specifically on diagnostic imaging, analyzing AI’s role in enhancing detection, classification, and segmentation across various modalities like radiology and ultrasound [13,27,30], while others adopt a broader, holistic perspective to explore AI’s transformative prospects in diagnostics, predictive medicine, personalized treatment, and drug development [14,19,31]. A subset compares AI applications between human and veterinary medicine [26] or assesses the feasibility and ethical considerations of AI deployment in clinical settings [27,30]. Despite their varied scopes, a common thread is the intention to synthesize existing knowledge, identify current trends and challenges—such as data scarcity, methodological heterogeneity, and the need for human oversight—and ultimately chart a path toward more effective, efficient, and ethically sound AI tools that support, rather than replace, veterinary professionals.

3. Methodology

This literature review was conducted using a systematic approach to identify peer-reviewed journal articles published between 2019 and 2025 that focus on the application of artificial intelligence (AI) for the detection of respiratory diseases in pets, particularly dogs and cats. The decision to restrict the review to studies published from 2019 onward is grounded in the rapid evolution of computer-aided veterinary diagnostics during this period due to COVID-19 [25]. In fact, the COVID-19 pandemic spurred increased research interest in AI-driven respiratory health monitoring, further expanding the volume and quality of relevant publications from 2019 onward. By focusing on this timeframe, this review ensures that the included works reflect contemporary applications, and clinical relevance essential for current and future AI applications in veterinary practice.

It is important to mention that the final report followed the PRISMA-ScR (Preferred Reporting Items for Systematic reviews and Meta-Analyses extension for Scoping Reviews) guidelines [32]. The protocol was pre-registered in the Open Science Framework (OSF) portal (Available in https://osf.io/tqbwd/overview?view_only=eb8578d80ef745cbbeb9d370ee8bf800) (accessed on 23 December 2025).

3.1. Databases and Search Strategy

Multiple electronic databases were searched, including PubMed, Scopus, IEEE Xplore, and Web of Science. They were selected for their coverage in the fields of veterinary medicine, biomedical engineering, and artificial intelligence applied to health. In addition, we have considered Google Scholar because it provides a comprehensive and broad-reaching search of scholarly literature across multiple disciplines and sources, including peer-reviewed papers, theses, books, and conference proceedings, which helps to capture a wider range of potentially relevant studies that may not be indexed in more specialized databases.

Table 2 presents the company names and addresses (city, country) of the databases and software used. The search strategy was designed to systematically identify studies applying artificial intelligence to support the detection of respiratory diseases in dogs and cats. Reflecting the three core diagnostic modalities under review—audio-based, image-based, and multimodal AI—the strategy incorporated targeted keywords and Boolean operators for each approach. For audio-based diagnostics, search terms included “respiratory sounds,” “cough detection,” and “lung sounds.” For image-based diagnostics, terms such as “thoracic radiograph,” “lung ultrasound,” and “CT scan” were combined with AI-specific terms like “CNN,” “ResNet,” and “U-Net.” For multimodal approaches, keywords included “multimodal AI,” “combined model,” and “fusion model” to capture integrative studies. The searches were adapted to the specific syntax of each database and were limited to articles published between 2019 and 2025, a period that coincides with the consolidation of deep learning in diagnostic veterinary applications. Table 3 presents the number of articles retrieved using each search strategy applied.

3.2. Inclusion Criteria

For the present study, we selected articles that met the following criteria:

Studies published between 2019 and 2025.
Original scientific articles published in indexed journals or peer-reviewed conference proceedings.
Studies focused on dogs or cats.
Research applying artificial intelligence techniques to analyze thoracic radiographs, respiratory sounds, or clinical data related to the respiratory system.
Studies reporting diagnostic performance metrics, such as accuracy, sensitivity, specificity, or AUC-ROC.
Publications available in English or Spanish.

3.3. Exclusion Criteria

In this study, we discarded the articles that contained any of the following points:

Studies focused exclusively on human medicine without application or extrapolation to the veterinary field.
Studies addressing non-respiratory pathologies.
Studies using computational techniques without applying artificial intelligence or machine learning.
Opinion pieces, editorials, abstracts without full text, or duplicate documents.
Research without a clear description of the methodology or without a report of diagnostic results.
Articles not written in English or Spanish.

3.4. Data Extraction and Synthesis

Figure 1 presents the flowchart of study selection according to the PRISMA guidelines [32]. The selection process was carried out in several stages. Initially, the records were identified by searching the selected databases. Subsequently, duplicates were removed and a preliminary screening was performed based on the title and abstract. The potentially relevant studies were then assessed by reading the full text to determine their final eligibility.

4. Results

The literature search yielded a total of 558 potential articles; following a detailed screening process (as shown in Figure 1), 24 studies met the inclusion criteria, with 5, 13 and 6 related to audio-based, image-based and multimodal-based diagnostics, respectively. The following sections summarize the key findings, grouping the works by each diagnostic approach employed.

4.1. Audio-Based Diagnostic

This section focuses exclusively on research that uses respiratory sounds or vocalizations as primary data for the detection and assessment of conditions such as Brachycephalic Obstructive Airway Syndrome (BOAS) in dogs. Table 4 (and its continuation) summarizes the selected studies in the domain of audio-based AI diagnostics for respiratory diseases in commonly companion animals, specifically, Cats and Dogs. The included studies employ a range of AI techniques—from convolutional and recurrent neural networks to ensemble classifiers and signal processing methods—to analyze audio recordings captured via electronic stethoscopes or other acoustic sensors. Each entry outlines the study’s goal, sample characteristics, data type, AI methodology, key performance metrics, and veterinary implications, thus providing a structured overview of how audio-driven AI is currently being applied and validated in veterinary respiratory diagnostics.

Based on the studies summarized in Table 4, the application of AI for audio-based diagnosis of respiratory diseases in veterinary medicine has shown promising results, particularly in the detection and assessment of conditions such as Brachycephalic Obstructive Airway Syndrome (BOAS) in dogs. These studies utilize various AI techniques, including convolutional neural networks (CNNs), recurrent neural networks (RNNs), ensemble models, and signal processing methods, to analyze respiratory sounds and vocalizations. For instance, Karaslan et al. [33] developed a fully automatic voice analysis system capable of classifying dog vocalizations with up to 90% accuracy, using CNNs with features such as MFCC and STFT. This system enables automated behavioral and health monitoring, supporting stress and emotion assessment in clinical settings. Similarly, McDonald et al. [34] employed an RNN with GRU and attention mechanisms to detect stertor in brachycephalic dogs, achieving an AUC of 0.85, which facilitates accessible and objective BOAS screening through electronic stethoscopes and potential smartphone applications.

4.2. Image-Based Diagnostic

This section focuses on studies that employ thoracic imaging data—primarily radiographs, but also CT scans and ultrasound—as the main input for AI-driven diagnosis of respiratory conditions in dogs and cats. The works summarized in Table 5 utilize deep learning architectures such as convolutional neural networks (CNNs), ResNet, DenseNet, U-Net, and transformer-based models to perform tasks including classification, segmentation, and quality assessment of thoracic images. These approaches aim to support veterinarians by automating the detection of patterns such as alveolar infiltrates, pleural effusion, cardiomegaly, and pulmonary masses, as well as by providing objective measurements like Vertebral Heart Size (VHS) and Cardiothoracic Ratio (CTR). The table outlines each study’s objective, sample characteristics, imaging modality, AI technique, key performance outcomes, and potential clinical implications, offering a consolidated view of how image-based AI is currently advancing respiratory diagnostics in veterinary practice.

The studies compiled in Table 5 demonstrate that AI, particularly deep learning models, has achieved considerable success in analyzing thoracic images for respiratory disease detection in dogs and cats. Architectures such as ResNet-50, DenseNet-121, U-Net, and vision transformers have been widely employed for tasks ranging from multi-label classification of radiographic patterns to automated segmentation of lung fields and masses. Performance metrics reported are often clinically relevant, with many studies achieving AUC values above 0.8 or 0.9 for conditions like alveolar patterns, pleural effusion, pneumothorax, and cardiomegaly [8,38,43]. For example, Banzato et al. [8] reported AUCs greater than 0.9 for detecting alveolar patterns and pleural effusion in feline radiographs, while Burti et al. [43] achieved an AUC of 0.973 for detecting cardiomegaly in dogs using ResNet-101. Segmentation tasks have also shown high precision, with Jurgas et al. [44] reporting a Dice Similarity Coefficient of 0.91 for pulmonary mass segmentation in canine CT scans. Additionally, some studies have extended beyond detection to include image quality assessment, such as evaluating collimation, positioning, and exposure [39], which can help reduce non-diagnostic studies and improve workflow efficiency.

4.3. Multimodal-Based Diagnostic

This section examines studies that integrate multiple types of data—such as audio, video, sensor signals, and imaging—to enhance the detection and monitoring of respiratory conditions in dogs and cats. Multimodal AI approaches aim to overcome the limitations of single-modality systems by combining complementary information, thereby improving diagnostic robustness, accuracy, and clinical applicability. The works summarized in Table 6 employ a variety of AI techniques, including artificial neural networks (ANNs), convolutional neural networks (CNNs), and custom algorithms, to analyze integrative data from wearable devices, smartphones, and pressure sensors. Each entry outlines the study’s objective, sample characteristics, data types, AI methodology, key contributions, and veterinary implications, providing a structured overview of how multimodal AI is being developed and validated for respiratory health assessment in companion animals.

The integration of multimodal data through artificial intelligence represents a significant advancement in veterinary diagnostics, particularly for the detection and continuous monitoring of respiratory diseases in companion animals. Studies such as those by Withington et al. [48] and Angelucci et al. [50] illustrate how combining diverse data streams—including audio, video, motion, and sensor signals—can yield a more holistic and accurate assessment of an animal’s respiratory health. For instance, Angelucci et al. [50] demonstrated that video-based respiratory rate monitoring in sleeping dogs achieved high accuracy (RMSE = 1.1, MAE = 0.7), offering a low-cost, non-invasive method suitable for at-home monitoring and telemedicine applications. Similarly, Jarkoff [49] reported that a smart collar equipped with motion sensors and AI algorithms could estimate resting heart and breathing rates with minimal error (SMAPE 0.38% for heart rate, 1.42% for breathing rate), enabling continuous, real-time vital sign tracking outside clinical settings.

5. Discussion

Our systematic review successfully synthesized the emerging evidence on AI applications to support the detection of respiratory diseases in dogs and cats. The most important finding of this study is that, while promising, veterinary AI lags significantly behind its human medicine counterpart. Unlike human healthcare, where AI tools for respiratory diagnosis are more advanced and widely validated, veterinary applications are constrained by critical barriers such as data scarcity, lack of standardization, and limited real-world clinical integration. This discussion synthesizes the key opportunities and challenges across the three diagnostic modalities, highlighting areas requiring focused research, standardization, and validation.

5.1. Audio-Based Diagnostics: Beyond BOAS and Towards Generalized Respiratory Sound Analysis

Current audio-based AI research has effectively demonstrated its utility in objective screening for Brachycephalic Obstructive Airway Syndrome (BOAS), achieving notable accuracy through models analyzing laryngeal sounds and post-exercise respiratory patterns. Traditional auscultation and clinical evaluation can be subjective and variably performed, whereas AI models offer standardized, data-driven insights. In fact, respiratory sounds in dogs and cats can be challenging to differentiate due to overlapping acoustic characteristics between normal and abnormal sounds, as well as between different pathologies. Below, we clarify the physiological and anatomical origins of sounds that may lead to confusion in AI-based evaluation:

Normal respiratory sounds (e.g., tracheal, bronchial, and vesicular sounds) arise from laminar airflow through open airways and are typically soft, low-pitched, and regular. Abnormal sounds such as stertor (originating from the nasopharynx) and stridor (originating from the larynx or trachea) are often higher-pitched and may be confused with normal turbulent airflow in brachycephalic breeds, where anatomical narrowing is common even in healthy individuals [54].
Crackles (associated with pulmonary edema, fibrosis, or pneumonia) and wheezes (associated with airway obstruction, e.g., feline asthma) can be difficult to distinguish from artifacts (e.g., movement, panting, or environmental noise) or from normal inspiratory/expiratory sounds in anxious or panting animals [55].
Breed-specific anatomical variations (e.g., elongated soft palate in brachycephalic dogs) can produce sounds that resemble pathological stertor, leading to false positives if models are trained on limited or non-representative datasets. These overlapping acoustic profiles highlight the need for AI models to be trained on well-annotated, diverse datasets that include clear labels for sound origin (anatomical site) and context (rest, exercise, stress). Future work should also incorporate multimodal data (e.g., simultaneous video or spirometry) to disambiguate ambiguous acoustic patterns.

Futhermore, it is insightful to compare these AI-based audio diagnostic approaches with traditional, non-AI methods for BOAS assessment, such as structured owner questionnaires and standardized exercise tolerance tests (ETTs). For example:

The study by Anyamaneecharoen et al. [56] utilized a detailed owner questionnaire and a 6-min walk test (6-MWT) to assess Brachycephalic Obstructive Syndrome (BOAS) severity in French Bulldogs. Their results showed a clear gradient: the normal group walked significantly farther ( $[eqn]$ ) than the moderate ( $[eqn]$ ) and severe ( $[eqn]$ ) BOAS groups. Furthermore, they found a strong negative correlation (r = −0.757, p< 0.001) between the 6-MWT distance and the owner-reported breathing sound scores, indicating that poorer exercise capacity closely aligns with more severe respiratory noise.
Similarly, Reyes-Sotelo et al. [57] employed a combination of a 6-min walk and a 1000-m walk test to evaluate dogs of different cephalic biotypes. Their findings confirmed that brachycephalic dogs, especially those with BOAS grades 2 and 3 (G2, G3), covered significantly less distance in the 1000-m test and exhibited more pronounced physiological alterations (e.g., sustained low SpO_2_, elevated heart and respiratory rates, poor thermoregulatory recovery) compared to dolichocephalic and mesocephalic dogs. They also identified specific morphometric risk factors, such as muzzle length < 38 mm and nasal fold thickness ≥ 20 mm, associated with severe BOAS. These traditional studies underscore the clinical value of exercise tests but also highlight their inherent limitations and risks. Tests like the 6-MWT or 1000-m walk, while informative, impose physical stress that can be hazardous for severely affected brachycephalic dogs, potentially triggering dyspnea, cyanosis, hyperthermia, or collapse. This is precisely where AI-based audio diagnostics present a transformative opportunity.

The AI models reviewed, such as those deployed by McDonald et al. [34] (AUC = 0.85 for BOAS detection from laryngeal sounds) and Oren et al. [10] (85% accuracy), demonstrate that AI can extract diagnostically rich information from respiratory sounds recorded under controlled or minimally stressful conditions, potentially even at rest or during mild activity. For example, an AI model trained on both acoustic data (e.g., resting or post-mild-exercise respiratory sounds) and the corresponding outcomes of standardized 6-MWT or 1000-m walk tests could learn to predict a dog’s functional exercise capacity and BOAS severity grade. A dog presenting with specific acoustic signatures (e.g., certain stertor patterns, spectral characteristics identified by FFT) could be algorithmically assessed as “high risk” for failing a strenuous exercise test, thus contraindicating the physical test itself. This approach would shift the paradigm from provoking a physiological crisis to diagnosing by predicting it through safe, passive monitoring.

The future of BOAS diagnosis lies not in choosing between AI and traditional methods but in their intelligent integration to support the diagnosis:

AI for Triage and Continuous Monitoring: AI tools could be deployed in-clinic for rapid, objective screening during routine exams or via wearable/smartphone technologies for at-home monitoring. They can provide an initial, risk-strratified assessment without stress.
Traditional Tests for Calibration and Validation: Well-established questionnaires and controlled, gentle walk tests (like the initial phase of a 6-MWT in a climate-controlled environment) remain crucial for validating AI models, gathering owner-reported outcomes, and assessing cases where AI predictions are uncertain.
Morphometric Data as Context: As shown by Reyes-Sotelo et al. [57], morphometric data (muzzle length, neck circumference) are strong risk indicators. Future multimodal AI systems could integrate audio analysis with simple anatomical measurements from images or 3D scans for a holistic risk assessment. In summary, while traditional questionnaire- and exercise test-based methods provide a valuable clinical benchmark and correlate well with disease severity, they carry non-negligible risk and are dependent on owner compliance and environmental control. AI-based audio diagnostics offer a complementary pathway that is objective, scalable, and minimally invasive. The most promising clinical application is to develop these AI tools as predictive filters, capable of identifying dogs for whom traditional exercise tests would be of high risk or low diagnostic yield. By doing so, veterinary practice can enhance patient safety, support the detection of BOAS, and guide breeding decisions with greater precision and ethical responsibility, moving proactively toward a model of predictive, preventive, and personalized care for brachycephalic breeds.

5.2. Image-Based Diagnostics: The Imperative for Explainability and Seamless Clinical Integration

Deep learning models for thoracic radiograph and CT scan analysis have demonstrated robust performance in detecting specific pathological patterns, such as alveolar infiltrates, cardiomegaly, and pulmonary masses [8,37,43,44]. However, transitioning these models from research environments to reliable clinical tools requires addressing two interconnected challenges that extend beyond mere algorithmic accuracy: enhanced explainability and practical workflow integration.

While techniques like Gradient-weighted Class Activation Mapping (Grad-CAM) provide valuable visual heatmaps by highlighting regions of interest in an image [9], their output often remains abstract for the practicing veterinarian. There is a pressing need for more intuitive, clinically-grounded explanations that align directly with the spatial and descriptive reasoning veterinarians employ during radiographic interpretation. Future AI systems should be designed to generate reports that explicitly reference specific, recognizable anatomical landmarks, which are fundamental to veterinary image assessment. For instance, an AI tool could indicate that a detected interstitial pattern is primarily localized around the perihilar region [58] or that cardiomegaly is suggested by a vertebral heart score (VHS) exceeding established thresholds based on the cranial border of the 4th thoracic vertebra and the caudal cardiac silhouette [59]. Other critical reference sites include the costophrenic angles for assessing pleural effusion [60], the bronchovascular pattern for identifying interstitial disease [61], the diaphragmatic border for evaluating intrathoracic masses or herniation [62], and the tracheal axis for detecting mediastinal shifts [63]. By correlating AI findings with these standard anatomical reference points and pairing them with standardized textual descriptors of radiographic signs (e.g., “moderate alveolar pattern in the left caudal lung lobe” or “increased bronchial wall thickness extending to the peripheral airways”), AI systems can bridge the interpretability gap, fostering trust and facilitating more efficient clinical decision-making.

Furthermore, a significant limitation in the current evidence base is that the majority of validation studies are conducted on retrospective, often highly curated, single-center datasets. The real-world diagnostic performance of these models can degrade substantially due to factors commonplace in general practice but underrepresented in training data. These include vast variations in image quality (e.g., exposure, motion blur), suboptimal patient positioning artifacts, the presence of concurrent and complex pathologies in a single study, and the diverse range of digital radiography systems and techniques used across clinics [39,64]. This underscores the critical and non-negotiable need for robust, prospective validation studies conducted in diverse, real-world clinical environments. Historically, many diagnostic aids and algorithms have shown promising accuracy in controlled experimental settings but have demonstrated poor sensitivity, specificity, or generalizability when deployed in routine practice, failing to maintain performance across different patient breeds, sizes, clinical states, and institutional protocols [14,64,65]. Therefore, rigorous multi-center prospective trials are essential. These trials must not only evaluate diagnostic accuracy metrics (like AUC, sensitivity, specificity) on held-out, multi-institutional data but must also assess clinical utility—measuring outcomes such as reduction in time-to-diagnosis, influence on treatment planning, improvement in diagnostic confidence among general practitioners, and, ultimately, impact on patient prognosis.

To this end, future development must prioritize the creation of “clinician-in-the-loop” AI systems. These are not meant to be autonomous, but are designed as intelligent assistants. Such systems should go beyond simple detection to quantify prediction uncertainty [66], automatically flag technically suboptimal images (e.g., poor collimation, rotation) in real-time before interpretation [67], and provide ranked differential diagnoses accompanied by confidence scores and relevant clinical context. Achieving this vision requires prospective testing of seamless integration with existing veterinary practice infrastructure, primarily Picture Archiving and Communication Systems (PACS), to evaluate the tool’s true impact on radiologic workflow efficiency, diagnostic error reduction, and overall clinician confidence in everyday settings.

5.3. Multimodal AI: Data Fusion Strategies and the Challenge of Clinical Actionability

The importance of multimodal AI lies in its ability to overcome the limitations inherent in single-modality approaches. By fusing complementary data sources—such as audio recordings for respiratory sounds, video for respiratory effort, and inertial sensors for movement and cardiac activity—AI systems can enhance diagnostic reliability, reduce false positives, and capture subtle physiological changes that may indicate any disease. Withington et al. [48] highlighted how pressure sensor data from medical detection dogs could be classified using artificial neural networks to automate behavioral responses, reducing cognitive load and training time. This approach not only supports diagnostic accuracy but also paves the way for automated, non-invasive monitoring systems that can operate in naturalistic environments, such as the home or shelter.

In practical veterinary use, multimodal AI tools offer several key benefits: they enable continuous monitoring of at-risk animals (e.g., brachycephalic breeds prone to BOAS), support telemedicine and remote consultations by providing objective data to veterinarians, and facilitate preventive care through longitudinal tracking of respiratory trends. For example, systems that combine audio and video inputs, as validated by Angelucci et al. [53], allow owners to participate actively in their pet’s health management using everyday devices like smartphones. Moreover, wearable technologies like the smart collar studied by Jarkoff [49] can alert caregivers to deviations from normal respiratory patterns, prompting timely veterinary intervention. These innovations are particularly valuable in settings with limited access to specialist care, as they provide scalable, user-friendly tools for respiratory assessment without requiring specialized equipment or frequent clinic visits.

Despite these promising developments, challenges remain in the widespread adoption of multimodal AI in veterinary practice. These include the need for standardized data collection protocols [14], integration into existing clinical workflows [68], and validation across diverse breeds and environments [69]. Future research should focus on creating shared multimodal datasets, improving model interpretability, and conducting multi-center clinical trials to ensure robustness and generalizability. Nevertheless, the current evidence underscores the transformative potential of multimodal AI to enhance respiratory disease detection, improve animal welfare, and support veterinarians with actionable, data-driven insights.

5.4. Bridging the Data Scarcity Gap: Leveraging Human Medicine and Advanced Learning Paradigms

A fundamental and persistent challenge across all AI modalities in veterinary diagnostics is the acute scarcity of large-scale, high-quality, and consistently annotated datasets [19,30]. Unlike human medicine, where initiatives like MIMIC-CXR [70] or the NIH Chest X-ray dataset [71] provide hundreds of thousands of labeled images, veterinary data remains fragmented, institution-specific, and often proprietary. This data paucity severely restricts the development, validation, and generalizability of deep learning models, which are inherently data-hungry. To overcome this critical bottleneck and unlock the full potential of veterinary AI, a strategic shift toward data-efficient methodologies is imperative. Future research must focus on two synergistic pathways: leveraging cross-domain knowledge and pioneering advanced, resource-conscious learning paradigms.

The first strategic pathway involves aggressive exploration of cross-species transfer learning and sophisticated domain adaptation techniques. In facr, the anatomical similarities in thoracic structures between humans and companion animals, especially in pathology manifestations like pleural effusion or pneumothorax [72], can provide a foundation for pre-training models on vast human data before fine-tuning on smaller veterinary sets. Pioneering studies, such as the work by Celniak et al. [47], have validated the feasibility of this approach. Their model, pre-trained on a heterogeneous mix of over 500,000 human and canine radiographs using self-supervised learning, demonstrated improved performance on veterinary-specific classification tasks after fine-tuning. This paradigm allows models to learn universal features of disease presentation—such as texture, shape, and spatial relationships—from the vast, richly annotated repositories of human medical imaging before specializing in the veterinary domain. Future work should extend this concept beyond radiography to other modalities, such as adapting models trained on human lung sound databases or human wearable sensor data for veterinary audio and physiological monitoring applications. The key will be developing robust domain adaptation algorithms that can effectively minimize the “domain shift” caused by anatomical differences (e.g., thoracic conformation in brachycephalic breeds) and imaging protocol variations.

The second, equally critical pathway is the dedicated adoption of next-generation machine learning frameworks designed for learning from limited supervision. Relying solely on supervised learning with expert-labeled data is unsustainable for the veterinary field. Instead, the community must invest in:

Self-Supervised Learning (SSL): SSL algorithms can learn powerful representations by solving “pretext tasks” on unlabeled data, such as predicting the rotation of an image or reconstructing masked parts of a spectrogram. Veterinary clinics generate terabytes of unlabeled images and audio recordings daily. SSL can transform this untapped resource into a pre-training goldmine, creating foundation models that encode general veterinary-relevant features without a single diagnostic label [73].
Contrastive Learning: This technique learns by contrasting similar (positive) and dissimilar (negative) data pairs. It is exceptionally effective for learning robust representations that are invariant to nuisance variations (e.g., different X-ray machine exposures, variable stethoscope placement) while being sensitive to pathological differences. This is crucial for building models that perform consistently across diverse clinical settings [74].
Few-Shot and Meta-Learning: These paradigms aim to train models that can learn new diagnostic tasks from only a handful of examples. This is directly applicable to rare respiratory conditions in veterinary medicine or for quickly adapting a general model to a specific clinic’s patient demographics or imaging equipment [75,76]. Prioritizing these advanced paradigms will catalyze a shift from a dependency on massive, curated datasets toward intelligent, efficient learning from available data. The ultimate goal is to build AI systems that are not only accurate but also data-frugal, adaptable, and inherently robust to the heterogeneity of real-world veterinary practice. This strategic focus on bridging the data scarcity gap is not merely a technical improvement but a foundational requirement for the equitable, widespread, and clinically impactful deployment of AI in veterinary respiratory medicine.

5.5. Pathway to Clinical Deployment: Validation, Standardization, and Ethical Implementation

The transition of AI models from promising research to reliable, trusted tools in daily veterinary practice represents a significant translational challenge. This pathway requires a comprehensive, multi-faceted framework that extends far beyond demonstrating high accuracy on retrospective datasets [77]. The cornerstone of this framework is the shift from internal validation to rigorous external, multi-center prospective trials [19,30]. Such studies must be designed to evaluate models on heterogeneous, real-world data that reflects the full spectrum of clinical variability—including differences in imaging equipment, patient positioning, concurrent pathologies, and operator skill [39,64]. These trials should move beyond traditional performance metrics (e.g., AUC, sensitivity, specificity) to include robust assessments of clinical utility and impact. This involves measuring tangible outcomes such as reduction in diagnostic turnaround time, influence on clinical decision-making and treatment plans, improvement in diagnostic confidence among general practitioners, and, ultimately, positive effects on patient prognosis and welfare [14,31]. The history of diagnostic aids in both human and veterinary medicine underscores that high performance in controlled settings does not guarantee utility in the chaotic clinical environment; therefore, prospective validation is non-negotiable [65].

Concurrently, there is an urgent, community-wide need for standardization in both data acquisition and evaluation. The current heterogeneity in how data is collected (e.g., audio recording settings, radiographic views) and how results are reported (e.g., varying performance metrics, lack of clarity on test set composition) severely hampers meaningful comparison between studies and slows collective progress [27,30]. The development and adoption of veterinary-specific standards—such as consensus protocols for respiratory sound recording, guidelines for AI-optimized radiographic positioning, and standardized reporting checklists for AI studies (akin to human medicine’s CLAIM checklist)—are essential. Such standardization will ensure data quality, facilitate the creation of shared, high-quality datasets, and enable the aggregation of evidence across research groups [68].

Finally, and critically, the ethical dimensions of AI deployment must be proactively and transparently addressed to build trust and ensure responsible use. Key issues include:

Algorithmic Bias and Fairness: Models must be evaluated for performance disparities across different dog and cat breeds, sizes, and ages to prevent inequitable care [28,31]. A model that performs well on Labrador retrievers but poorly on French Bulldogs would be clinically harmful and ethically untenable.
Data Privacy and Security: Clear protocols must govern the collection, storage, and use of patient and client data, ensuring compliance with data protection regulations and maintaining client trust [19].
Defining the AI’s Role: It must be unequivocally communicated that AI serves as a decision-support tool, augmenting rather than replacing veterinary clinical judgment and expertise. Guidelines are needed to define appropriate use cases, limitations, and the necessity of human oversight, particularly for high-stakes decisions [28,31,69].
Transparency and Explainability: As discussed in Section 5.2, providing interpretable outputs that align with veterinary reasoning is not just a technical issue but an ethical imperative for informed use. Successfully navigating the pathway to clinical deployment requires that technological advancement be matched by a commitment to rigorous validation, community standardization, and ethical stewardship. Establishing trust with veterinarians, technicians, and pet owners is as vital as algorithmic performance for the sustainable integration of AI into veterinary medicine.

6. Implications for Future Research and Clinical Veterinary Practice

Based on the systematic review presented in the present systematic review, the implications for future research and clinical veterinary practice are substantial and interconnected. They can be summarized in two key areas:

6.1. Implications for Future Research

Future research must pivot from proof-of-concept studies to translational work that bridges the gap to clinical utility. We consider the following key priorities:

Data Infrastructure and Standardization: There is an urgent need for large-scale, collaborative efforts to create shared, well-annotated veterinary datasets. Research should focus on establishing standardized data acquisition protocols (e.g., for audio recording, radiographic positioning) to ensure data quality and model generalizability. Advanced learning paradigms like self-supervised and cross-species transfer learning should be aggressively explored to overcome data scarcity by leveraging unlabeled archives and human medical databases.
Clinical and Technical Validation: Research must move beyond retrospective, single-center validation. Multi-center, prospective trials are essential to evaluate AI performance on heterogeneous, real-world data with variable quality and concurrent pathologies. Studies should assess not just diagnostic accuracy but also clinical utility metrics, such as impact on time-to-diagnosis, treatment decisions, and patient outcomes. For multimodal AI, research must develop and test intelligent data fusion strategies and clinical correlation engines to translate complex signals into actionable insights.
Explainability and Human–AI Collaboration: Developing intuitive explainability tools that align with veterinary clinical reasoning is critical for trust and adoption. Future studies should design “clinician-in-the-loop” systems that quantify uncertainty, flag suboptimal data quality, and provide differential diagnoses. Research must also formally address ethical implementation, including studies on algorithmic bias across breeds and the development of guidelines for AI’s role as a decision-support tool.

6.2. Implications for Clinical Veterinary Practice

For practicing veterinarians, the evolution of AI promises to augment capabilities and reshape aspects of clinical workflow, but requires careful integration. We consider the following implications:

Enhanced Diagnostic Support and Accessibility: AI tools will likely become vital decision-support aids, particularly in settings without specialist radiologists or during emergency hours. They can provide rapid, objective screening (e.g., for cardiogenic pulmonary edema), automate tedious measurements (e.g., Vertebral Heart Size), and assess radiographic quality in real-time to reduce repeats. This can improve diagnostic confidence, consistency, and detection for general practitioners.
Shift Towards Proactive and Remote Monitoring: The rise of wearable sensors and multimodal AI enables a paradigm shift from reactive to proactive, continuous health monitoring. Veterinarians can leverage data from smart collars or owner-collected smartphone videos for longitudinal tracking of at-risk patients (e.g., brachycephalic breeds, cardiac cases). This supports earlier intervention, enhances telemedicine consultations with objective data, and empowers preventive care and owner engagement.
Necessity for New Skills and Critical Engagement: Successful adoption requires veterinarians to develop digital literacy to critically evaluate AI outputs, understanding their limitations and potential biases. The profession must engage in shaping these tools, ensuring they address real clinical needs and integrate seamlessly into practice workflow. Ultimately, AI will not replace clinical expertise but will augment it, allowing veterinarians to focus more on complex decision-making, client communication, and hands-on care.

7. Limitations and Future Perspectives

This review has certain limitations that should be considered when interpreting its conclusions. First, the included studies are predominantly proof-of-concept in nature, often based on retrospective data from single institutions. This limits the generalizability of the findings to broader, more diverse clinical settings. Second, the heterogeneity in study designs, AI methodologies, and performance metrics across the reviewed literature makes direct comparisons and definitive conclusions about the superiority of any single approach challenging. Third, the scope of the review, focused on studies from 2019 onward, while capturing recent advancements, may not encompass all foundational work in the field.

The path forward requires addressing these limitations. Future research should prioritize the development of large, well-annotated, and shared veterinary datasets to improve model generalizability. There is a critical need for standardized protocols in data acquisition (e.g., for audio recordings and radiographic imaging) and performance reporting. Moreover, advancing beyond technical validation, rigorous multi-center prospective trials are essential to evaluate the real-world clinical utility, workflow integration, and impact on patient outcomes of these AI tools. Exploring advanced learning paradigms, such as self-supervised and cross-species transfer learning, could help overcome data scarcity. Finally, the ethical dimensions of deployment—including algorithmic bias, data privacy, and the clear definition of AI’s role as a decision-support aid—must be proactively addressed to ensure responsible and trusted adoption.

8. Conclusions

Based on this systematic review, the application of artificial intelligence (AI) to support the detection of respiratory diseases in dogs and cats shows significant promise, particularly through audio-based, image-based, and multimodal diagnostic approaches. The main findings indicate that deep learning models—such as convolutional neural networks (CNNs), ResNet, and U-Net—can achieve clinically relevant accuracy in specific tasks, such as analyzing chest radiographs for patterns like cardiomegaly and alveolar infiltrates and classifying respiratory sounds for conditions like Brachycephalic Obstructive Airway Syndrome (BOAS). These results suggest AI’s potential to serve as a supportive tool that can enhance diagnostic consistency, reduce observer variability, and aid in early intervention within veterinary practice.

In conclusion, AI represents a promising and transformative complementary tool for respiratory disease diagnosis in pets. However, its successful and reliable integration into clinical workflows is not yet realized. The transition from promising research to practical clinical application depends on a concerted effort to overcome the existing methodological, data-related, and validation barriers identified in this review.

Bibliography77

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Jatav R.S. Pratap A. Vaishnav N. Sharma N. General aspects of introduction to diseases, diagnosis, and management of dogs and cats Introduction to Diseases, Diagnosis, and Management of Dogs and Cats Elsevier Amsterdam, The Netherlands 2024317
2Johnson L. Canine and Feline Respiratory Medicine, An Issue of Veterinary Clinics of North America: Small Animal Practice Elsevier Health Sciences Amsterdam, The Netherlands 2020 Volume 50
3Elgalfy G.E.A.M. Clinical and Diagnostic Studies on Respiratory System Affections in Dogs and Cats Ph.D. Thesis Faculty of Veterinary Medicine, Benha University Banha, Egypt 2022
4Zamorska T. Grushanska N. Cardiogenic and Non-Cardiogenic Pulmonary Oedema in a Domestic Cat: Pathological Mechanisms, Differential Diagnosis, and Treatment Ukr. J. Vet. Sci.202213344310.31548/ujvs.13(1).2022.34-43 · doi ↗
5Bouyssou S. Specchi S. Desquilbet L. Pey P. Radiographic appearance of presumed noncardiogenic pulmonary edema and correlation with the underlying cause in dogs and cats Vet. Radiol. Ultrasound 20175825926510.1111/vru.1246828005303 · doi ↗ · pubmed ↗
6Orakpoghenor O. Problems in veterinary pathology: A focus on diagnosis Eur. J. Sci. Res. Rev.2024110311110.5455/EJSRR.20240625055842 · doi ↗
7Kamel M.S. Davidson J.L. Verma M.S. Strategies for bovine respiratory disease (BRD) diagnosis and prognosis: A comprehensive overview Animals 20241462710.3390/ani 1404062738396598 PMC 10885951 · doi ↗ · pubmed ↗
8Banzato T. Wodzinski M. Tauceri F. DonàC. Scavazza F. Müller H. Zotti A. An AI-based algorithm for the automatic classification of thoracic radiographs in cats Front. Vet. Sci.2021873193610.3389/fvets.2021.73193634722699 PMC 8554083 · doi ↗ · pubmed ↗