Big data analysis for Covid-19 in hospital information systems
Xinpa Ying, Haiyang Peng, Jun Xie

TL;DR
This paper introduces a deep learning framework to improve the accuracy of identifying COVID-19 from CT images across different hospital systems.
Contribution
A novel deep learning framework that handles domain discrepancies in multi-site datasets for improved COVID-19 identification.
Findings
The proposed method outperforms the original model by 13.27% and 15.15% in AUC on two public datasets.
The framework improves prediction accuracy and learning efficiency through redesigned architecture and normalization techniques.
Contrastive training enhances domain invariance and classification performance across datasets.
Abstract
The COVID-19 pandemic has triggered a global public health crisis, affecting hundreds of countries. With the increasing number of infected cases, developing automated COVID-19 identification tools based on CT images can effectively assist clinical diagnosis and reduce the tedious workload of image interpretation. To expand the dataset for machine learning methods, it is necessary to aggregate cases from different medical systems to learn robust and generalizable models. This paper proposes a novel deep learning joint framework that can effectively handle heterogeneous datasets with distribution discrepancies for accurate COVID-19 identification. We address the cross-site domain shift by redesigning the COVID-Net’s network architecture and learning strategy, and independent feature normalization in latent space to improve prediction accuracy and learning efficiency. Additionally, we…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Fig 1
Fig 2
Fig 3
Fig 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications · Machine Learning in Healthcare
Introduction
The global public health crisis caused by COVID-19 continues to spread. Medical imaging, particularly computed tomography (CT), plays a critical role in the clinical diagnosis [1] and monitoring of patients with COVID-19. However, the increasing number of COVID-19 suspicious cases has overloaded public health services, revealing a shortage of trained radiologists. Therefore, effective computational methods are needed to automate COVID-19 CT image analysis, improve diagnosis outcomes and patient management, and reduce the tedious workload of clinicians on image interpretation, allowing them to dedicate more time to urgent matters on the frontline.
In the era of big data, many data-driven methods have developed rapidly, with high accuracy typically attributed to the collection of large-scale datasets [2–4]. However, collecting these datasets in practice is often difficult. To address the issue of insufficient single-site data, it is necessary to aggregate CT data from different hospitals to establish a cross-hospital learning scheme. For example, Di et al. proposed a hypergraph model for multi-hospital pneumonia data to achieve rapid identification of COVID-19 cases [5] shown in Fig 1. Wang et al. also collected datasets from different hospitals to build an accurate X-ray image classifier [6]. However, all of these works have neglected the heterogeneity of data across different hospitals, such as imaging protocols and differences in CT equipment. CT slices from different datasets exhibit different image contrasts, which may weaken a model’s generalization ability. Previous works [7–9] have found that simple improvements on such heterogeneous data often bring limited improvement, or even worse performance than training on a single dataset.
The appearance and contrast of CT images from COVID-19 patients vary significantly across two different clinical centers, indicating data heterogeneity.
Our research aims to address the practical challenge of accurately identifying COVID-19 CT images by using an effective joint learning framework to handle distribution heterogeneity. We not only redesigned the latest COVID-Net [6] network architecture and learning strategy, but also conducted effective joint learning on a new backbone network, fully leveraging the advantages of multiple datasets. We also used domain-specific batch normalization layers to estimate internal feature statistics for each site separately and normalize the features. Our method aims to explicitly adjust the category sensitivity and domain invariance of the latent semantic feature space and proposes a contrastive learning objective. We evaluated our method on two public COVID-19 CT classification datasets, and extensive experiments showed that our joint learning framework outperforms single-site training models, simple joint learning, and existing state-of-the-art multi-site learning methods on both datasets. Our method effectively combines multiple data sources by proposing a joint learning framework, improving the quality of COVID-19 CT image recognition, and making a significant contribution to the field. Our research not only improves the accuracy of COVID-19 CT image recognition but also has the potential to help doctors diagnose COVID-19 more accurately in practice, providing support for epidemic prevention and control.
Our focus has been on enhancing the computational efficiency and prediction accuracy of the COVID-Net [6] specifically for COVID-19 CT images. To achieve this, we have revamped the original COVID-Net by revising its network architecture and learning strategies, which were originally developed for X-Ray.Our proposed joint learning framework improves COVID-19 diagnosis by effectively leveraging heterogeneous datasets. We address the issue of inter-site data discrepancy by performing separate feature normalization and proposing a contrastive objective to explicitly enhance the robustness of semantic representations.We extensively evaluated our method on two public datasets, using rigorous experimentation. The results consistently demonstrate significant improvement in classification performance on both datasets.
Related works
COVID-19 screening
A significant amount of research has been conducted with great speed and intensity to develop deep learning methods for responding to the global COVID-19 pandemic [10]. In this paper, we provide a brief overview of deep learning methods for image-level classification in diagnosis, which are closely related to the focus of our study.
Initially, Khan et al. [11] aimed to develop a large-scale screening model using chest CT images to distinguish COVID-19 pneumonia [11–14], Influenza-A viral pneumonia, and healthy cases. To achieve this, they employed ResNet18 with both location-attention mechanism and channel attention mechanism. Several follow-up methods based on transfer learning were proposed later, with most of them utilizing well-established network architectures such as VGG [15], ResNet [16–18], and DenseNet [19].
During this period, researchers proposed several new network architectures. COVID-Net, for example, was specifically designed for COVID-19 recognition and achieved high accuracy for image-level diagnosis based on chest X-ray (CXR) images [6]. Alqahtani et al. [20] using edges exploitation operation in an optimized structure with the convolutional operator facilitates learning edges-related features of infection patterns in chest X-ray images. Khan et al. [21] proposes a new deep CNN based technique for COVID-19 classification in X-ray images which systematically employs Region and Edge-based operations along with convolution operations. Zahoor et al. [22] propose a novel two-pahse deep learning-based framework to detect and categorize brain tumors in magnetic resonance images. Khan et al. [23] propose two new deep learning frameworks including Deep Hybrid Learning and Deep Boosted Hybrid Learning for effective COVID-19 detection in X-ray dataset. Meanwhile, Khan et al. [24] propose a novel semantic segmentation model CoV-RASeg which systematically uses average and max pooling operations in the encoder and decoder blocks. Khan et al. [25] also provide a survey of deep learning techniques for the analysis of COVID-19 and their usability for detecting omicron.
Javaheri et al. also designed CovidCTNet for detecting COVID-19 infections based on CXR images [2]. Another framework based on Capsule Network aimed to handle smaller datasets more effectively [26]. Gozes et al. proposed a system that utilized robust 2D and 3D deep learning models by modifying and adapting existing AI models and combining them with domain-specific clinical understanding [27]. Tang et al. explored severity-related features for automated severity assessment of COVID-19 based on chest CT images [28]. Rahimzadeh et al. concatenated features from Xception and ResNet50V2 networks to develop a neural network that achieved improved recognition performance [29].
Contrastive learning
Contrastive learning is a technique that aims to learn representations by contrasting positive and negative sample pairs [12, 13, 30]. It has been widely used in recent works on instance-level discrimination in self-supervised learning. This approach aims to learn representations by imposing transformation invariances in the latent space. In these works, the exemplar CNN represented each instance as a vector and trained a network to recognize each instance. However, one important issue in contrastive learning is negative sampling. To address this, Wu et al. proposed a memory bank to store instance vectors [31]. He et al. proposed the momentum contrast for visual representational learning [32]. More recently, Chen et al. proposed SimCLR, a method for generating augmentation-invariant embeddings for input images [33]. These works have achieved significant empirical successes in self-supervised representation learning.
Several research works have explored the potential of contrastive learning for COVID-19 screening. For instance, He et al. integrated contrastive learning and transfer learning to pre-train classification networks for COVID-19 diagnosis. Li et al. proposed a contrastive multi-task CNN that can improve generalization on unseen CT or X-ray samples for COVID-19 diagnosis [34].
In our study, we extensively utilize contrastive learning to obtain highly discriminative representations. In addition to conventional contrastive learning, we incorporate category information to better explore intra-class similarity and inter-class difference. Specifically, we adopt it as an auxiliary learning task that effectively enhances the classification performance of COVID-19.
Methods
In this section, we first discussed our process of designing COVID-Net. We redesign the original COVID-Net as backbone and performs seprate feature normalization to tackle the statoistical problem.
Then, we introduced our deep learning scheme, including addressing the issue of different data quality from different hospitals and feature normalization. Additionally, we introduced contrastive learning to learn domain-invariant features, resulting in improved classification performance of our model.
COVID-Net redesign for improved CT classification
Our model is based on COVID-Net [6], a deep learning architecture designed specifically for COVID-19 chest X-ray images that has outperformed several popular classification networks pretrained on ImageNet. Our network consists of two modules: a light upper branch with four convolutional layers and a lower branch composed of dense convolutional blocks for deep feature extraction. Long-range multi-level feature fusion is achieved through skip connections between the two branches. However, COVID-Net [6] was optimized for relatively coarse lesions in chest X-ray images and may not be ideal for CT images where the lesion pattern is clearer and provides richer information for the model to learn. Therefore, our goal is to build upon the strengths of this backbone while improving learning efficiency and classification accuracy from two complementary angles.
Network architecture redesign
The original COVID-Net [6] has a limitation in that it lacks internal feature normalization layers, which results in significant differences in the learned representations across different layers and branches. This feature variance is further amplified in CT images, which contain more complex patterns, and can potentially affect the training process and prediction accuracy if not appropriately addressed. To overcome this issue, we added batch normalization [35] (BN) layers to specific components of the network, reducing internal covariate shift, enhancing feature discrimination ability, and speeding up convergence rate. However, adding BN layers to every convolution layer is not necessarily beneficial. The computation blocks in the lower branch contain densely connected short-range connections, and adding BN layers to them would significantly increase the parameter scale and decrease training speed. Therefore, we added a BN layer after each convolutional layer in the upper branch and one for the initial convolutional layer to balance computational efficiency and stable representation.
Formally, the batch normalization (BN) operation obtains normalized features by applying affine transformation to the whitened feature maps along each channel iin1, dots, M, given M-channel feature maps of a certain layer:
In the above equation, μi and represent the mean and variance of feature xi, respectively, the framework is shown in in Fig 2; The BN layer collects moving average values of γ and β during training to capture global data statistics and employs these estimated values for feature normalization during the testing phase.
The COVID-Net redesign for improved CT classification.
Joint learning scheme with redesigned COVID-Net
Given the limited number of COVID-19 samples from individual hospitals, it is often necessary to aggregate cases from different data sources for deep learning model development. Therefore, we propose a joint learning scheme, built upon the redesigned COVID-Net backbone, to explicitly address the data heterogeneity problem and improve diagnosis performance.
Separate batch normalization (SBN) at data heterogeneity
In the presence of significant data heterogeneity, simple joint training has been shown to have limitations or even lead to performance degradation [7, 14, 38]. One significant factor contributing to this is inaccurate estimation of moving average values by the BN layer during training due to statistical differences across datasets, as depicted in Fig 2. Consequently, these estimated values may not accurately represent the testing data statistics at each site, leading to degraded performance. To overcome this issue, we utilize domain-specific batch normalization (DSBN) [7, 39, 40] by assigning a BN layer to each site to explicitly address statistical discrepancies. As illustrated in Fig 2, we substitute the BN layers in the redesigned COVID-Net with DSBN layers. The DSBN layer captures domain-specific moving values that accurately represent the statistics of each site, outperforming the original BN layer.
Contrastive domain invariance enhancement (CDIE)
In joint learning, in addition to addressing the issue of different data distributions across sites, we also aim to obtain robust semantic information that can cluster different source domain data. This is crucial for domain adaptation, as if the model cannot cluster data of the same class in the source domain, data from multiple hospitals cannot be fully utilized. To tackle this issue, we propose utilizing contrastive learning to cluster data within the same class while separating data between different classes.
To achieve this goal, we adopt the popular contrastive learning method [41]. Given a pair of samples (m, n), we obtain their semantic feature embeddings em and en, extracted through global average pooling layer with a vector dimension of 8096. We found that directly applying regularization to the original features hindered model convergence. Therefore, we introduce an embedding network Hϕ to project the embeddings into a lower-dimensional space. The final contrastive learning formula we obtained is as follows:
Positive pairs (m, n) refer to samples that belong to the same class, while negative pairs refer to samples that belong to different classes. In each iteration, we randomly sample a minibatch of K examples from the two sites. The contrastive loss for each positive pair (m, n) within the minibatch is defined as follows:
In this context, takes a value of 1 and 0 for positive and negative samples, respectively, where τ represents a temperature parameter. The loss function is ultimately calculated on a mini-batch of positive samples, considering both (m, n) and (n, m). Through this contrastive learning approach, the model can learn domain-invariant features that bring samples from the same class together and push those from different classes apart.
Overall training objective and technical details
The overall training objective is composed of two terms: the cross-entropy loss to assess classification error and the contrastive loss to regularize the latent space. The objective is defined as:
Experiments
Datasets and evaluation metrics
To evaluate the deep learning framework proposed in this study, we used two public medical image datasets: SARS-CoV-2 [42] and COVID-CT [43]. These datasets are currently widely used and the largest publicly available datasets. The SARS-CoV-2 dataset (referred to as Site A) consists of 2482 CT images from 120 patients, with 1252 positive and the rest negative. The spatial sizes of these images range from 119 × 104 to 416 × 512. The emphCOVID-CT dataset (referred to as Site B) includes 349 CT images from 216 COVID-19 patients and 397 CT images from 171 non-COVID-19 patients. The resolutions of these images range from 102 × 137 to 1853 × 1485. We first resized all images to 224 × 224 and then normalized them.
We conducted a four-fold cross-validation on our experiment using two datasets citeSoares2020sars, and we adopted five metrics to provide a comprehensive evaluation of our models, including accuracy (%), F1 score (%), sensitivity (%), precision (%), and AUC (%). We also presented the mean and standard deviation of three independent runs.
Experimental setting
We implemented the framework using Pytorch on an Nvidia Tesla V100 GPU. Both the classification model and embedding network were trained from scratch using the same Adam Optimizer. We initialized the learning rate to 1e-4 and decayed it using cosine annealing. To empirically adjust the hyper-parameters, we used grid search with a random small subset of the entire dataset and set the temperature parameter τ to 0.05. Fig 3 displays the ROC curves and PR curves of our approach. For our proposed method and all the comparison methods, we trained a total of 100 epochs with a batch size of 32, consisting of 16 images from each dataset.
(a) ROC curves of method. (b) PR curves of our method.
Effectiveness of network redesign on COVID-Net
To validate the effectiveness of our network redesign, we compared our redesigned backbone with the original COVID-Net. Two different experimental settings were used for the comparison: 1) the Single setting, which trains a model for each site separately, and 2) the Joint setting, which trains a model jointly using both datasets with naive aggregation. The results in Table 1 demonstrate that our Redesign model outperforms the original COVID-Net [6] in the Single setting on both sites by a significant margin, with consistent improvements observed across all five evaluation metrics. The Joint setting yielded similar observations, except for a slightly marginal improvement in precision in Site B. These findings underscore the superior representation learning ability of our redesigned backbone for COVID-19 diagnosis, indicating that our redesign strategy can effectively enhance the model’s discriminative power.
Table 1: Results of different methods on the two datasets for COVID-19 CT image classification (mean±std).
Ablation study
We conduct an extensive ablation study to investigate the role of each component in our method in Table 2. SepNrom [44] denotes that we only use SepNorm without any other strategy. Based on SepNorm, we add a SBN (separate batch normalization) to obtain SepNorm + SBN and CDIE (contrastive domain invariance ehancement) to obtain SepNorm + CDIE. We can see that our method can improve the baseline SepNorm by a large margin. This confirm the effectiveness of the proposed method, and the flops of this paper is 1.55G.
Table 2: Ablation study on different components of our method on the COVID-19 images.
Evaluation of domain divergence
We measure the difference between domains using the -distance, which is a widely used metric for evaluating distribution divergence. The -distance is defined as , where ϵ represents the classification error of a trained classifier the distinguishes the source and target domains. A smaller -distance indicates better alignment between the distributions. Our approach, illustrates in Fig 4, learns more invariant features, resulting in a reduction of the divergence between the source and target domains compared to other compared methods.
A -distance of SepNorm and ours.
We conducted paired t-tests to analyze the significance of the improvements of our method over the Joint, Single, and SepNorm approaches. The detailed results are shown in Tables 3 and 4. This range corresponds to a 0.95 confidence interval, we observed that all paired t-tests presented a p-value smaller than 0.05, indicating statistically significant improvements of our method on both sites.
Table 3: The p-value with paired t-test of our method with Single, Joint and SepNorm learning schemes.
Table 4: The p-value with paired t-test of our method with the state-of-the-art comparison methods.
Comparison with state-of-the-art methods
We compared our method with recently proposed medical image analysis methods, comparing three different approaches. The first is Series-Adapter [36], which proposes multiple domain adapters for joint learning from multiple datasets and incorporates residual blocks into domain-adaptive layers to reduce the difference between source and target domain distributions. The second is Parallel-Adapter [37], which proposes parallel domain adaptation and inserts filter banks and domain-adaptive convolutional layers into residual blocks to reduce domain distribution differences. This method demonstrated state-of-the-art performance on 10 image classification datasets. MS-Net [7] also constructs a multi-source domain model, primarily utilizing auxiliary domain-specific branches to improve feature learning ability and employing online learning strategies to enhance the segmentation performance of MRI datasets.
The evaluation of joint learning methods is based on the Joint approach. Table 1 shows that the Series Adapter performs better than the Joint model in both Site A and Site B. However, its improvements are unevenly distributed across both sites and still fall short of the Single approach. On the other hand, the Parallel Adapter achieves more balanced improvements over the Joint model. These two approaches’ improvements over the Joint model suggest that domain-specific parameters in the domain adapter are useful in addressing the issue of data heterogeneity. The MS-Net outperforms the two domain-adaptive approaches, indicating the advantages of the knowledge transfer process in this framework. Notably, our method surpasses all three state-of-the-art joint learning methods in both sites, highlighting the effectiveness of our approach in leveraging more robust representations from diverse datasets.
Conclusion
Our aim in this paper is to develop a highly accurate model for COVID-19 CT diagnosis by exploring the benefits of joint learning from heterogeneous datasets of different data sources. To achieve this, we propose a novel joint learning framework that utilizes a redesigned version of the recently proposed COVID-Net as a strong backbone. Our joint learning framework explicitly addresses the inter-site data heterogeneity by conducting separate feature normalization for each site. Additionally, we utilize a contrastive training objective to enhance the learning of domain-invariant semantic features that improve the identification performance on each dataset. The effectiveness and clinical significance of our approach are demonstrated through experiments conducted on two large-scale public datasets. However, we acknowledge that there is scope for further improvement and generalization of our approach. As future works, we aim to extend our model to a wider multi-site setting and employ transfer learning from other large-scale datasets to further enhance the diagnosis accuracy. We recognize the importance of improving the generalization capacity of our model to enhance its practical applicability. Overall, our proposed joint learning framework has shown promising results in improving the accuracy of COVID-19 CT diagnosis, and we believe that it has the potential to make a significant impact in the field of medical image analysis.
Further discussion
While our method has shown promising performance as a preliminary study of multi-site learning with COVID-19 data, there are still limitations that need to be addressed. One of the limitations is that our method is currently limited to the two sites used in our paper, making it suboptimal for direct application to other unseen sites. This limits the potential for wider cross-site deployment. Additionally, due to computational resource constraints and development urgency, we were unable to pretrain our redesigned model on large-scale datasets such as ImageNet. However, previous works have demonstrated that fine-tuning transferred models can improve performance and speed up the training process. As a near future work, we are interested in exploring how to connect carefully redesigned network architectures with model transfer learning from large-scale datasets, to balance their respective benefits. Furthermore, we plan to extend our method to more sites with different environments to validate the generalization capability of AI models in the context of COVID-19 CT image diagnosis.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Mei X, Lee H C, Diao K, et al. Artificial intelligence–enabled rapid diagnosis of patients with COVID-19[J]. Nature medicine, 2020, 26(8): 1224–1228. doi: 10.1038/s 41591-020-0931-3 32427924 PMC 7446729 · doi ↗ · pubmed ↗
- 2Fan D P, Zhou T, Ji G P, et al. Inf-net: Automatic covid-19 lung infection segmentation from ct images[J]. IEEE transactions on medical imaging, 2020, 39(8): 2626–2637. doi: 10.1109/TMI.2020.2996645 32730213 · doi ↗ · pubmed ↗
- 3Wang M, Li P, Shen L, et al. Informative pairs mining based adaptive metric learning for adversarial domain adaptation[J]. Neural Networks, 2022, 151: 238–249. doi: 10.1016/j.neunet.2022.03.031 35447481 · doi ↗ · pubmed ↗
- 4Di D, Shi F, Yan F, et al. Hypergraph learning for identification of COVID-19 with CT imaging[J]. Medical image analysis, 2021, 68: 101910. doi: 10.1016/j.media.2020.101910 33285483 PMC 7690277 · doi ↗ · pubmed ↗
- 5Wang L, Lin Z Q, Wong A. Covid-net: A tailored deep convolutional neural network design for detection of covid-19 cases from chest x-ray images[J]. Scientific reports, 2020, 10(1): 19549. doi: 10.1038/s 41598-020-76550-z 33177550 PMC 7658227 · doi ↗ · pubmed ↗
- 6Liu Q, Dou Q, Yu L, et al. MS-Net: multi-site network for improving prostate segmentation with heterogeneous MRI data[J]. IEEE transactions on medical imaging, 2020, 39(9): 2713–2724. doi: 10.1109/TMI.2020.2974574 32078543 · doi ↗ · pubmed ↗
- 7Zech J R, Badgeley M A, Liu M, et al. Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: a cross-sectional study[J]. P Lo S medicine, 2018, 15(11): e 1002683. doi: 10.1371/journal.pmed.1002683 30399157 PMC 6219764 · doi ↗ · pubmed ↗
- 8Shi F, Wang J, Shi J, et al. Review of artificial intelligence techniques in imaging data acquisition, segmentation, and diagnosis for COVID-19[J]. IEEE reviews in biomedical engineering, 2020, 14: 4–15. doi: 10.1109/RBME.2020.298797532305937 · doi ↗ · pubmed ↗
