Deep learning for autism detection using clinical notes: A comparison of transfer learning for a transparent and black-box approach
Gondy Leroy, Prakash Bisht, Sai Madhuri Kandula, Nell Maltman, Sydney Rice

TL;DR
This study compares transparent BioBERT-based models and black-box models for autism detection from clinical notes, demonstrating that transparent models with mixed datasets achieve high accuracy and better transferability, advancing trustworthy AI in diagnostics.
Contribution
The paper introduces a transparent, interpretable ML approach using BioBERT for autism detection and compares its transfer learning performance to black-box models across multiple datasets.
Findings
Transparent model achieved 97% sensitivity and 98% specificity.
Mixed dataset training improved model performance.
Transparent models outperformed black-box models in transferability.
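For readers less familiar with the two metrics reported above, the following minimal sketch shows how sensitivity and specificity are computed from binary predictions. The function name and the toy labels are hypothetical and invented for illustration; they are not data or code from the paper.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels: 1 = ASD, 0 = not ASD."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # ASD cases correctly flagged
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # ASD cases missed
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # non-ASD cases correctly cleared
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # non-ASD cases wrongly flagged
    # Guard against division by zero when a class is absent from y_true.
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Toy labels, made up for illustration only.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 1]
sens, spec = sensitivity_specificity(y_true, y_pred)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```

Sensitivity is the share of true ASD cases the model identifies; specificity is the share of non-ASD cases it correctly rules out.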
Abstract
Autism spectrum disorder (ASD) is a complex neurodevelopmental condition whose rising prevalence places increasing demands on a lengthy diagnostic process. Machine learning (ML) has shown promise in automating ASD diagnosis, but most existing models operate as black boxes and are typically trained on a single dataset, limiting their generalizability. In this study, we introduce a transparent and interpretable ML approach that leverages BioBERT, a state-of-the-art language model, to analyze unstructured clinical text. The model is trained to label descriptions of behaviors and map them to diagnostic criteria, which are then used to assign a final label (ASD or not). We evaluate transfer learning, the ability to transfer knowledge to new data, using two distinct real-world datasets. We trained on the datasets both sequentially and mixed together, and compared the performance of the best models and…
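The transparent pipeline described in the abstract (label behaviors, map them to diagnostic criteria, then assign a final label) could look roughly like the sketch below. This is an illustration under stated assumptions, not the authors' implementation: the abstract does not state the aggregation rule, so the sketch assumes a DSM-5-style rule (all three social-communication criteria A1 to A3, plus at least two of the four restricted/repetitive-behavior criteria B1 to B4). The sentence-level labels, which the paper obtains from a BioBERT-based model, are hard-coded here, and all function names are hypothetical.

```python
from typing import Iterable, Set

# Assumed criterion identifiers, following the DSM-5 layout (an assumption,
# not taken from the abstract).
A_CRITERIA = {"A1", "A2", "A3"}
B_CRITERIA = {"B1", "B2", "B3", "B4"}


def criteria_met(sentence_labels: Iterable[str]) -> Set[str]:
    """Collect the distinct diagnostic criteria found among sentence-level labels."""
    known = A_CRITERIA | B_CRITERIA
    return {label for label in sentence_labels if label in known}


def assign_label(sentence_labels: Iterable[str]) -> str:
    """Apply the assumed rule: all A criteria and at least two B criteria."""
    met = criteria_met(sentence_labels)
    all_a = A_CRITERIA <= met
    two_b = len(met & B_CRITERIA) >= 2
    return "ASD" if all_a and two_b else "not ASD"


# Hypothetical per-sentence labels standing in for the classifier's output;
# "none" marks a sentence that matches no criterion.
record = ["A1", "none", "A2", "A3", "B1", "none", "B4"]
print(assign_label(record), sorted(criteria_met(record)))
```

Because the final label is derived from an explicit list of matched criteria, a clinician can inspect which behaviors drove the decision, which is the sense in which such a pipeline is transparent.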
Taxonomy
Topics: Autism Spectrum Disorder Research · Domain Adaptation and Few-Shot Learning · Digital Mental Health Interventions
