ASD-Bench: A Four-Axis Comprehensive Benchmark of AI Models for Autism Spectrum Disorder
Shubhankit Singh, Hassan Shaikh, Kuldeep Raghuwanshi, Keshav Bulia

TL;DR
ASD-Bench is a comprehensive benchmark evaluating various AI models across age groups and axes like performance, interpretability, and robustness for autism spectrum disorder screening.
Contribution
Introduces a systematic, multi-axis benchmark for AI models on ASD detection across different age cohorts using diverse model architectures.
Findings
Adult models achieve perfect F1 and AUC scores.
Adolescents are more challenging, with lower F1 scores.
Feature importance varies significantly across age groups.
Abstract
Automated ASD screening tools remain limited by single-architecture evaluations, axis-restricted assessment, and near-exclusive focus on adult cohorts, obscuring age-specific diagnostic patterns critical for early intervention. We introduce ASD-Bench, a systematic tabular benchmark evaluating ML, deep learning, and foundation model configurations across three age cohorts (children 1-11 yr, adolescents 12-16 yr, adults 17-64 yr) on four axes: predictive performance, calibration, interpretability, and adversarial robustness. Applied to a curated v3 dataset of 4,068 AQ-10 records, our benchmark spans classical models (XGBoost, AdaBoost, Random Forest, Logistic Regression), neural networks (MLP), deep tabular transformers (TabNet, TabTransformer, FT-Transformer), and TabPFN v2. We introduce the Heuristic Aggregate Penalty (HAP): a cost-sensitive metric penalising false negatives more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
