Central Dogma Transformer II: An AI Microscope for Understanding Cellular Regulatory Mechanisms
Nobuyuki Ota

TL;DR
The paper introduces CDT-II, an interpretable AI model mimicking the central dogma, capable of revealing cellular regulatory mechanisms and predicting perturbation effects with high accuracy, serving as an AI microscope for biological insights.
Contribution
CDT-II is a novel architecture that models gene regulation with interpretability, directly linking learned structures to biological relationships and experimental hypotheses.
Findings
Predicts perturbation effects with mean r = 0.84
Recovers GFI1B regulatory network with 6.6-fold enrichment
Identifies regulatory elements and pathways relevant to clinical outcomes
Abstract
Current biological AI models lack interpretability -- their internal representations do not correspond to biological relationships that researchers can examine. Understanding gene regulation requires models whose learned structure can be directly interrogated to generate experimentally testable hypotheses. CDT-II mirrors the central dogma in its architecture -- DNA self-attention, RNA self-attention, and cross-attention for transcriptional control -- requiring only genomic embeddings and raw per-cell expression. Applied to K562 CRISPRi data with five genes held out entirely, CDT-II predicts perturbation effects (per-gene mean r = 0.84), recovers the GFI1B regulatory network (6.6-fold enrichment, P = 3.5 x 10^{-17}), and shows that cross-attention focuses on ENCODE regulatory elements including CTCF sites (mean 7.67x across 28 targets, P < 0.001). Gradient-based attribution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSingle-cell and spatial transcriptomics · Cell Image Analysis Techniques · Machine Learning in Bioinformatics
