CytoCLIP: Learning Cytoarchitectural Characteristics in Developing Human Brain Using Contrastive Language Image Pre-Training
Pralaypati Ta, Sriram Venkatesaperumal, Keerthi Ram, and Mohanasankar Sivaprakasam

TL;DR
CytoCLIP is a novel vision-language model that learns detailed and overall brain cytoarchitecture from histological images, enabling automated brain region identification with high accuracy.
Contribution
This work introduces CytoCLIP, the first contrastive learning framework for joint visual-text representation of brain cytoarchitecture, trained on fetal brain histology data.
Findings
Achieves 0.87 F1 score in whole-region classification
Attains 0.91 F1 score in high-resolution tile classification
Outperforms existing methods in cytoarchitecture recognition
Abstract
The functions of different regions of the human brain are closely linked to their distinct cytoarchitecture, which is defined by the spatial arrangement and morphology of the cells. Identifying brain regions by their cytoarchitecture enables various scientific analyses of the brain. However, delineating these areas manually in brain histological sections is time-consuming and requires specialized knowledge. An automated approach is necessary to minimize the effort needed from human experts. To address this, we propose CytoCLIP, a suite of vision-language models derived from pre-trained Contrastive Language-Image Pre-Training (CLIP) frameworks to learn joint visual-text representations of brain cytoarchitecture. CytoCLIP comprises two model variants: one is trained using low-resolution whole-region images to understand the overall cytoarchitectural pattern of an area, and the other is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFetal and Pediatric Neurological Disorders · AI in cancer detection · Domain Adaptation and Few-Shot Learning
