# A distribution-aware semi-supervised pipeline for cost-effective neuron segmentation

**Authors:** Yanchao Zhang, Hao Zhai, Jinyue Guo, Jing Liu, Qiwei Xie, Hua Han

PMC · DOI: 10.1016/j.isci.2025.114507 · 2025-12-19

## TL;DR

This paper introduces a new semi-supervised method for segmenting neurons in electron microscopy images, which reduces the need for extensive manual labeling.

## Contribution

The novel pipeline addresses distribution mismatch by using unsupervised selection and mixed-view consistency regularization for better segmentation.

## Key findings

- The proposed pipeline improves segmentation by selecting representative sub-volumes for annotation.
- Mixed-view consistency regularization enhances model performance across diverse EM datasets.
- The method reduces proofreading efforts in large-scale connectomic reconstructions.

## Abstract

Semi-supervised learning offers a cost-effective approach for neuron segmentation in electron microscopy (EM) volumes. This technique leverages unlabeled data to regularize supervised training for robust neuron boundary prediction. However, distribution mismatch between labeled and unlabeled data, caused by limited annotations and diverse neuronal structures, limits model generalization. In this study, we develop a distribution-aware pipeline to address the inherent mismatch issue and enhance semi-supervised neuron segmentation in EM volumes. At the data level, we select representative sub-volumes for annotation using an unsupervised measure of distributional similarity, ensuring broad coverage of neuronal structures. At the model level, we encourage consistent predictions across mixed views of labeled and unlabeled data. This design prompts the network to align feature distributions and learn shared semantics. Experiments on diverse EM datasets demonstrate the effectiveness of our method, which holds the potential to reduce proofreading demands and accelerate large-scale connectomic reconstruction efforts.

•Addressing the distribution mismatch inherent in semi-supervised neuron segmentation•An unsupervised heuristic for selecting representative sub-volumes in EM data•A novel semi-supervised framework using mixed-view consistency regularization•Extensive experiments across diverse EM imaging modalities and voxel resolutions

Addressing the distribution mismatch inherent in semi-supervised neuron segmentation

An unsupervised heuristic for selecting representative sub-volumes in EM data

A novel semi-supervised framework using mixed-view consistency regularization

Extensive experiments across diverse EM imaging modalities and voxel resolutions

Health sciences

## Full-text entities

- **Genes:** Igkv4-63 (immunoglobulin kappa variable 4-63) [NCBI Gene 628571] {aka EG628571, Gm6896, ac4}, Adcy3 (adenylate cyclase 3) [NCBI Gene 104111] {aka AC3, ACIII, mKIAA0511}, ADCY4 (adenylate cyclase 4) [NCBI Gene 196883] {aka AC4}, ADCY3 (adenylate cyclase 3) [NCBI Gene 109] {aka AC-III, AC3, BMIQ19}
- **Diseases:** EM (MESH:D028361), CGS (MESH:D019292)
- **Chemicals:** S (MESH:D013455)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606], Drosophila melanogaster (fruit fly, species) [taxon 7227]

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12805337/full.md

---
Source: https://tomesphere.com/paper/PMC12805337