Synthetic and real-world datasets for crosswalk segmentation under diverse weather and lighting conditions

Krešimir Romić; Hrvoje Leventić; Marija Habijan; Irena Galić

PMC · DOI:10.1016/j.dib.2025.111755·June 7, 2025

Synthetic and real-world datasets for crosswalk segmentation under diverse weather and lighting conditions

Krešimir Romić, Hrvoje Leventić, Marija Habijan, Irena Galić

PDF

Open Access

TL;DR

This paper introduces a new dataset for crosswalk segmentation, combining synthetic and real-world images under various weather and lighting conditions to aid assistive technologies for the visually impaired.

Contribution

The novelty lies in the creation of a diverse crosswalk segmentation dataset with both synthetic and real-world images under varied environmental conditions.

Findings

01

The synthetic dataset includes 3000 images generated using a fine-tuned Stable Diffusion model with different environmental prompts.

02

The real-world dataset contains 300 images from chest-mounted smartphone recordings, distributed across sunny, cloudy, rainy, and night conditions.

03

All images were manually annotated with crosswalk regions as binary masks using a custom interface.

Abstract

This article presents a new dataset for crosswalk segmentation targeting assistive technologies for visually impaired individuals. The dataset combines synthetic and real-world first-person view images with corresponding binary segmentation masks. The synthetic portion contains 3000 images generated using a fine-tuned Stable Diffusion model, with 1500 images created using a standard prompt ("a crosswalk image") and 1500 additional images incorporating various environmental conditions (sunny, cloudy, rainy, and night) through specialized prompts. The real-world component comprises 300 images extracted from chest-mounted smartphone video recordings of pedestrians approaching crosswalks, carefully distributed across different environmental conditions (120 sunny, 60 cloudy, 60 rainy, and 60 night images). To ensure diversity, each physical crosswalk location appears in at most two images…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Diseases1

visually impaired

Figures4

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutomated Road and Building Extraction · Video Surveillance and Tracking Methods · Remote Sensing and LiDAR Applications