Synthetic and real-world datasets for crosswalk segmentation under diverse weather and lighting conditions
Krešimir Romić, Hrvoje Leventić, Marija Habijan, Irena Galić

TL;DR
This paper introduces a new dataset for crosswalk segmentation, combining synthetic and real-world images under various weather and lighting conditions to aid assistive technologies for the visually impaired.
Contribution
The novelty lies in the creation of a diverse crosswalk segmentation dataset with both synthetic and real-world images under varied environmental conditions.
Findings
The synthetic dataset includes 3000 images generated using a fine-tuned Stable Diffusion model with different environmental prompts.
The real-world dataset contains 300 images from chest-mounted smartphone recordings, distributed across sunny, cloudy, rainy, and night conditions.
All images were manually annotated with crosswalk regions as binary masks using a custom interface.
Abstract
This article presents a new dataset for crosswalk segmentation targeting assistive technologies for visually impaired individuals. The dataset combines synthetic and real-world first-person view images with corresponding binary segmentation masks. The synthetic portion contains 3000 images generated using a fine-tuned Stable Diffusion model, with 1500 images created using a standard prompt ("a crosswalk image") and 1500 additional images incorporating various environmental conditions (sunny, cloudy, rainy, and night) through specialized prompts. The real-world component comprises 300 images extracted from chest-mounted smartphone video recordings of pedestrians approaching crosswalks, carefully distributed across different environmental conditions (120 sunny, 60 cloudy, 60 rainy, and 60 night images). To ensure diversity, each physical crosswalk location appears in at most two images…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutomated Road and Building Extraction · Video Surveillance and Tracking Methods · Remote Sensing and LiDAR Applications
