Learning to Sense for Driving: Joint Optics-Sensor-Model Co-Design for Semantic Segmentation

Reeshad Khan; John Gauch

arXiv:2512.20815·cs.CV·December 29, 2025

Learning to Sense for Driving: Joint Optics-Sensor-Model Co-Design for Semantic Segmentation

Reeshad Khan, John Gauch

PDF

Open Access

TL;DR

This paper introduces a unified end-to-end RAW-to-task co-design framework for autonomous driving, optimizing optics, sensors, and segmentation models jointly to improve robustness and efficiency in perception tasks.

Contribution

It presents a novel joint optimization approach integrating realistic sensor models with lightweight segmentation networks, enhancing robustness and accuracy in autonomous driving perception pipelines.

Findings

01

Improved mIoU on KITTI-360 dataset with co-designed sensors.

02

Robustness gains under blur, noise, and low-bit conditions.

03

Achieves real-time performance with a compact model (~1M parameters).

Abstract

Traditional autonomous driving pipelines decouple camera design from downstream perception, relying on fixed optics and handcrafted ISPs that prioritize human viewable imagery rather than machine semantics. This separation discards information during demosaicing, denoising, or quantization, while forcing models to adapt to sensor artifacts. We present a task-driven co-design framework that unifies optics, sensor modeling, and lightweight semantic segmentation networks into a single end-to-end RAW-to-task pipeline. Building on DeepLens[19], our system integrates realistic cellphone-scale lens models, learnable color filter arrays, Poisson-Gaussian noise processes, and quantization, all optimized directly for segmentation objectives. Evaluations on KITTI-360 show consistent mIoU improvements over fixed pipelines, with optics modeling and CFA learning providing the largest gains,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCCD and CMOS Imaging Sensors · Advanced Neural Network Applications · Neural Networks and Reservoir Computing