Learning to Sense for Driving: Joint Optics-Sensor-Model Co-Design for Semantic Segmentation
Reeshad Khan, John Gauch

TL;DR
This paper introduces a unified end-to-end RAW-to-task co-design framework for autonomous driving, optimizing optics, sensors, and segmentation models jointly to improve robustness and efficiency in perception tasks.
Contribution
It presents a novel joint optimization approach integrating realistic sensor models with lightweight segmentation networks, enhancing robustness and accuracy in autonomous driving perception pipelines.
Findings
Improved mIoU on KITTI-360 dataset with co-designed sensors.
Robustness gains under blur, noise, and low-bit conditions.
Achieves real-time performance with a compact model (~1M parameters).
Abstract
Traditional autonomous driving pipelines decouple camera design from downstream perception, relying on fixed optics and handcrafted ISPs that prioritize human viewable imagery rather than machine semantics. This separation discards information during demosaicing, denoising, or quantization, while forcing models to adapt to sensor artifacts. We present a task-driven co-design framework that unifies optics, sensor modeling, and lightweight semantic segmentation networks into a single end-to-end RAW-to-task pipeline. Building on DeepLens[19], our system integrates realistic cellphone-scale lens models, learnable color filter arrays, Poisson-Gaussian noise processes, and quantization, all optimized directly for segmentation objectives. Evaluations on KITTI-360 show consistent mIoU improvements over fixed pipelines, with optics modeling and CFA learning providing the largest gains,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Advanced Neural Network Applications · Neural Networks and Reservoir Computing
