ParkSense: Where Should a Delivery Driver Park? Leveraging Idle AV Compute and Vision-Language Models

Die Hu; Henan Li

arXiv:2604.07912·cs.CV·April 10, 2026

ParkSense: Where Should a Delivery Driver Park? Leveraging Idle AV Compute and Vision-Language Models

Die Hu, Henan Li

PDF

TL;DR

ParkSense is a system that uses idle autonomous vehicle compute to identify optimal parking spots near merchant entrances, reducing delivery times and increasing driver income.

Contribution

It introduces the DAPP problem, leveraging vision-language models on satellite and street view images for precise parking spot selection.

Findings

01

A 7B VLM completes inference in 4-8 seconds on HW4 hardware.

02

Estimated annual income gains per driver are $3,000-$8,000 in the U.S.

03

ParkSense effectively identifies legal parking zones and entrances.

Abstract

Finding parking consumes a disproportionate share of food delivery time, yet no system addresses precise parking-spot selection relative to merchant entrances. We propose ParkSense, a framework that repurposes idle compute during low-risk AV states -- queuing at red lights, traffic congestion, parking-lot crawl -- to run a Vision-Language Model (VLM) on pre-cached satellite and street view imagery, identifying entrances and legal parking zones. We formalize the Delivery-Aware Precision Parking (DAPP) problem, show that a quantized 7B VLM completes inference in 4-8 seconds on HW4-class hardware, and estimate annual per-driver income gains of 3,000-8,000 USD in the U.S. Five open research directions are identified at this unexplored intersection of autonomous driving, computer vision, and last-mile logistics.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.