Layout Sequence Prediction From Noisy Mobile Modality
Haichao Zhang, Yi Xu, Hongsheng Lu, Takayuki Shimizu, Yun Fu

TL;DR
This paper introduces LTrajDiff, a novel diffusion-based model that predicts pedestrian layout sequences from noisy mobile sensor data, effectively handling obstructed and short sequences in real-world scenarios.
Contribution
It is the first to combine vision with noisy mobile data for layout sequence prediction, addressing challenges of obstruction and short sequences with a diffusion model.
Findings
Achieves state-of-the-art results in obstructed and short sequence experiments.
Effectively predicts layout sequences from noisy mobile sensor data.
Demonstrates robustness in real-world, obstructed environments.
Abstract
Trajectory prediction plays a vital role in understanding pedestrian movement for applications such as autonomous driving and robotics. Current trajectory prediction models depend on long, complete, and accurately observed sequences from visual modalities. Nevertheless, real-world situations often involve obstructed cameras, missed objects, or objects out of sight due to environmental factors, leading to incomplete or noisy trajectories. To overcome these limitations, we propose LTrajDiff, a novel approach that treats objects obstructed or out of sight as equally important as those with fully visible trajectories. LTrajDiff utilizes sensor data from mobile phones to surmount out-of-sight constraints, albeit introducing new challenges such as modality fusion, noisy data, and the absence of spatial layout and object size information. We employ a denoising diffusion model to predict…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion
