DuIVRS-2: An LLM-based Interactive Voice Response System for Large-scale POI Attribute Acquisition
Le Zhang, Shengming Zhang, Rui Zha, Yunpeng Wu, Jingbo Zhou, Jizhou Huang

TL;DR
DuIVRS-2 is an LLM-based interactive voice response system that efficiently acquires POI attributes at scale, outperforming previous methods with high success rate and low latency.
Contribution
The paper introduces a novel end-to-end LLM framework with data augmentation, dialogue management, and iterative learning for large-scale POI attribute collection in industry.
Findings
Processed 0.4 million calls daily in production.
Achieved 83.9% task success rate, 4% higher than previous system.
Maintained low reaction time of 130ms.
Abstract
Accurate Point of Interest (POI) attribute acquisition is essential for location-based services, yet traditional modular Interactive Voice Response (IVR) systems suffer from error accumulation and high maintenance overhead. We present DuIVRS-2, a large language model (LLM)-based end-to-end framework designed for large-scale POI attribute acquisition at Baidu Maps. To address the long-tail distribution of real-world interactions, our methodology first employs a finite state machine (FSM)-guided data augmentation strategy to synthesize a balanced and diverse training dataset. We then streamline dialogue management via a selective generation scheme combined with a Chain-of-Thought (CoT) mechanism, which ensures output stability and effectively eliminates hallucinations in industrial settings. To facilitate continuous policy refinement with minimal manual effort, we design a cooperative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
