Transforming NLU with Babylon: A Case Study in Development of Real-time, Edge-Efficient, Multi-Intent Translation System for Automated Drive-Thru Ordering
Mostafa Varzaneh, Pooja Voladoddi, Tanmay Bakshi, Uma Gunturi

TL;DR
Babylon is a transformer-based NLU system designed for real-time, edge-efficient multi-intent understanding in noisy environments like drive-thru ordering, outperforming traditional models in accuracy, latency, and memory use.
Contribution
This work introduces Babylon, a novel transformer architecture that handles multi-intent translation and noise robustness for edge NLU applications, with optimized low-latency and memory footprint.
Findings
Babylon achieves superior accuracy-latency trade-offs compared to Flan-T5 and BART.
The LSTM-based token pooling reduces input length and improves robustness.
Babylon effectively manages multi-intent scenarios in noisy, real-time environments.
Abstract
Real-time conversational AI agents face challenges in performing Natural Language Understanding (NLU) in dynamic, outdoor environments like automated drive-thru systems. These settings require NLU models to handle background noise, diverse accents, and multi-intent queries while operating under strict latency and memory constraints on edge devices. Additionally, robustness to errors from upstream Automatic Speech Recognition (ASR) is crucial, as ASR outputs in these environments are often noisy. We introduce Babylon, a transformer-based architecture that tackles NLU as an intent translation task, converting natural language inputs into sequences of regular language units ('transcodes') that encode both intents and slot information. This formulation allows Babylon to manage multi-intent scenarios in a single dialogue turn. Furthermore, Babylon incorporates an LSTM-based token pooling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel-Driven Software Engineering Techniques · Embedded Systems Design Techniques · Robotic Path Planning Algorithms
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Adam · Residual Connection · Softmax · Multi-Head Attention · Dense Connections · Dropout
