GEM: Graph-Enhanced Mixture-of-Experts with ReAct Agents for Dialogue State Tracking
Ziqi Zhu, Adithya Suresh, Tomal Deb, Iman Abbasnejad

TL;DR
GEM introduces a graph-enhanced mixture-of-experts framework with ReAct agents, significantly improving dialogue state tracking accuracy by combining structured dialogue understanding, dynamic expert routing, and agent-based reasoning.
Contribution
The paper presents a novel architecture that integrates graph neural networks, mixture-of-experts, and ReAct agents for enhanced dialogue state tracking performance.
Findings
GEM achieves 65.19% Joint Goal Accuracy on MultiWOZ 2.2.
Outperforms existing end-to-end LLM approaches and SOTA methods.
Demonstrates the effectiveness of combining structured dialogue representation with expert routing and reasoning.
Abstract
Dialogue State Tracking (DST) requires precise extraction of structured information from multi-domain conversations, a task where Large Language Models (LLMs) struggle despite their impressive general capabilities. We present GEM (Graph-Enhanced Mixture-of-Experts), a novel framework that combines language models and graph-structured dialogue understanding with ReAct agent-based reasoning for superior DST performance. Our approach dynamically routes between specialized experts: a Graph Neural Network that captures dialogue structure and turn-level dependencies, and a finetuned T5-Small encoder-decoder for sequence modeling, coordinated by an intelligent router. For complex value generation tasks, we integrate ReAct agents that perform structured reasoning over dialogue context. On MultiWOZ 2.2, GEM achieves 65.19% Joint Goal Accuracy, substantially outperforming end-to-end LLM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
