Neuro-Symbolic Control with Large Language Models for Language-Guided Spatial Tasks
Momina Liaqat Ali, Muhammad Abid, Muhammad Saqlain, Jose M. Merigo

TL;DR
This paper introduces a neuro-symbolic control framework combining large language models and neural controllers to improve language-guided spatial tasks in embodied systems, enhancing stability and efficiency.
Contribution
It presents a modular neuro-symbolic approach that separates semantic reasoning from motion control, outperforming LLM-only methods in success rate and speed.
Findings
Achieves over 70% reduction in steps compared to LLM-only control.
Speeds up control by up to 8.83 times.
Improves robustness and interpretability of language-guided control.
Abstract
Although large language models (LLMs) have recently become effective tools for language-conditioned control in embodied systems, instability, slow convergence, and hallucinated actions continue to limit their direct application to continuous control. A modular neuro-symbolic control framework that clearly distinguishes between low-level motion execution and high-level semantic reasoning is proposed in this work. While a lightweight neural delta controller performs bounded, incremental actions in continuous space, a locally deployed LLM interprets symbolic tasks. We assess the suggested method in a planar manipulation setting with spatial relations between objects specified by language. Numerous tasks and local language models, such as Mistral, Phi, and LLaMA-3.2, are used in extensive experiments to compare LLM-only control, neural-only control, and the suggested LLM+DL framework. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
