What's in an embedding? Would a rose by any embedding smell as sweet?
Venkat Venkatasubramanian

TL;DR
This paper argues that LLMs develop a geometric form of understanding that is useful but limited, and proposes integrating symbolic AI to create more reliable, explainable, and reasoning-capable knowledge models.
Contribution
It introduces the concept of combining geometric and algebraic representations to enhance LLMs into comprehensive knowledge models with reasoning and explanation abilities.
Findings
LLMs have a geometry-like understanding suitable for many applications.
Current LLMs are unreliable and lack inference and explanation capabilities.
Integrating symbolic AI can create more robust and explainable knowledge models.
Abstract
Large Language Models (LLMs) are often criticized for lacking true "understanding" and the ability to "reason" with their knowledge, being seen merely as autocomplete systems. We believe that this assessment might be missing a nuanced insight. We suggest that LLMs do develop a kind of empirical "understanding" that is "geometry"-like, which seems adequate for a range of applications in NLP, computer vision, coding assistance, etc. However, this "geometric" understanding, built from incomplete and noisy data, makes them unreliable, difficult to generalize, and lacking in inference capabilities and explanations, similar to the challenges faced by heuristics-based expert systems decades ago. To overcome these limitations, we suggest that LLMs should be integrated with an "algebraic" representation of knowledge that includes symbolic AI elements used in expert systems. This integration…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPlant and animal studies · Plant Physiology and Cultivation Studies
