Symbolic and Language Agnostic Large Language Models

Walid S. Saba

arXiv:2308.14199·cs.AI·August 29, 2023

Symbolic and Language Agnostic Large Language Models

Walid S. Saba

PDF

Open Access

TL;DR

This paper proposes a novel approach to large language models by integrating bottom-up reverse engineering with symbolic, language-agnostic, and ontologically grounded methods, addressing limitations of subsymbolic models.

Contribution

It introduces a symbolic framework for large language models that is language-agnostic and ontologically grounded, moving beyond purely subsymbolic approaches.

Findings

01

Symbolic models can effectively capture language knowledge.

02

Ontologically grounded models improve interpretability.

03

Proposed approach addresses inferential limitations of subsymbolic models.

Abstract

We argue that the relative success of large language models (LLMs) is not a reflection on the symbolic vs. subsymbolic debate but a reflection on employing an appropriate strategy of bottom-up reverse engineering of language at scale. However, due to the subsymbolic nature of these models whatever knowledge these systems acquire about language will always be buried in millions of microfeatures (weights) none of which is meaningful on its own. Moreover, and due to their stochastic nature, these models will often fail in capturing various inferential aspects that are prevalent in natural language. What we suggest here is employing the successful bottom-up strategy in a symbolic setting, producing symbolic, language agnostic and ontologically grounded large language models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Language and cultural evolution

MethodsNone · fail