Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches
Alhassan Mumuni, Fuseini Mumuni

TL;DR
This survey reviews foundational principles and approaches for developing large language models towards achieving artificial general intelligence, emphasizing embodiment, symbol grounding, causality, and memory.
Contribution
It provides a comprehensive overview of how core cognitive principles can be integrated into LLMs to advance towards AGI.
Findings
Current LLMs have superficial and brittle cognitive abilities.
Foundational problems like embodiment and causality are crucial for AGI.
Survey of state-of-the-art methods addressing these foundational issues.
Abstract
Generative artificial intelligence (AI) systems based on large-scale pretrained foundation models (PFMs) such as vision-language models, large language models (LLMs), diffusion models and vision-language-action (VLA) models have demonstrated the ability to solve complex and truly non-trivial AI problems in a wide variety of domains and contexts. Multimodal large language models (MLLMs), in particular, learn from vast and diverse data sources, allowing rich and nuanced representations of the world and, thereby, providing extensive capabilities, including the ability to reason, engage in meaningful dialog; collaborate with humans and other agents to jointly solve complex problems; and understand social and emotional aspects of humans. Despite this impressive feat, the cognitive abilities of state-of-the-art LLMs trained on large-scale datasets are still superficial and brittle.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsDiffusion
