Scalable Machines with Intrinsic Higher Mental-State Dynamics
Ahsan Adeel, M. Bilal

TL;DR
This paper introduces a biologically inspired, scalable neural model that enhances computational efficiency and learning speed by mimicking higher mental-state dynamics, applicable to vision tasks like ImageNet classification.
Contribution
It presents a novel mathematically grounded framework linking neurobiological insights to scalable transformer models with improved efficiency.
Findings
Faster learning on ImageNet-1K compared to standard ViT
Reduced computational demand with fewer model components
Operates at approximately linear complexity with input size
Abstract
Drawing on recent breakthroughs in cellular neurobiology and detailed biophysical modeling linking neocortical pyramidal neurons to distinct mental-state regimes, this work introduces a mathematically grounded formulation showing how models (e.g., Transformers) can implement computational principles underlying awake imaginative thought to pre-select relevant information before attention is applied via triadic modulation loops among queries (), keys (), and values ().~Scalability experiments on ImageNet-1K, benchmarked against a standard Vision Transformer (ViT), demonstrate significantly faster learning with reduced computational demand (fewer heads, layers, and tokens), consistent with our prior findings in reinforcement learning and language modeling. The approach operates at approximately complexity with respect to the number of input tokens .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFerroelectric and Negative Capacitance Devices · Neural dynamics and brain function · Advanced Memory and Neural Computing
