Beyond the Black Box: A Cognitive Architecture for Explainable and Aligned AI

Hu Keyi

arXiv:2512.03072·cs.AI·December 9, 2025

Beyond the Black Box: A Cognitive Architecture for Explainable and Aligned AI

Hu Keyi

PDF

Open Access

TL;DR

This paper presents 'Weight-Calculatism,' a novel cognitive architecture that enhances explainability and value alignment in AI, aiming to advance toward trustworthy Artificial General Intelligence through atomic cognition and traceable decision-making.

Contribution

It introduces a first-principles-based architecture with atomic cognition and an interpretable weight-calculation model, enabling radical explainability and intrinsic alignment for AGI.

Findings

01

Achieves transparent, human-like reasoning in complex scenarios

02

Demonstrates robust learning and decision traceability

03

Provides a practical foundation for trustworthy AGI

Abstract

Current AI paradigms, as "architects of experience," face fundamental challenges in explainability and value alignment. This paper introduces "Weight-Calculatism," a novel cognitive architecture grounded in first principles, and demonstrates its potential as a viable pathway toward Artificial General Intelligence (AGI). The architecture deconstructs cognition into indivisible Logical Atoms and two fundamental operations: Pointing and Comparison. Decision-making is formalized through an interpretable Weight-Calculation model (Weight = Benefit * Probability), where all values are traceable to an auditable set of Initial Weights. This atomic decomposition enables radical explainability, intrinsic generality for novel situations, and traceable value alignment. We detail its implementation via a graph-algorithm-based computational engine and a global workspace workflow, supported by a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications · Ethics and Social Impacts of AI