Synthetic emotions and consciousness: exploring architectural boundaries
Hermann Borotschnig

TL;DR
This paper introduces a modular, biologically inspired architecture for synthetic emotion control that aims to avoid features associated with consciousness, providing a framework for safer AI development.
Contribution
It proposes a novel hierarchical architecture with specific constraints to prevent access-related consciousness features, and offers a methodological template for assessing consciousness risks in AI.
Findings
Architecture satisfies all four risk-reduction constraints
Stable modifications can extend the architecture without increasing access risk
Gradual violations of constraints map potential pathways to consciousness
Abstract
As artificial agents display increasingly sophisticated emotion-like behaviors, frameworks for assessing whether such systems risk instantiating consciousness remain limited. This contribution asks whether synthetic emotion-like control can be implemented while deliberately excluding architectural features that major theories associate with access-like consciousness. We propose architectural principles (A1-A8) for a hierarchical, dual-source implementation in which (i) immediate needs generate motivational signals and (ii) episodic memory provides affective guidance from similar past situations; the two sources converge to modulate action selection. To operationalize consciousness-related risk, we distill predictions from major theories into four engineering risk-reduction constraints: (R1) no content-general, workspace-like global broadcast, (R2) no metarepresentation, (R3) no…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
