What does a system modify when it modifies itself?
Florentin Koch

TL;DR
This paper develops a formal framework to analyze what aspects of a cognitive or artificial system are modified during self-modification, distinguishing different regimes and their implications for human and AI cognition.
Contribution
It introduces a hierarchical structure and regimes of self-modification, providing a novel formal approach to compare biological and artificial self-modifying systems.
Findings
Identifies four regimes of self-modification: action, low-level, structural, and teleological revision.
Reveals a crossing of opacities in human versus artificial systems' self-representation.
Derives four testable predictions and discusses open problems in self-modification.
Abstract
When a cognitive system modifies its own functioning, what exactly does it modify: a low-level rule, a control rule, or the norm that evaluates its own revisions? Cognitive science describes executive control, metacognition, and hierarchical learning with precision, but lacks a formal framework distinguishing these targets of transformation. Contemporary artificial intelligence likewise exhibits self-modification without common criteria for comparison with biological cognition. We show that the question of what counts as a self-modifying system entails a minimal structure: a hierarchy of rules, a fixed core, and a distinction between effective rules, represented rules, and causally accessible rules. Four regimes are identified: (1) action without modification, (2) low-level modification, (3) structural modification, and (4) teleological revision. Each regime is anchored in a cognitive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
