Reward-Punishment Symmetric Universal Intelligence

Samuel Allen Alexander; Marcus Hutter

arXiv:2110.02450·cs.AI·October 7, 2021

Reward-Punishment Symmetric Universal Intelligence

Samuel Allen Alexander, Marcus Hutter

PDF

TL;DR

This paper extends the Legg-Hutter framework to include punishments, demonstrating that under certain symmetries, agents can have negative or zero intelligence levels, challenging traditional notions of intelligence measurement.

Contribution

It introduces a symmetry-based extension to the Legg-Hutter measure, allowing for negative and zero intelligence values under specific conditions.

Findings

01

Intelligence can be negative with punishment considerations.

02

Symmetry conditions lead to a zero intelligence baseline.

03

Reward-ignoring agents have zero Legg-Hutter intelligence.

Abstract

Can an agent's intelligence level be negative? We extend the Legg-Hutter agent-environment framework to include punishments and argue for an affirmative answer to that question. We show that if the background encodings and Universal Turing Machine (UTM) admit certain Kolmogorov complexity symmetries, then the resulting Legg-Hutter intelligence measure is symmetric about the origin. In particular, this implies reward-ignoring agents have Legg-Hutter intelligence 0 according to such UTMs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.