CIRO7.2: A Material Network with Circularity of -7.2 and Reinforcement-Learning-Controlled Robotic Disassembler
Federico Zocco, Monica Malvezzi

TL;DR
This paper models a thermodynamical material network for circular economy using reinforcement learning-controlled robotic disassembly, achieving improved circularity metrics and providing principles for circular intelligence and robotics.
Contribution
It introduces a novel thermodynamical framework for circularity and applies RL to optimize robotic disassembly, advancing circular economy research.
Findings
Maximum circularity of -2.1 for disassembling 2 parts
Circularity reduces to -7.2 with more parts disassembled
RL controller performance positively impacts circularity based on material criticality
Abstract
The competition over natural reserves of minerals is expected to increase in part because of the linear-economy paradigm based on take-make-dispose. Simultaneously, the linear economy considers end-of-use products as waste rather than as a resource, which results in large volumes of waste whose management remains an unsolved problem. Since a transition to a circular economy can mitigate these open issues, in this paper we begin by enhancing the notion of circularity based on compartmental dynamical thermodynamics, namely, , and then, we model a thermodynamical material network processing a batch of 2 solid materials of criticality coefficients of 0.1 and 0.95, with a robotic disassembler compartment controlled via reinforcement learning (RL), and processing 2-7 kg of materials. Subsequently, we focused on the design of the robotic disassembler compartment using state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModular Robots and Swarm Intelligence
