Cooperative System Identification via Correctional Learning
In\^es Louren\c{c}o, Robert Mattila, Cristian R. Rojas, Bo Wahlberg

TL;DR
This paper introduces correctional learning, where a teacher modifies system observations to help a student learn more effectively, especially when their models differ, with theoretical and finite-sample analysis.
Contribution
It proposes correctional learning as a novel approach for cooperative system identification involving model mismatch and provides an optimization framework.
Findings
Optimization-based correctional learning enhances student estimation accuracy.
Finite-sample bounds show reduced estimator variance with teacher assistance.
Applicable to multinomial and binomial systems.
Abstract
We consider a cooperative system identification scenario in which an expert agent (teacher) knows a correct, or at least a good, model of the system and aims to assist a learner-agent (student), but cannot directly transfer its knowledge to the student. For example, the teacher's knowledge of the system might be abstract or the teacher and student might be employing different model classes, which renders the teacher's parameters uninformative to the student. In this paper, we propose correctional learning as an approach to the above problem: Suppose that in order to assist the student, the teacher can intercept the observations collected from the system and modify them to maximize the amount of information the student receives about the system. We formulate a general solution as an optimization problem, which for a multinomial system instantiates itself as an integer program.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Advanced Statistical Process Monitoring · Insect Pheromone Research and Control
