Learning Mobile Robot Based on Adaptive Controlled Markov Chains
Valery Vilisov

TL;DR
This paper introduces an adaptive control algorithm for mobile robots that learns from a decision taker's preferences using Markov chains, allowing for flexible setup and improved adaptation during operation.
Contribution
It presents a novel adaptive learning algorithm based on Markov chains that models and adjusts to a decision taker's preferences in robot control.
Findings
Model parameters can be effectively set up using simulation data.
The algorithm demonstrates good adaptation to the decision taker's preferences.
The approach is flexible for setup during operation or testing.
Abstract
Herein we suggest a mobile robot-training algorithm that is based on the preference approximation of the decision taker who controls the robot, which in its turn is managed by the Markov chain. Setup of the model parameters is made on the basis of the data referring to the situations and decisions involving the decision taker. The model that adapts to the decision taker's preferences can be set up either a priori, during the process of the robot's normal operation, or during specially planned testing sessions. Basing on the simulation modelling data of the robot's operation process and on the decision taker's robot control we have set up the model parameters thus illustrating both working capacity of all algorithm components and adaptation effectiveness.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
