Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review

Hossein Hassani; Ehsan Hallaji; Roozbeh Razavi-Far; Mehrdad Saif; Liang Lin

arXiv:2411.10268·cs.LG·September 23, 2025

Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review

Hossein Hassani, Ehsan Hallaji, Roozbeh Razavi-Far, Mehrdad Saif, Liang Lin

PDF

Open Access

TL;DR

This comprehensive review discusses recent advances in transfer and inverse reinforcement learning aimed at improving sample efficiency and generalization in reinforcement learning, highlighting key strategies like human-in-the-loop and sim-to-real transfer.

Contribution

It provides an extensive overview of fundamental T-IRL methods and recent advancements, emphasizing strategies for efficient knowledge transfer and low-data training schemes.

Findings

01

Most recent research uses human-in-the-loop strategies.

02

Sim-to-real transfer enhances sample efficiency.

03

Low experience transition schemes are prioritized.

Abstract

Reinforcement learning (RL) is a sub-domain of machine learning, mainly concerned with solving sequential decision-making problems by a learning agent that interacts with the decision environment to improve its behavior through the reward it receives from the environment. This learning paradigm is, however, well-known for being time-consuming due to the necessity of collecting a large amount of data, making RL suffer from sample inefficiency and difficult generalization. Furthermore, the construction of an explicit reward function that accounts for the trade-off between multiple desiderata of a decision problem is often a laborious task. These challenges have been recently addressed utilizing transfer and inverse reinforcement learning (T-IRL). In this regard, this paper is devoted to a comprehensive review of realizing the sample efficiency and generalization of RL algorithms through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics