Reinforcement Learning for Scalable and Trustworthy Intelligent Systems
Guangchen Lan

TL;DR
This paper discusses advancing reinforcement learning to improve scalability in distributed systems and trustworthiness in language models, focusing on federated optimization and safety alignment.
Contribution
It introduces four key contributions that enhance reinforcement learning's scalability and trustworthiness through federated methods and safety measures.
Findings
Developed communication-efficient federated optimization techniques.
Improved alignment of language models with human preferences.
Reduced inappropriate information disclosure in language-based systems.
Abstract
Reinforcement learning has become a powerful paradigm for improving the capability of intelligent systems, but its practical deployment faces two central challenges. First, reinforcement learning must scale efficiently in distributed environments where communication bandwidth is limited and computation is heterogeneous across agents. Second, as reinforcement learning is increasingly used in post-training large language models and autonomous agents, the optimized policies must also be aligned with human preferences and satisfy safety requirements such as privacy-aware information disclosure. This dissertation addresses both challenges through four complementary contributions spanning federated optimization, preference alignment, and contextual safety. The first part of the dissertation studies scalable reinforcement learning in federated settings. The second part of the dissertation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
