Reinforcement Learning in hyperbolic space for multi-step reasoning
Tao Xu, Dung-Yang Lee, Momiao Xiong

TL;DR
This paper introduces a hyperbolic Transformer-based reinforcement learning framework that significantly improves multi-step reasoning accuracy and efficiency by modeling hierarchical structures more effectively.
Contribution
It presents a novel integration of hyperbolic Transformers into RL, enhancing performance on complex reasoning tasks with theoretical insights and empirical validation.
Findings
Accuracy improved by 32-44% on FrontierMath benchmark
Accuracy improved by 43-45% on nonlinear optimal control benchmark
Computational time reduced by 16-32% on FrontierMath benchmark
Abstract
Multi-step reasoning is a fundamental challenge in artificial intelligence, with applications ranging from mathematical problem-solving to decision-making in dynamic environments. Reinforcement Learning (RL) has shown promise in enabling agents to perform multi-step reasoning by optimizing long-term rewards. However, conventional RL methods struggle with complex reasoning tasks due to issues such as credit assignment, high-dimensional state representations, and stability concerns. Recent advancements in Transformer architectures and hyperbolic geometry have provided novel solutions to these challenges. This paper introduces a new framework that integrates hyperbolic Transformers into RL for multi-step reasoning. The proposed approach leverages hyperbolic embeddings to model hierarchical structures effectively. We present theoretical insights, algorithmic details, and experimental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputability, Logic, AI Algorithms · Cellular Automata and Applications · Reinforcement Learning in Robotics
