Universal Trading for Order Execution with Oracle Policy Distillation
Yuchen Fang, Kan Ren, Weiqing Liu, Dong Zhou, Weinan Zhang, Jiang, Bian, Yong Yu, Tie-Yan Liu

TL;DR
This paper introduces a universal trading policy framework using oracle policy distillation to improve order execution efficiency under noisy market conditions, outperforming existing methods.
Contribution
It presents a novel reinforcement learning framework that distills an oracle policy to enhance order execution strategies in noisy market environments.
Findings
Significant performance improvements over strong baselines.
Effective guidance of policy learning via oracle distillation.
Robustness to noisy and imperfect market information.
Abstract
As a fundamental problem in algorithmic trading, order execution aims at fulfilling a specific trading order, either liquidation or acquirement, for a given instrument. Towards effective execution strategy, recent years have witnessed the shift from the analytical view with model-based market assumptions to model-free perspective, i.e., reinforcement learning, due to its nature of sequential decision optimization. However, the noisy and yet imperfect market information that can be leveraged by the policy has made it quite challenging to build up sample efficient reinforcement learning methods to achieve effective order execution. In this paper, we propose a novel universal trading policy optimization framework to bridge the gap between the noisy yet imperfect market states and the optimal action sequences for order execution. Particularly, this framework leverages a policy distillation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStock Market Forecasting Methods · Auction Theory and Applications · Advanced Bandit Algorithms Research
