Model-Free Inference of Investor Preferences: A Relative Entropy IRL Approach

Chen Xu

arXiv:2604.24280·cs.LG·April 28, 2026

TL;DR

This paper applies Relative Entropy Inverse Reinforcement Learning (RE-IRL) to infer investor preferences from market data without requiring known transition probabilities. It estimates the observed behavior policy with a K-nearest neighbor method and validates the results with a statistical testing framework.

Contribution

The paper develops a model-free IRL approach tailored to financial markets, addressing two challenges in investor preference inference: sparse data and the validation of estimated results.

Findings

01. RE-IRL effectively recovers investor reward functions from observed actions.

02. A K-nearest neighbor approach improves behavior policy estimation.

03. A statistical testing framework assesses the robustness of inferred preferences.
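
To make the first finding concrete, here is a minimal sketch of the importance-sampled weight update used in the standard RE-IRL formulation, assuming a reward that is linear in trajectory features. The function names, the feature arrays, and the omission of the regularization term are assumptions made for illustration; the paper's exact objective and feature design may differ.

```python
import numpy as np

def reirl_gradient(theta, expert_features, sample_features,
                   log_behavior_prob, log_base_prob=None):
    """Gradient of an unregularized RE-IRL dual objective (illustrative).

    expert_features:   (n_expert, d) feature sums of observed trajectories
    sample_features:   (n_sample, d) feature sums of sampled trajectories
    log_behavior_prob: (n_sample,) log-probability of each sampled
                       trajectory under the policy that generated it
    log_base_prob:     (n_sample,) log-probability under a baseline policy
                       (assumed constant if omitted)
    """
    if log_base_prob is None:
        log_base_prob = np.zeros(len(sample_features))
    # Importance weights proportional to (base / behavior) * exp(theta . f)
    log_w = log_base_prob - log_behavior_prob + sample_features @ theta
    log_w -= log_w.max()          # subtract the max for numerical stability
    w = np.exp(log_w)
    w /= w.sum()
    # Expert feature mean minus reweighted sample feature mean
    return expert_features.mean(axis=0) - w @ sample_features

def fit_reirl(expert_features, sample_features, log_behavior_prob,
              lr=0.1, steps=500):
    """Plain gradient ascent on the reward weights theta."""
    theta = np.zeros(expert_features.shape[1])
    for _ in range(steps):
        theta += lr * reirl_gradient(
            theta, expert_features, sample_features, log_behavior_prob)
    return theta
```

No transition model appears anywhere in the update: only sampled trajectories and the probability of the policy that produced them are needed, which is what makes the approach model-free.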

Abstract

We present a framework using Relative Entropy Inverse Reinforcement Learning (RE-IRL) to recover investor reward functions from observed investment actions and market conditions. Unlike traditional IRL algorithms, RE-IRL is employed to account for environments where transition probabilities are unknown or inaccessible. To address the challenge of data sparsity, we utilize a $K$-nearest neighbor approach to estimate the observed behavior policy. Furthermore, we propose a statistical testing framework to evaluate the validity and robustness of the estimated results.
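
The $K$-nearest neighbor step in the abstract can be illustrated with a short sketch. It assumes discrete actions, Euclidean distance between market-state vectors, and additive smoothing of the action counts; none of these choices is confirmed by the abstract, and `knn_behavior_policy` is a hypothetical name.

```python
import numpy as np

def knn_behavior_policy(states, actions, query_state, k=5,
                        n_actions=None, smoothing=1.0):
    """Estimate pi(a | query_state) from the k nearest observed states.

    states:  (n, d) array of observed market-state vectors
    actions: (n,) array of integer action labels taken in those states
    """
    states = np.asarray(states, dtype=float)
    actions = np.asarray(actions, dtype=int)
    if n_actions is None:
        n_actions = int(actions.max()) + 1
    k = min(k, len(states))
    # Euclidean distance from the query to every observed state
    dists = np.linalg.norm(states - np.asarray(query_state, dtype=float), axis=1)
    # Indices of the k smallest distances (order within the k is arbitrary)
    nearest = np.argpartition(dists, k - 1)[:k]
    # Action frequencies among the neighbors, with additive smoothing so
    # that unseen actions keep a nonzero probability
    counts = np.bincount(actions[nearest], minlength=n_actions).astype(float)
    return (counts + smoothing) / (counts.sum() + smoothing * n_actions)
```

Pooling actions from nearby states is one way to cope with sparsity: a state observed only once still receives a full probability vector, borrowed from its neighbors.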
