# Q-Cogni: An Integrated Causal Reinforcement Learning Framework

**Authors:** Cris Cunha, Wei Liu, Tim French, Ajmal Mian

arXiv: 2302.13240 · 2023-02-28

## TL;DR

Q-Cogni introduces a causal reinforcement learning framework that integrates causal structure discovery into Q-Learning, enhancing learning efficiency, interpretability, and policy quality in complex environments like vehicle routing.

## Contribution

It presents a novel causal reinforcement learning framework that combines causal inference with Q-Learning, enabling better decision-making and interpretability in high-dimensional problems.

## Key findings

- Q-Cogni outperforms state-of-the-art algorithms in VRP tasks.
- It achieves 85% success rate in real-world taxi routing.
- The framework improves learning efficiency and interpretability.

## Abstract

We present Q-Cogni, an algorithmically integrated causal reinforcement learning framework that redesigns Q-Learning with an autonomous causal structure discovery method to improve the learning process with causal inference. Q-Cogni achieves optimal learning with a pre-learned structural causal model of the environment that can be queried during the learning process to infer cause-and-effect relationships embedded in a state-action space. We leverage on the sample efficient techniques of reinforcement learning, enable reasoning about a broader set of policies and bring higher degrees of interpretability to decisions made by the reinforcement learning agent. We apply Q-Cogni on the Vehicle Routing Problem (VRP) and compare against state-of-the-art reinforcement learning algorithms. We report results that demonstrate better policies, improved learning efficiency and superior interpretability of the agent's decision making. We also compare this approach with traditional shortest-path search algorithms and demonstrate the benefits of our causal reinforcement learning framework to high dimensional problems. Finally, we apply Q-Cogni to derive optimal routing decisions for taxis in New York City using the Taxi & Limousine Commission trip record data and compare with shortest-path search, reporting results that show 85% of the cases with an equal or better policy derived from Q-Cogni in a real-world domain.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2302.13240/full.md

## Figures

19 figures with captions in the complete paper: https://tomesphere.com/paper/2302.13240/full.md

## References

45 references — full list in the complete paper: https://tomesphere.com/paper/2302.13240/full.md

---
Source: https://tomesphere.com/paper/2302.13240