Deep Reinforcement Learning of Marked Temporal Point Processes

Utkarsh Upadhyay; Abir De; Manuel Gomez-Rodriguez

arXiv:1805.09360·cs.LG·November 7, 2018·34 cites

Deep Reinforcement Learning of Marked Temporal Point Processes

Utkarsh Upadhyay, Abir De, Manuel Gomez-Rodriguez

PDF

Open Access 1 Repo

TL;DR

This paper introduces a deep reinforcement learning framework for marked temporal point processes, enabling online interventions in asynchronous environments with complex feedback, demonstrated through personalized teaching and viral marketing applications.

Contribution

It develops a novel policy gradient method that models actions and feedback as marked temporal point processes using deep neural networks, without assuming specific functional forms.

Findings

01

Outperforms existing methods in personalized teaching scenarios.

02

Effective in viral marketing applications with real-world data.

03

Flexible approach handling complex reward functions.

Abstract

In a wide variety of applications, humans interact with a complex environment by means of asynchronous stochastic discrete events in continuous time. Can we design online interventions that will help humans achieve certain goals in such asynchronous setting? In this paper, we address the above problem from the perspective of deep reinforcement learning of marked temporal point processes, where both the actions taken by an agent and the feedback it receives from the environment are asynchronous stochastic discrete events characterized using marked temporal point processes. In doing so, we define the agent's policy using the intensity and mark distribution of the corresponding process and then derive a flexible policy gradient method, which embeds the agent's actions and the feedback it receives into real-valued vectors using deep recurrent neural networks. Our method does not make any…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Networks-Learning/tpprl
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInnovation Diffusion and Forecasting