Preventing Conflicting Gradients in Neural Marked Temporal Point   Processes

Tanguy Bosser; Souhaib Ben Taieb

arXiv:2412.08590·cs.LG·December 12, 2024

Preventing Conflicting Gradients in Neural Marked Temporal Point Processes

Tanguy Bosser, Souhaib Ben Taieb

PDF

Open Access

TL;DR

This paper identifies conflicting gradient issues in neural marked temporal point processes during joint training and proposes new parametrizations to separate task learning, improving model performance on real-world datasets.

Contribution

The paper introduces novel parametrizations for neural MTPP models that prevent conflicting gradients by separating task-specific training, enhancing learning stability and accuracy.

Findings

01

Conflicting gradients can degrade neural MTPP training.

02

Separating task modeling improves performance.

03

Experimental results show benefits on real-world datasets.

Abstract

Neural Marked Temporal Point Processes (MTPP) are flexible models to capture complex temporal inter-dependencies between labeled events. These models inherently learn two predictive distributions: one for the arrival times of events and another for the types of events, also known as marks. In this study, we demonstrate that learning a MTPP model can be framed as a two-task learning problem, where both tasks share a common set of trainable parameters that are optimized jointly. We show that this often leads to the emergence of conflicting gradients during training, where task-specific gradients are pointing in opposite directions. When such conflicts arise, following the average gradient can be detrimental to the learning of each individual tasks, resulting in overall degraded performance. To overcome this issue, we introduce novel parametrizations for neural MTPP models that allow for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPoint processes and geometric inequalities

MethodsSparse Evolutionary Training