Tri-Transformer Hawkes Process: Three Heads are better than one
Zhi-yan Song, Jian-wei Liu, Lu-ning Zhang, and Ya-nan Han

TL;DR
This paper introduces the Tri-Transformer Hawkes Process, a novel model that enhances event sequence modeling by integrating three transformer heads to better utilize event time and type information, outperforming previous methods.
Contribution
The paper proposes a tri-transformer architecture that effectively incorporates event and time information, reducing learning bias and improving performance over existing transformer Hawkes processes.
Findings
Tri-THP outperforms existing models on real-world data.
Incorporating multiple transformer heads improves learning accuracy.
The model effectively utilizes event time and type information.
Abstract
Abstract. Most of the real world data we encounter are asynchronous event sequence, so the last decades have been characterized by the implementation of various point process into the field of social networks,electronic medical records and financial transactions. At the beginning, Hawkes process and its variants which can simulate simultaneously the self-triggering and mutual triggering patterns between different events in complex sequences in a clear and quantitative way are more popular.Later on, with the advances of neural network, neural Hawkes process has been proposed one after another, and gradually become a research hotspot. The proposal of the transformer Hawkes process (THP) has gained a huge performance improvement, so a new upsurge of the neural Hawkes process based on transformer is set off. However, THP does not make full use of the information of occurrence time and type…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHedgehog Signaling Pathway Studies · Medical Image Segmentation Techniques · Diffusion and Search Dynamics
