Universal Transformer Hawkes Process with Adaptive Recursive Iteration
Lu-ning Zhang, Jian-wei Liu, Zhi-yan Song, Xin Zuo

TL;DR
This paper introduces the Universal Transformer Hawkes Process (UTHP), combining recursive mechanisms, self-attention, and CNNs to enhance modeling of asynchronous event sequences, achieving improved performance over existing models.
Contribution
The paper proposes a novel universal transformer Hawkes process model with recursive and self-attention mechanisms, plus CNN integration, to better capture local and global features in event data.
Findings
UTHP outperforms previous state-of-the-art models on multiple datasets.
Recursive mechanisms improve model performance.
Incorporating CNN enhances local perception in the model.
Abstract
Asynchronous events sequences are widely distributed in the natural world and human activities, such as earthquakes records, users activities in social media and so on. How to distill the information from these seemingly disorganized data is a persistent topic that researchers focus on. The one of the most useful model is the point process model, and on the basis, the researchers obtain many noticeable results. Moreover, in recent years, point process models on the foundation of neural networks, especially recurrent neural networks (RNN) are proposed and compare with the traditional models, their performance are greatly improved. Enlighten by transformer model, which can learning sequence data efficiently without recurrent and convolutional structure, transformer Hawkes process is come out, and achieves state-of-the-art performance. However, there is some research proving that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHedgehog Signaling Pathway Studies · Point processes and geometric inequalities
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Layer Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · Residual Connection · Dropout · Softmax · Attention Dropout
