Characterization of anomalous diffusion through convolutional transformers
Nicol\'as Firbas, \`Oscar Garibo-i-Orts, Miguel \'Angel Garcia-March,, J. Alberto Conejero

TL;DR
This paper introduces a novel convolutional transformer architecture for characterizing anomalous diffusion, outperforming previous methods especially on short trajectories, and is the first to combine transformers with CNNs for this task.
Contribution
The paper presents the first use of transformer models for anomalous diffusion characterization, integrating CNNs and transformers without positional encoding, enabling parallel training and improved accuracy.
Findings
Outperforms state-of-the-art methods on short trajectories
Enables parallel training of diffusion models
First application of transformers in anomalous diffusion analysis
Abstract
The results of the Anomalous Diffusion Challenge (AnDi Challenge) have shown that machine learning methods can outperform classical statistical methodology at the characterization of anomalous diffusion in both the inference of the anomalous diffusion exponent alpha associated with each trajectory (Task 1), and the determination of the underlying diffusive regime which produced such trajectories (Task 2). Furthermore, of the five teams that finished in the top three across both tasks of the AnDi challenge, three of those teams used recurrent neural networks (RNNs). While RNNs, like the long short-term memory (LSTM) network, are effective at learning long-term dependencies in sequential data, their key disadvantage is that they must be trained sequentially. In order to facilitate training with larger data sets, by training in parallel, we propose a new transformer based neural network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Systems and Time Series Analysis · stochastic dynamics and bifurcation · Diffusion and Search Dynamics
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Residual Connection · Dropout · Adam · Dense Connections · Softmax · Label Smoothing
