LaplacianFormer:Rethinking Linear Attention with Laplacian Kernel

Zhe Feng; Sen Lian; Changwei Wang; Muyang Zhang; Tianlong Tan; Rongtao Xu; Weiliang Meng; and Xiaopeng Zhang

arXiv:2604.20368·cs.CV·April 23, 2026

LaplacianFormer:Rethinking Linear Attention with Laplacian Kernel

Zhe Feng, Sen Lian, Changwei Wang, Muyang Zhang, Tianlong Tan, Rongtao Xu, Weiliang Meng, and Xiaopeng Zhang

PDF

1 Video

TL;DR

LaplacianFormer introduces a Laplacian kernel-based Transformer that improves efficiency and expressiveness for high-resolution vision tasks, with theoretical grounding and practical CUDA implementations.

Contribution

It proposes a Laplacian kernel as a principled alternative to softmax in Transformers, along with a provably injective feature map and efficient Nyström approximation for better performance.

Findings

01

Achieves strong performance-efficiency trade-offs on ImageNet.

02

Retains fine-grained token information with a new injective feature map.

03

Utilizes efficient CUDA implementations for high-throughput processing.

Abstract

The quadratic complexity of softmax attention presents a major obstacle for scaling Transformers to high-resolution vision tasks. Existing linear attention variants often replace the softmax with Gaussian kernels to reduce complexity, but such approximations lack theoretical grounding and tend to oversuppress mid-range token interactions. We propose LaplacianFormer, a Transformer variant that employs a Laplacian kernel as a principled alternative to softmax, motivated by empirical observations and theoretical analysis. To address expressiveness degradation under low-rank approximations, we introduce a provably injective feature map that retains fine-grained token information. For efficient computation, we adopt a Nystr\"om approximation of the kernel matrix and solve the resulting system using Newton--Schulz iteration, avoiding costly matrix inversion and SVD. We further develop custom…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

LaplacianFormer:Rethinking Linear Attention with Laplacian Kernel· slideslive