ArcGate: Adaptive Arctangent Gated Activation
Avik Bhattacharya, Siddhant Dnyanesh Gole, Subhasis Chaudhuri, Alejandro C. Frery, Biplab Banerjee

TL;DR
ArcGate introduces a flexible, learnable activation function with seven parameters per layer, optimizing non-linearity for improved performance and robustness in remote sensing neural networks.
Contribution
It proposes a novel adaptive activation function, ArcGate, that outperforms fixed-shape activations and adapts to data and network depth.
Findings
ArcGate achieves 99.67% accuracy on PatternNet.
It maintains a 26.65% performance advantage over ReLU under Gaussian noise.
Learned parameters show depth-dependent functional evolution.
Abstract
Activation functions are central to deep networks, influencing non-linearity, feature learning, convergence, and robustness. This paper proposes the Adaptive Arctangent Gated Activation (ArcGate) function, a flexible formulation that generates a broad spectrum of activation shapes via a three-stage non-linear transformation. Unlike conventional fixed-shape activations such as ReLU, GELU, or SiLU, ArcGate uses seven learnable parameters per layer, allowing the neural network to autonomously optimize its non-linearity to the specific requirements of the feature hierarchy and data distribution. We evaluate ArcGate using ResNet-50 and Vision Transformer (ViT-B/16) architectures on three widely used remote sensing benchmarks: PatternNet, UC Merced Land Use, and the 13-band EuroSAT MSI multispectral dataset. Experimental results show that ArcGate consistently outperforms standard baselines,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
