Tangma: A Tanh-Guided Activation Function with Learnable Parameters

Shreel Golwala

arXiv:2507.10560·cs.NE·July 16, 2025

Tangma: A Tanh-Guided Activation Function with Learnable Parameters

Shreel Golwala

PDF

Open Access

TL;DR

Tangma is a novel learnable activation function combining the benefits of tanh with adjustable parameters, leading to improved training stability and accuracy in deep neural networks for vision tasks.

Contribution

This paper introduces Tangma, a new activation function with learnable parameters that enhance training stability and performance in deep neural networks.

Findings

01

Achieved 99.09% accuracy on MNIST

02

Outperformed ReLU, Swish, and GELU on CIFAR-10

03

Enabled faster and more stable training

Abstract

Activation functions are key to effective backpropagation and expressiveness in deep neural networks. This work introduces Tangma, a new activation function that combines the smooth shape of the hyperbolic tangent with two learnable parameters: $α$ , which shifts the curve's inflection point to adjust neuron activation, and $γ$ , which adds linearity to preserve weak gradients and improve training stability. Tangma was evaluated on MNIST and CIFAR-10 using custom networks composed of convolutional and linear layers, and compared against ReLU, Swish, and GELU. On MNIST, Tangma achieved the highest validation accuracy of 99.09% and the lowest validation loss, demonstrating faster and more stable convergence than the baselines. On CIFAR-10, Tangma reached a top validation accuracy of 78.15%, outperforming all other activation functions while maintaining a competitive training loss.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition