Tangma: A Tanh-Guided Activation Function with Learnable Parameters
Shreel Golwala

TL;DR
Tangma is a novel learnable activation function combining the benefits of tanh with adjustable parameters, leading to improved training stability and accuracy in deep neural networks for vision tasks.
Contribution
This paper introduces Tangma, a new activation function with learnable parameters that enhance training stability and performance in deep neural networks.
Findings
Achieved 99.09% accuracy on MNIST
Outperformed ReLU, Swish, and GELU on CIFAR-10
Enabled faster and more stable training
Abstract
Activation functions are key to effective backpropagation and expressiveness in deep neural networks. This work introduces Tangma, a new activation function that combines the smooth shape of the hyperbolic tangent with two learnable parameters: , which shifts the curve's inflection point to adjust neuron activation, and , which adds linearity to preserve weak gradients and improve training stability. Tangma was evaluated on MNIST and CIFAR-10 using custom networks composed of convolutional and linear layers, and compared against ReLU, Swish, and GELU. On MNIST, Tangma achieved the highest validation accuracy of 99.09% and the lowest validation loss, demonstrating faster and more stable convergence than the baselines. On CIFAR-10, Tangma reached a top validation accuracy of 78.15%, outperforming all other activation functions while maintaining a competitive training loss.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition
