APTx: better activation function than MISH, SWISH, and ReLU's variants used in deep learning

Ravin Kumar

arXiv:2209.06119·cs.LG·September 29, 2025

APTx: better activation function than MISH, SWISH, and ReLU's variants used in deep learning

Ravin Kumar

PDF

Open Access 2 Repos

TL;DR

This paper introduces APTx, a new activation function that performs similarly to MISH but with fewer computations, leading to faster training and lower hardware demands in deep learning models.

Contribution

The paper proposes APTx, a novel activation function that reduces computational complexity while maintaining performance comparable to MISH.

Findings

01

APTx speeds up model training compared to MISH.

02

APTx requires fewer mathematical operations.

03

APTx reduces hardware requirements for deep learning models.

Abstract

Activation Functions introduce non-linearity in the deep neural networks. This nonlinearity helps the neural networks learn faster and efficiently from the dataset. In deep learning, many activation functions are developed and used based on the type of problem statement. ReLU's variants, SWISH, and MISH are goto activation functions. MISH function is considered having similar or even better performance than SWISH, and much better than ReLU. In this paper, we propose an activation function named APTx which behaves similar to MISH, but requires lesser mathematical operations to compute. The lesser computational requirements of APTx does speed up the model training, and thus also reduces the hardware requirement for the deep learning model. Source code: https://github.com/mr-ravin/aptx_activation

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and Data Classification · Advanced Neural Network Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Tanh Activation