Parameter-Efficient Transformer Embeddings

Henry Ndubuaku; Mouad Talhi

arXiv:2505.02266 · cs.CL · May 6, 2025

TL;DR

This paper introduces a parameter-efficient method for transformer embeddings that uses deterministic Fourier-based token vectors and a lightweight MLP, reducing parameters and training time while maintaining competitive NLP performance.

Contribution

The paper presents an embedding approach that combines a Fourier expansion of token IDs with a small MLP, reducing model size and training time while keeping accuracy competitive with standard learned embeddings.

Findings

01. Achieves competitive NLP performance with fewer parameters

02. Trains faster and operates without dropout

03. Demonstrates potential for scalable, memory-efficient language models

Abstract

Embedding layers in transformer-based NLP models typically account for the largest share of model parameters, scaling with vocabulary size but not yielding performance gains proportional to scale. We propose an alternative approach in which token embedding vectors are first generated deterministically, directly from the token IDs using a Fourier expansion of their normalized values, followed by a lightweight multilayer perceptron (MLP) that captures higher-order interactions. We train standard transformers and our architecture on natural language inference tasks (SNLI and MNLI), and evaluate zero-shot performance on sentence textual similarity (STS-B). Our results demonstrate that the proposed method achieves competitive performance using significantly fewer parameters, trains faster, and operates effectively without the need for dropout. This proof-of-concept study highlights the…
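The pipeline described in the abstract (deterministic Fourier features of the normalized token ID, followed by a small MLP) can be sketched roughly as follows. This is a minimal NumPy illustration, not the authors' implementation: the function names, the normalization to [-1, 1), the integer sine/cosine frequencies, the two-layer ReLU MLP, and all sizes are assumptions made for the example.

```python
import numpy as np


def fourier_token_features(token_ids, vocab_size, dim):
    """Parameter-free features computed directly from token IDs.

    Assumption: IDs are normalized to [-1, 1) and expanded with sin/cos at
    integer frequencies 1..dim/2. The paper's exact basis may differ.
    """
    if dim % 2 != 0:
        raise ValueError("dim must be even")
    ids = np.asarray(token_ids, dtype=np.float64)
    x = 2.0 * ids / vocab_size - 1.0           # normalized token value in [-1, 1)
    freqs = np.arange(1, dim // 2 + 1)         # shape (dim/2,)
    angles = np.pi * x[..., None] * freqs      # shape (..., dim/2)
    return np.concatenate([np.sin(angles), np.cos(angles)], axis=-1)


def init_mlp(dim, hidden, rng):
    """Randomly initialized two-layer MLP (hypothetical sizes and init)."""
    return {
        "w1": rng.normal(0.0, dim ** -0.5, (dim, hidden)),
        "b1": np.zeros(hidden),
        "w2": rng.normal(0.0, hidden ** -0.5, (hidden, dim)),
        "b2": np.zeros(dim),
    }


def mlp(params, x):
    """The only trainable part of the embedding in this sketch."""
    h = np.maximum(x @ params["w1"] + params["b1"], 0.0)  # ReLU
    return h @ params["w2"] + params["b2"]


def embed(params, token_ids, vocab_size, dim):
    """Token IDs -> deterministic Fourier features -> MLP output."""
    return mlp(params, fourier_token_features(token_ids, vocab_size, dim))


# Illustrative sizes only; these are not the paper's settings.
vocab_size, dim, hidden = 30_000, 64, 64
params = init_mlp(dim, hidden, np.random.default_rng(0))
out = embed(params, [[0, 17, 29_999]], vocab_size, dim)  # shape (1, 3, 64)

n_mlp = sum(p.size for p in params.values())  # trainable parameters here
n_table = vocab_size * dim                    # learned lookup table, same dim
```

With these illustrative sizes the MLP holds 8,320 trainable parameters, against 1,920,000 for a learned lookup table of the same width, and the count does not grow with the vocabulary. That is the kind of saving the abstract refers to, though the figures here are not taken from the paper.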

Code & Models

Repositories

HMUNACHI/pete (PyTorch, official)

Taxonomy

Topics: Sensor Technology and Measurement Systems
