The Backpropagation of the Wave Network
Xin Zhang, Victor S. Sheng

TL;DR
This paper introduces Wave Network, a novel token representation method using complex vectors to capture global and local semantics, with analysis showing efficiency and reduced resource usage compared to BERT.
Contribution
The paper presents a new wave-inspired token representation method, Token2Wave, with analysis of its convergence, backpropagation, and efficiency advantages over existing models.
Findings
Reduces video memory usage compared to BERT
Maintains semantic richness through complex vector representations
Demonstrates efficient training and convergence behavior
Abstract
This paper provides an in-depth analysis of Wave Network, a novel token representation method derived from the Wave Network, designed to capture both global and local semantics of input text through wave-inspired complex vectors. In complex vector token representation, each token is represented with a magnitude component, capturing the global semantics of the entire input text, and a phase component, encoding the relationships between individual tokens and the global semantics. Building on prior research that demonstrated the effectiveness of wave-like operations, such as interference and modulation, during forward propagation, this study investigates the convergence behavior, backpropagation characteristics, and embedding independence within the Token2Wave framework. A detailed computational complexity analysis shows that Token2Wave can significantly reduce video memory usage and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Ferroelectric and Negative Capacitance Devices · Topic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Dropout · Linear Warmup With Linear Decay · WordPiece · Dense Connections · Layer Normalization · Adam · Attention Dropout
