The Disappearance of Timestep Embedding in Modern Time-Dependent Neural Networks
Bum Jun Kim, Yoshinobu Kawahara, Sang Woo Kim

TL;DR
This paper analyzes how modern time-dependent neural networks often lose their ability to be aware of time due to vanishing timestep embeddings, and proposes solutions to restore this crucial feature, improving model performance.
Contribution
It identifies the vanishing timestep embedding as a key vulnerability in time-dependent neural networks and offers effective solutions to enhance their time-awareness.
Findings
Vanishing timestep embedding disables time-awareness in neural ODEs.
Similar issues are present in diffusion models using timestep embeddings.
Proposed solutions successfully restore time-awareness and improve model performance.
Abstract
Dynamical systems are often time-varying, whose modeling requires a function that evolves with respect to time. Recent studies such as the neural ordinary differential equation proposed a time-dependent neural network, which provides a neural network varying with respect to time. However, we claim that the architectural choice to build a time-dependent neural network significantly affects its time-awareness but still lacks sufficient validation in its current states. In this study, we conduct an in-depth analysis of the architecture of modern time-dependent neural networks. Here, we report a vulnerability of vanishing timestep embedding, which disables the time-awareness of a time-dependent neural network. Furthermore, we find that this vulnerability can also be observed in diffusion models because they employ a similar architecture that incorporates timestep embedding to discriminate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsDiffusion
