Time dependent loss reweighting for flow matching and diffusion models is theoretically justified
Lukas Billera, Hedwig Nora Nordlinder, and Ben Murrell

TL;DR
This paper provides a theoretical foundation for time-dependent loss weighting in flow matching and diffusion models, showing that such schemes are justified and can simplify model construction.
Contribution
It demonstrates that time-dependent loss functions and generator parameterizations are theoretically valid within Generator Matching and Edit Flows frameworks.
Findings
Time-dependent loss weighting schemes are theoretically justified.
Dependence of loss and generator on time $t$ can simplify model design.
Examples show the practical benefits of time-dependent parameterizations.
Abstract
This brief note clarifies that, in Generator Matching (which subsumes a large family of flow matching and diffusion models over continuous, manifold, and discrete spaces), both the Bregman divergence loss and the linear parameterization of the generator can depend on both the current state and the time , and we show that the expectation over time in the loss can be taken with respect to a broad class of time distributions. We also show this for Edit Flows, which falls outside of Generator Matching. That the loss can depend on clarifies that time-dependent loss weighting schemes, often used in practice to stabilize training, are theoretically justified when the specific flow or diffusion scheme is a special case of Generator Matching (or Edit Flows). It also often simplifies the construction of -predictor schemes, which are sometimes preferred for model-related reasons.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis · Markov Chains and Monte Carlo Methods
