Markup-to-Image Diffusion Models with Scheduled Sampling

Yuntian Deng; Noriyuki Kojima; Alexander M. Rush

arXiv:2210.05147·cs.LG·October 12, 2022·1 cites

Markup-to-Image Diffusion Models with Scheduled Sampling

Yuntian Deng, Noriyuki Kojima, Alexander M. Rush

PDF

Open Access 1 Repo 1 Models

TL;DR

This paper introduces a diffusion-based method with scheduled sampling for converting markup languages into images, improving generation quality across diverse datasets.

Contribution

It adapts scheduled sampling to diffusion models for markup-to-image tasks, addressing exposure bias and enhancing generation accuracy.

Findings

01

Effective diffusion process verified across datasets

02

Scheduled sampling mitigates generation errors

03

Markup-to-image task aids in analyzing generative models

Abstract

Building on recent advances in image generation, we present a fully data-driven approach to rendering markup into images. The approach is based on diffusion models, which parameterize the distribution of data using a sequence of denoising operations on top of a Gaussian noise distribution. We view the diffusion denoising process as a sequential decision making process, and show that it exhibits compounding errors similar to exposure bias issues in imitation learning problems. To mitigate these issues, we adapt the scheduled sampling algorithm to diffusion training. We conduct experiments on four markup datasets: mathematical formulas (LaTeX), table layouts (HTML), sheet music (LilyPond), and molecular images (SMILES). These experiments each verify the effectiveness of the diffusion process and the use of scheduled sampling to fix generation issues. These results also show that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

da03/markup2im
pytorchOfficial

Models

🤗
yuntian-deng/latex2im_ss_finetunegptneo
model· 2 dl
2 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Image Retrieval and Classification Techniques · Music and Audio Processing

MethodsDiffusion