Parallel Decoder Transformer: Planner-Seeded Latent Coordination for Synchronized Parallel Decoding
Logan Robbins

TL;DR
The paper introduces the Parallel Decoder Transformer (PDT), a novel architecture enabling synchronized parallel decoding in language models through internal coordination mechanisms, improving upon external prompt-based methods.
Contribution
It presents a frozen-trunk transformer architecture with a planner-seeded latent workspace and synchronized multi-stream output protocol for better parallel decoding coordination.
Findings
Enables synchronized parallel decoding within a frozen language model.
Uses a planner and latent workspace for internal coordination.
Improves parallel task decomposition over external prompting.
Abstract
Autoregressive language models can often identify parallel subproblems, but standard decoding exposes only a single left-to-right output interface. External orchestration methods can launch multiple prompts concurrently, yet they provide no model-internal state through which those generations can synchronize, resolve ownership, or wait for missing information. We present the Parallel Decoder Transformer (PDT), a frozen-trunk architecture that augments a decoder with a planner-seeded latent workspace and a synchronized multi-stream output protocol. Before any stream emits tokens, a mandatory prompt-time planner predicts fixed latent plan slots and projects them as snapshot 0 on an embeddings-only Dynamic Notes Bus. During decoding, each stream reads the visible notes window through Speculative Note Conditioning (SNC), emits provisional token blocks and latent summaries, and advances only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Generative Adversarial Networks and Image Synthesis
