Seed1.8 Model Card: Towards Generalized Real-World Agency

Bytedance Seed

arXiv:2603.20633·cs.AI·April 20, 2026

Seed1.8 Model Card: Towards Generalized Real-World Agency

Bytedance Seed

PDF

TL;DR

Seed1.8 is a versatile foundation model designed for complex, multi-step real-world tasks involving multi-turn interactions, tool use, and multi-modal understanding, with optimized deployment features.

Contribution

It introduces Seed1.8, a unified model supporting multi-turn interaction, tool use, and multi-modal tasks, with deployment optimizations for real-world applications.

Findings

01

Strong performance on standard benchmarks and workflows.

02

Supports multi-turn, multi-modal, and agentic behaviors.

03

Offers latency- and cost-aware inference modes.

Abstract

We present Seed1.8, a foundation model aimed at generalized real-world agency: going beyond single-turn prediction to multi-turn interaction, tool use, and multi-step execution. Seed1.8 keeps strong LLM and vision-language performance while supporting a unified agentic interface-search, code generation and execution, and GUI interaction. For deployment, it offers latency- and cost-aware inference, including configurable thinking modes and optimized visual encoding for images and video. We report evaluations on standard benchmarks and application-aligned workflows spanning foundational skills, multimodal understanding, and agentic behavior. Seed1.8 is released to support further research and development on interactive, real-world use cases.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.