K2-V2: A 360-Open, Reasoning-Enhanced LLM

K2 Team: Zhengzhong Liu; Liping Tang; Linghao Jin; Haonan Li; Nikhil Ranjan; Desai Fan; Shaurya Rohatgi; Richard Fan; Omkar Pangarkar; Huijuan Wang; Zhoujun Cheng; Suqi Sun; Seungwook Han; Bowen Tan; Gurpreet Gosal; Xudong Han; Varad Pimpalkhute; Shibo Hao; Ming Shan Hee; Joel Hestness; Haolong Jia; Liqun Ma; Aaryamonvikram Singh; Daria Soboleva; Natalia Vassilieva; Renxi Wang; Yingquan Wu; Yuekai Sun; Taylor Killian; Alexander Moreno; John Maggs; Hector Ren; Guowei He; Hongyi Wang; Xuezhe Ma; Yuqi Wang; Mikhail Yurochkin; Eric P. Xing

arXiv:2512.06201·cs.LG·January 28, 2026

K2-V2: A 360-Open, Reasoning-Enhanced LLM

K2 Team: Zhengzhong Liu, Liping Tang, Linghao Jin, Haonan Li, Nikhil Ranjan, Desai Fan, Shaurya Rohatgi, Richard Fan, Omkar Pangarkar, Huijuan Wang, Zhoujun Cheng, Suqi Sun, Seungwook Han, Bowen Tan, Gurpreet Gosal, Xudong Han, Varad Pimpalkhute, Shibo Hao, Ming Shan Hee

PDF

Open Access 2 Models 3 Datasets

TL;DR

K2-V2 is a fully open, reasoning-enhanced large language model that outperforms many existing models in size and capability, with a focus on reasoning, domain knowledge, and tool use, and is openly released for community use.

Contribution

The paper introduces K2-V2, a new open-source LLM with superior reasoning and performance, and provides full training data and artifacts to support community development.

Findings

01

Outperforms Qwen2.5-72B and approaches Qwen3-235B performance.

02

Explicitly infused reasoning, domain knowledge, and tool use during training.

03

Open release of training data and model weights to facilitate further research.

Abstract

We introduce K2-V2, a 360-open LLM built from scratch as a superior base for reasoning adaptation, in addition to functions such as conversation and knowledge retrieval from general LLMs. It stands as the strongest fully open model, rivals open-weight leaders in its size class, outperforms Qwen2.5-72B and approaches the performance of Qwen3-235B. We actively infuse domain knowledge, reasoning, long-context, and tool use throughout the training process. This explicitly prepares the model for complex reasoning tasks. We demonstrate this potential using simple supervised fine-tuning, establishing a strong baseline that indicates significant headroom for advanced alignment. By releasing the full training history and data composition, we maximize the effectiveness of continuous training, a key open source production scenario. We release the model weights and signature LLM360 artifacts, such…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications