A.X K1 Technical Report

Sung Jun Cheon; Jaekyung Cho; Seongho Choi; Hyunjun Eun; Seokhwan Jo; Jaehyun Jun; Minsoo Kang; Jin Kim; Jiwon Kim; Minsang Kim; Seungsik Kim; Sungwan Kim; Tae Yoon Kim; Youngrang Kim; Hyeongmun Lee; Sangyeol Lee; Sungeun Lee; Youngsoon Lee; Yujin Lee; Seongmin Ok; Chanyong Park; Hyewoong Park; Junyoung Park; Hyunho Yang; Subin Yi; Dhammiko Arya; Soohyun Bae; Dongyeon Cho; Seungmo Cho; Sangho Choi; Yongseok Choi; Gyoungeun Han; Yong-jin Han; Seokyoung Hong; Hyeon Hwang; Wonbeom Jang; Minjeong Ju; Wonjin Jung; Keummin Ka; Sungil Kang; Dongnam Kim; Jonghwi Kim; Joonghoon Kim; SaeRom Kim; Sangjin Kim; Seongwon Kim; Youngjin Kim; Seojin Lee; Sunwoo Lee; Taehoon Lee; Chanwoo Park; Sohee Park; Sooyeon Park; Yohan Ra; Sereimony Sek; Seungyeon Seo; Gun Song; Sanghoon Woo; Janghan Yoon; Sungbin Yoon

arXiv:2601.09200·cs.CL·February 12, 2026

A.X K1 Technical Report

Sung Jun Cheon, Jaekyung Cho, Seongho Choi, Hyunjun Eun, Seokhwan Jo, Jaehyun Jun, Minsoo Kang, Jin Kim, Jiwon Kim, Minsang Kim, Seungsik Kim, Sungwan Kim, Tae Yoon Kim, Youngrang Kim, Hyeongmun Lee, Sangyeol Lee, Sungeun Lee, Youngsoon Lee, Yujin Lee, Seongmin Ok, Chanyong Park

PDF

Open Access 1 Models

TL;DR

A.X K1 is a large Mixture-of-Experts language model trained on 10 trillion tokens, designed for scalable deployment and controllable reasoning, with competitive performance and a novel Think-Fusion training method.

Contribution

Introduces A.X K1, a 519B-parameter MoE model with a new training recipe for user-controlled reasoning modes and optimized for diverse real-world applications.

Findings

01

Achieves competitive performance with leading models

02

Demonstrates superior results on Korean-language benchmarks

03

Supports explicit controllable reasoning modes

Abstract

We introduce A.X K1, a 519B-parameter Mixture-of-Experts (MoE) language model trained from scratch. Our design leverages scaling laws to optimize training configurations and vocabulary size under fixed computational budgets. A.X K1 is pre-trained on a corpus of approximately 10T tokens, curated by a multi-stage data processing pipeline. Designed to bridge the gap between reasoning capability and inference efficiency, A.X K1 supports explicitly controllable reasoning to facilitate scalable deployment across diverse real-world scenarios. We propose a simple yet effective Think-Fusion training recipe, enabling user-controlled switching between thinking and non-thinking modes within a single unified model. Extensive evaluations demonstrate that A.X K1 achieves performance competitive with leading open-source models, while establishing a distinctive advantage in Korean-language benchmarks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
skt/A.X-K1
model· 12k dl· ♡ 275
12k dl♡ 275

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques