LFM2 Technical Report

Alexander Amini; Anna Banaszak; Harold Benoit; Arthur B\"o\"ok; Tarek Dakhran; Song Duong; Alfred Eng; Fernando Fernandes; Marc H\"ark\"onen; Anne Harrington; Ramin Hasani; Saniya Karwa; Yuri Khrustalev; Maxime Labonne; Mathias Lechner; Valentine Lechner; Simon Lee; Zetian Li; Noel Loo; Jacob Marks; Edoardo Mosca; Samuel J. Paech; Paul Pak; Rom N. Parnichkun; Alex Quach; Ryan Rogers; Daniela Rus; Nayan Saxena; Bettina Schlager; Tim Seyde; Jimmy T.H. Smith; Aditya Tadimeti; Neehal Tumma

arXiv:2511.23404·cs.LG·December 1, 2025

LFM2 Technical Report

Alexander Amini, Anna Banaszak, Harold Benoit, Arthur B\"o\"ok, Tarek Dakhran, Song Duong, Alfred Eng, Fernando Fernandes, Marc H\"ark\"onen, Anne Harrington, Ramin Hasani, Saniya Karwa, Yuri Khrustalev, Maxime Labonne, Mathias Lechner, Valentine Lechner, Simon Lee, Zetian Li

PDF

Open Access 10 Models 1 Datasets

TL;DR

LFM2 introduces a family of efficient, task-capable Liquid Foundation Models optimized for on-device deployment, featuring hardware-aware architecture search, diverse modalities, and strong benchmark performance.

Contribution

The paper presents a novel hardware-in-the-loop architecture search for compact models, a comprehensive training pipeline, and multimodal variants, advancing on-device AI capabilities.

Findings

01

Up to 2x faster prefill and decode on CPUs.

02

Achieves 79.56% on IFEval and 82.41% on GSM8K.

03

Models are open-sourced for practical deployment.

Abstract

We present LFM2, a family of Liquid Foundation Models designed for efficient on-device deployment and strong task capabilities. Using hardware-in-the-loop architecture search under edge latency and memory constraints, we obtain a compact hybrid backbone that combines gated short convolutions with a small number of grouped query attention blocks, delivering up to 2x faster prefill and decode on CPUs compared to similarly sized models. The LFM2 family covers 350M-8.3B parameters, including dense models (350M, 700M, 1.2B, 2.6B) and a mixture-of-experts variant (8.3B total, 1.5B active), all with 32K context length. LFM2's training pipeline includes a tempered, decoupled Top-K knowledge distillation objective that avoids support mismatch; curriculum learning with difficulty-ordered data; and a three-stage post-training recipe of supervised fine-tuning, length-normalized preference…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Datasets

LiquidAI/nanobeir-multilingual-extended
dataset· 1.9k dl
1.9k dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Multimodal Machine Learning Applications · Natural Language Processing Techniques