LFM2 Technical Report
Alexander Amini, Anna Banaszak, Harold Benoit, Arthur B\"o\"ok, Tarek Dakhran, Song Duong, Alfred Eng, Fernando Fernandes, Marc H\"ark\"onen, Anne Harrington, Ramin Hasani, Saniya Karwa, Yuri Khrustalev, Maxime Labonne, Mathias Lechner, Valentine Lechner, Simon Lee, Zetian Li

TL;DR
LFM2 introduces a family of efficient, task-capable Liquid Foundation Models optimized for on-device deployment, featuring hardware-aware architecture search, diverse modalities, and strong benchmark performance.
Contribution
The paper presents a novel hardware-in-the-loop architecture search for compact models, a comprehensive training pipeline, and multimodal variants, advancing on-device AI capabilities.
Findings
Up to 2x faster prefill and decode on CPUs.
Achieves 79.56% on IFEval and 82.41% on GSM8K.
Models are open-sourced for practical deployment.
Abstract
We present LFM2, a family of Liquid Foundation Models designed for efficient on-device deployment and strong task capabilities. Using hardware-in-the-loop architecture search under edge latency and memory constraints, we obtain a compact hybrid backbone that combines gated short convolutions with a small number of grouped query attention blocks, delivering up to 2x faster prefill and decode on CPUs compared to similarly sized models. The LFM2 family covers 350M-8.3B parameters, including dense models (350M, 700M, 1.2B, 2.6B) and a mixture-of-experts variant (8.3B total, 1.5B active), all with 32K context length. LFM2's training pipeline includes a tempered, decoupled Top-K knowledge distillation objective that avoids support mismatch; curriculum learning with difficulty-ordered data; and a three-stage post-training recipe of supervised fine-tuning, length-normalized preference…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗LiquidAI/LFM2.5-350Mmodel· 3.8k dl· ♡ 1613.8k dl♡ 161
- 🤗LiquidAI/LFM2.5-1.2B-Instructmodel· 292k dl· ♡ 545292k dl♡ 545
- 🤗LiquidAI/LFM2.5-1.2B-Thinkingmodel· 27k dl· ♡ 32427k dl♡ 324
- 🤗LiquidAI/LFM2.5-350M-Basemodel· 305 dl· ♡ 8305 dl♡ 8
- 🤗LiquidAI/LFM2.5-VL-1.6Bmodel· 128k dl· ♡ 263128k dl♡ 263
- 🤗LiquidAI/LFM2-24B-A2Bmodel· 39k dl· ♡ 30039k dl♡ 300
- 🤗LiquidAI/LFM2-8B-A1Bmodel· 47k dl· ♡ 34347k dl♡ 343
- 🤗LiquidAI/LFM2.5-Audio-1.5Bmodel· 1.0k dl· ♡ 3741.0k dl♡ 374
- 🤗LiquidAI/LFM2-350Mmodel· 39k dl· ♡ 24339k dl♡ 243
- 🤗LiquidAI/LFM2-350M-Mathmodel· 516 dl· ♡ 55516 dl♡ 55
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Multimodal Machine Learning Applications · Natural Language Processing Techniques
