InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning
Congkai Xie, Shuo Cai, Wenjun Wang, Pengxiang Li, Zhijie, Sang, Kejing Yang, Yiming Zhang, Zhen Li, Guanghao Zhu, Zeyu, Liu, Yang Yu, Yuhang Liu, Su Lu, Baoyi He, Qi Zhou and, Xiaotian Han, Jianbo Yuan, Shengyu Zhang, Fei Wu, Hongxia Yang

TL;DR
This paper presents InfiR, a training pipeline for small and multimodal small language models that achieve strong reasoning performance, are resource-efficient, and suitable for edge deployment, addressing high costs and privacy issues of large models.
Contribution
The paper introduces a novel training pipeline for small and multimodal language models that enhances reasoning and enables efficient deployment on edge devices.
Findings
Achieves state-of-the-art reasoning performance among small models
Reduces computational and privacy concerns compared to large models
Facilitates deployment on resource-constrained devices
Abstract
Large Language Models (LLMs) and Multimodal Large Language Models (MLLMs) have made significant advancements in reasoning capabilities. However, they still face challenges such as high computational demands and privacy concerns. This paper focuses on developing efficient Small Language Models (SLMs) and Multimodal Small Language Models (MSLMs) that retain competitive reasoning abilities. We introduce a novel training pipeline that enhances reasoning capabilities and facilitates deployment on edge devices, achieving state-of-the-art performance while minimizing development costs. \InfR~ aims to advance AI systems by improving reasoning, reducing adoption barriers, and addressing privacy concerns through smaller model sizes. Resources are available at https://github. com/Reallm-Labs/InfiR.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
