Environmental Footprint of GenAI Research: Insights from the Moshi Foundation Model

Marta L\'opez-Rauhut; Loic Landrieu; Mathieu Aubry; Anne-Laure Ligozat

arXiv:2604.11154·cs.AI·April 14, 2026

Environmental Footprint of GenAI Research: Insights from the Moshi Foundation Model

Marta L\'opez-Rauhut, Loic Landrieu, Mathieu Aubry, Anne-Laure Ligozat

PDF

TL;DR

This paper analyzes the environmental impact of developing the Moshi 7B-parameter MLLM, providing detailed insights into compute usage and offering guidelines for more sustainable AI research.

Contribution

It offers the first fine-grained analysis of compute and environmental impacts throughout the entire research process of a large language model.

Findings

01

Quantifies GPU-time spent on different model components and phases.

02

Assesses energy, water, and resource depletion from hardware production and use.

03

Provides actionable guidelines to reduce environmental impacts of MLLM research.

Abstract

New multi-modal large language models (MLLMs) are continuously being trained and deployed, following rapid development cycles. This generative AI frenzy is driving steady increases in energy consumption, greenhouse gas emissions, and a plethora of other environmental impacts linked to datacenter construction and hardware manufacturing. Mitigating the environmental consequences of GenAI remains challenging due to an overall lack of transparency by the main actors in the field. Even when the environmental impacts of specific models are mentioned, they are typically restricted to the carbon footprint of the final training run, omitting the research and development stages. In this work, we explore the impact of GenAI research through a fine-grained analysis of the compute spent to create Moshi, a 7B-parameter speech-text foundation model for real-time dialogue developed by Kyutai, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.