H2O-Danube3 Technical Report
Pascal Pfeiffer, Philipp Singer, Yauhen Babakhin, Gabor Fodor, Nischay, Dhankhar, Sri Satish Ambati

TL;DR
H2O-Danube3 introduces a series of compact, high-performing language models trained on web data, optimized for mobile deployment, and openly available to democratize access to advanced LLM technology.
Contribution
The paper presents a new series of small, efficient language models trained on large web datasets, optimized for mobile use, and openly released under an open-source license.
Findings
Models achieve competitive benchmarks across multiple tasks.
Designed for efficient inference on smartphones.
Openly available to promote democratization of LLMs.
Abstract
We present H2O-Danube3, a series of small language models consisting of H2O-Danube3-4B, trained on 6T tokens and H2O-Danube3-500M, trained on 4T tokens. Our models are pre-trained on high quality Web data consisting of primarily English tokens in three stages with different data mixes before final supervised tuning for chat version. The models exhibit highly competitive metrics across a multitude of academic, chat, and fine-tuning benchmarks. Thanks to its compact architecture, H2O-Danube3 can be efficiently run on a modern smartphone, enabling local inference and rapid processing capabilities even on mobile devices. We make all models openly available under Apache 2.0 license further democratizing LLMs to a wider audience economically.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗h2oai/h2o-danube3-500m-chatmodel· 34k dl· ♡ 4034k dl♡ 40
- 🤗h2oai/h2o-danube3-4b-chatmodel· 1.2k dl· ♡ 681.2k dl♡ 68
- 🤗h2oai/h2o-danube3-500m-basemodel· 1.0k dl· ♡ 311.0k dl♡ 31
- 🤗h2oai/h2o-danube3-4b-basemodel· 1.7k dl· ♡ 221.7k dl♡ 22
- 🤗jncraton/h2o-danube3-500m-chat-ct2-int8model· 2 dl2 dl
- 🤗jncraton/h2o-danube3-4b-chat-ct2-int8model· 1 dl1 dl
- 🤗BoscoTheDog/Danube_3-500M_Chat_GGUFmodel· 29 dl· ♡ 129 dl♡ 1
- 🤗RichardErkhov/h2oai_-_h2o-danube3-4b-chat-ggufmodel· 66 dl66 dl
- 🤗RichardErkhov/h2oai_-_h2o-danube3-4b-base-ggufmodel· 26 dl26 dl
- 🤗RichardErkhov/h2oai_-_h2o-danube3-500m-base-ggufmodel· 11 dl11 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndustrial Gas Emission Control
