Hermes 3 Technical Report
Ryan Teknium, Jeffrey Quesnelle, Chen Guang

TL;DR
Hermes 3 is a neutrally-aligned, generalist instruct and tool-use model with strong reasoning and creative abilities, achieving state-of-the-art performance among open weight models on various benchmarks.
Contribution
Introduces Hermes 3, a new instruct-tuned model with strong reasoning and creative skills, and demonstrates its superior performance on multiple benchmarks.
Findings
Hermes 3 405B achieves state-of-the-art results among open weight models.
The model demonstrates strong reasoning capabilities.
It is neutrally aligned and versatile in instruction and tool use.
Abstract
Instruct (or "chat") tuned models have become the primary way in which most people interact with large language models. As opposed to "base" or "foundation" models, instruct-tuned models are optimized to respond to imperative statements. We present Hermes 3, a neutrally-aligned generalist instruct and tool use model with strong reasoning and creative abilities. Its largest version, Hermes 3 405B, achieves state of the art performance among open weight models on several public benchmarks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗NousResearch/Hermes-3-Llama-3.1-8B-GGUFmodel· 7.8k dl· ♡ 1427.8k dl♡ 142
- 🤗NousResearch/Hermes-3-Llama-3.1-8Bmodel· 115k dl· ♡ 400115k dl♡ 400
- 🤗NousResearch/Hermes-3-Llama-3.1-70B-GGUFmodel· 319 dl· ♡ 44319 dl♡ 44
- 🤗NousResearch/Hermes-3-Llama-3.1-405Bmodel· 133 dl· ♡ 266133 dl♡ 266
- 🤗NousResearch/Hermes-3-Llama-3.1-70Bmodel· 789 dl· ♡ 122789 dl♡ 122
- 🤗NousResearch/Hermes-3-Llama-3.1-70B-FP8model· 466 dl· ♡ 25466 dl♡ 25
- 🤗NousResearch/Hermes-3-Llama-3.1-405B-FP8model· 13 dl· ♡ 2813 dl♡ 28
- 🤗QuantFactory/Hermes-3-Llama-3.1-8B-GGUFmodel· 73 dl· ♡ 373 dl♡ 3
- 🤗bullerwins/Hermes-3-Llama-3.1-8B-exl2_4.0bpwmodel
- 🤗emplitude/offtheshelf2model· 1 dl1 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques
