Author's Description
Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the board. Hermes 3 405B is a frontier-level, full-parameter finetune of the Llama-3.1 405B foundation model, focused on aligning LLMs to the user, with powerful steering capabilities and control given to the end user. The Hermes 3 series builds and expands on the Hermes 2 set of capabilities, including more powerful and reliable function calling and structured output capabilities, generalist assistant capabilities, and improved code generation skills. Hermes 3 is competitive with, if not superior to, Llama-3.1 Instruct models at general capabilities, with strengths and weaknesses varying between the two.
Performance Summary
Nous: Hermes 3 405B Instruct shows mid-range response times, with a 51st percentile speed ranking, and mid-range pricing, ranking in the 47th percentile. A standout feature is its reliability: it achieved a 100% success rate across all benchmarks, meaning it consistently returned evaluable responses with minimal technical failures.
Across categories, Hermes 3 405B is strong in several areas. It achieved perfect accuracy in Ethics and Email Classification. It scored 96.0% accuracy in Hallucinations, indicating that it usually acknowledges uncertainty rather than fabricating information. It also performs well in Instruction Following (70th percentile) and Coding (60th percentile). Its General Knowledge (98.5%) and Reasoning (70.0%) scores are solid, while Mathematics (77.0% accuracy, 38th percentile) is a relative weakness compared to other benchmarks.
Overall, Hermes 3 405B is a robust generalist model that its author describes as competitive with Llama-3.1 Instruct models, with an emphasis on alignment, steering capabilities, and reliable function calling.
Model Pricing
Current Pricing
| Feature | Price (per 1M tokens) |
|---|---|
| Prompt | $1 |
| Completion | $1 |
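The per-token arithmetic behind these prices can be sketched as follows. This is a minimal illustration, assuming the listed rates of $1 per 1M tokens apply linearly to both prompt and completion tokens; the function name `estimate_cost` and the constants are hypothetical and not part of any provider's API.

```python
from decimal import Decimal

# Prices taken from the pricing table, in USD per 1M tokens.
# Names here are illustrative assumptions, not a provider API.
PROMPT_PRICE_PER_M = Decimal("1")
COMPLETION_PRICE_PER_M = Decimal("1")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Return the estimated USD cost of a single request."""
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    prompt_cost = PROMPT_PRICE_PER_M * prompt_tokens / TOKENS_PER_M
    completion_cost = COMPLETION_PRICE_PER_M * completion_tokens / TOKENS_PER_M
    return prompt_cost + completion_cost


if __name__ == "__main__":
    # Example: a 2,000-token prompt with a 500-token completion.
    print(estimate_cost(2_000, 500))
```

Decimal arithmetic is used here so that small per-request amounts do not pick up binary floating-point rounding noise when summed over many requests.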
Available Endpoints
| Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
|---|---|---|---|---|
| DeepInfra | nousresearch/hermes-3-llama-3.1-405b | 131K | $1 / 1M tokens | $1 / 1M tokens |
| Lambda | nousresearch/hermes-3-llama-3.1-405b | 131K | $1 / 1M tokens | $1 / 1M tokens |
| Nebius | nousresearch/hermes-3-llama-3.1-405b | 131K | $1 / 1M tokens | $1 / 1M tokens |
Benchmark Results
| Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
|---|---|---|---|---|---|---|---|---|
Other Models by nousresearch
| Model | Released | Params | Context | Modalities | Speed | Ability | Cost |
|---|---|---|---|---|---|---|---|
| Nous: Hermes 4 70B | Aug 26, 2025 | 70B | 131K | Text input, text output | ★★★★ | ★★★ | $$ |
| Nous: Hermes 4 405B | Aug 26, 2025 | 405B | 131K | Text input, text output | ★★★★ | ★★★★ | $$$$ |
| Nous: DeepHermes 3 Mistral 24B Preview | May 09, 2025 | 24B | 32K | Text input, text output | ★★★★ | ★★★ | $$ |
| Nous: DeepHermes 3 Llama 3 8B Preview (Unavailable) | Feb 27, 2025 | 8B | 131K | Text input, text output | — | — | $ |
| Nous: Hermes 3 70B Instruct | Aug 17, 2024 | 70B | 12K | Text input, text output | ★★★★ | ★★★ | $$ |
| NousResearch: Hermes 2 Pro - Llama-3 8B | May 26, 2024 | 8B | 8K | Text input, text output | ★★★★★ | ★★ | $ |
| Nous: Hermes 2 Mixtral 8x7B DPO (Unavailable) | Jan 15, 2024 | 56B | 32K | Text input, text output | ★★★ | ★ | $$$$ |