Nous: Hermes 3 405B Instruct

Text input Text output Free Option
Author's Description

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the board. Hermes 3 405B is a frontier-level, full-parameter finetune of the Llama-3.1 405B foundation model, focused on aligning LLMs to the user, with powerful steering capabilities and control given to the end user. The Hermes 3 series builds and expands on the Hermes 2 set of capabilities, including more powerful and reliable function calling and structured output capabilities, generalist assistant capabilities, and improved code generation skills. Hermes 3 is competitive, if not superior, to Llama-3.1 Instruct models at general capabilities, with varying strengths and weaknesses attributable between the two.

Key Specifications
Cost
$$$
Context
131K
Parameters
405B
Released
Aug 15, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Response Format Presence Penalty Min P Top P Frequency Penalty Max Tokens Seed Stop Temperature
Features

This model supports the following features:

Response Format
Performance Summary

Nous: Hermes 3 405B Instruct demonstrates competitive response times, performing among the fastest models with a 51st percentile speed ranking. It also offers competitive pricing, ranking in the 47th percentile. A standout feature is its exceptional reliability, achieving a 100% success rate across all benchmarks, indicating minimal technical failures and consistent evaluable responses. In terms of performance across categories, Hermes 3 405B exhibits strong capabilities in several areas. It achieved perfect accuracy in Ethics and Email Classification, highlighting its precision in moral reasoning and categorization tasks. Its ability to appropriately acknowledge uncertainty is strong, with 96.0% accuracy in Hallucinations, indicating a low propensity for generating fabricated information. The model also performs well in Instruction Following (70th percentile) and Coding (60th percentile), showcasing its ability to adhere to complex directives and generate functional code. While its General Knowledge (98.5%) and Reasoning (70.0%) scores are solid, its Mathematics performance (77.0% accuracy, 38th percentile) is a relative weakness compared to other benchmarks. Overall, Hermes 3 405B is a robust generalist model, competitive with Llama-3.1 Instruct models, particularly excelling in alignment, steering capabilities, and reliable function calling.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1
Completion $1

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | nousresearch/hermes-3-llama-3.1-405b 131K $1 / 1M tokens $1 / 1M tokens
Lambda
Lambda | nousresearch/hermes-3-llama-3.1-405b 131K $1 / 1M tokens $1 / 1M tokens
Nebius
Nebius | nousresearch/hermes-3-llama-3.1-405b 131K $1 / 1M tokens $1 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by nousresearch