Nous: Hermes 3 405B Instruct

Text input Text output Free Option
Author's Description

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

Key Specifications
Cost
$$$
Context
131K
Parameters
405B
Released
Aug 15, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Min P Top P Seed Structured Outputs Presence Penalty Temperature Response Format Max Tokens Logit Bias Frequency Penalty Stop
Features

This model supports the following features:

Structured Outputs Response Format
Performance Summary

Nous: Hermes 3 405B Instruct demonstrates competitive response times, ranking in the 54th percentile across eight benchmarks, and offers competitive pricing, placing in the 51st percentile. The model exhibits exceptional reliability with a 100% success rate across all benchmarks, indicating consistent and usable responses without technical failures. In terms of performance across categories, Hermes 3 405B shows strong capabilities in Ethics and Email Classification, achieving perfect 100% accuracy in both, and is noted as the most accurate model at its price point and speed for these tasks. It also performs well in Hallucinations (96.0% accuracy) and General Knowledge (98.5% accuracy), indicating a robust understanding of factual information and an appropriate acknowledgment of uncertainty. Its Instruction Following is solid at 63.0% accuracy, placing it in the 63rd percentile. While its Coding accuracy is respectable at 85.0%, its Mathematics performance is a notable weakness, with 77.0% accuracy, placing it in the 31st percentile. Reasoning also presents an area for potential improvement at 70.0% accuracy. Overall, Hermes 3 405B is a generalist model with particular strengths in ethical reasoning, classification, and general factual recall, while mathematics remains an area for development.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1
Completion $1

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | nousresearch/hermes-3-llama-3.1-405b 131K $1 / 1M tokens $1 / 1M tokens
Lambda
Lambda | nousresearch/hermes-3-llama-3.1-405b 131K $1 / 1M tokens $1 / 1M tokens
Nebius
Nebius | nousresearch/hermes-3-llama-3.1-405b 131K $1 / 1M tokens $1 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by nousresearch