Nous: Hermes 3 70B Instruct

Text input Text output
Author's Description

Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nousresearch/nous-hermes-2-mistral-7b-dpo), including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the board. Hermes 3 70B is a competitive, if not superior finetune of the [Llama-3.1 70B foundation model](/models/meta-llama/llama-3.1-70b-instruct), focused on aligning LLMs to the user, with powerful steering capabilities and control given to the end user. The Hermes 3 series builds and expands on the Hermes 2 set of capabilities, including more powerful and reliable function calling and structured output capabilities, generalist assistant capabilities, and improved code generation skills.

Key Specifications
Cost
$$
Context
131K
Parameters
70B
Released
Aug 17, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Stop Presence Penalty Logit Bias Top P Temperature Seed Min P Response Format Frequency Penalty Logprobs Max Tokens Top Logprobs
Features

This model supports the following features:

Response Format
Performance Summary

Nous: Hermes 3 70B Instruct demonstrates competitive response times, performing among the faster models with a 60th percentile speed ranking. It also offers cost-effective solutions, ranking in the 76th percentile for price. Notably, the model exhibits exceptional reliability, achieving a perfect 100th percentile, indicating minimal technical failures and consistent provision of usable responses. Across benchmarks, Hermes 3 70B shows strong general knowledge and classification capabilities, scoring 97.2% in General Knowledge and 98.0% in Email Classification. Its Coding performance is solid at 81.0% accuracy. However, its Instruction Following (55.6%) and Reasoning (54.0%) scores are more moderate, suggesting areas for potential improvement in complex multi-step tasks. While its Ethics score is high at 98.0%, its percentile ranking (45th) indicates this is a common high-scoring area for models. Key strengths include its high reliability, cost-effectiveness, and strong performance in knowledge-based and classification tasks. Weaknesses are primarily in advanced reasoning and intricate instruction following, where accuracy could be further enhanced.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.12
Completion $0.3

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Lambda
Lambda | nousresearch/hermes-3-llama-3.1-70b 131K $0.12 / 1M tokens $0.3 / 1M tokens
Hyperbolic
Hyperbolic | nousresearch/hermes-3-llama-3.1-70b 12K $0.4 / 1M tokens $0.4 / 1M tokens
DeepInfra
DeepInfra | nousresearch/hermes-3-llama-3.1-70b 131K $0.1 / 1M tokens $0.28 / 1M tokens
NextBit
NextBit | nousresearch/hermes-3-llama-3.1-70b 65K $0.3 / 1M tokens $0.3 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by nousresearch