Nous: Hermes 3 70B Instruct

Text input Text output
Author's Description

Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nousresearch/nous-hermes-2-mistral-7b-dpo), including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the board. Hermes 3 70B is a competitive, if not superior finetune of the [Llama-3.1 70B foundation model](/models/meta-llama/llama-3.1-70b-instruct), focused on aligning LLMs to the user, with powerful steering capabilities and control given to the end user. The Hermes 3 series builds and expands on the Hermes 2 set of capabilities, including more powerful and reliable function calling and structured output capabilities, generalist assistant capabilities, and improved code generation skills.

Key Specifications
Cost
$$
Context
12K
Parameters
70B
Released
Aug 17, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Temperature Max Tokens Min P Top P Presence Penalty Frequency Penalty Logit Bias Response Format Logprobs Top Logprobs Stop
Features

This model supports the following features:

Response Format
Performance Summary

Nous: Hermes 3 70B Instruct demonstrates competitive response times, ranking in the 58th percentile across benchmarks, indicating it performs at an average to above-average speed. It offers cost-effective solutions, placing in the 77th percentile for price. The model exhibits exceptional reliability with a 97% success rate, consistently providing usable responses. Across benchmarks, Hermes 3 70B shows strong performance in General Knowledge (97.2% accuracy) and Email Classification (98.0% accuracy), highlighting its ability to handle diverse information and categorize effectively. Its Ethics performance is also robust at 98.0% accuracy. Notable strengths include its cost-efficiency and high reliability. However, the model shows weaknesses in Mathematics (68.0% accuracy), Reasoning (52.0% accuracy), and Instruction Following (55.6% accuracy), suggesting areas for improvement in complex problem-solving and multi-step directive execution. Its Hallucinations score of 90.0% accuracy, while decent, indicates some room for improvement in acknowledging uncertainty. The model's Coding accuracy is solid at 81.0%.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.3
Completion $0.3

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Lambda
Lambda | nousresearch/hermes-3-llama-3.1-70b 131K $0.3 / 1M tokens $0.3 / 1M tokens
Hyperbolic
Hyperbolic | nousresearch/hermes-3-llama-3.1-70b 12K $0.3 / 1M tokens $0.3 / 1M tokens
DeepInfra
DeepInfra | nousresearch/hermes-3-llama-3.1-70b 131K $0.3 / 1M tokens $0.3 / 1M tokens
NextBit
NextBit | nousresearch/hermes-3-llama-3.1-70b 65K $0.3 / 1M tokens $0.3 / 1M tokens
NextBit
NextBit | nousresearch/hermes-3-llama-3.1-70b 65K $0.3 / 1M tokens $0.3 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by nousresearch