Mistral: Mixtral 8x7B Instruct

Text input Text output
Author's Description

Mixtral 8x7B Instruct is a pretrained generative Sparse Mixture of Experts, by Mistral AI, for chat and instruction use. Incorporates 8 experts (feed-forward networks) for a total of 47 billion parameters. Instruct model fine-tuned by Mistral. #moe

Key Specifications
Cost
$$
Context
32K
Parameters
56B
Released
Dec 09, 2023
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Frequency Penalty Temperature Tool Choice Seed Max Tokens Response Format Presence Penalty Top P Stop Min P
Features

This model supports the following features:

Tools Response Format
Performance Summary

Mixtral 8x7B Instruct consistently performs among the fastest models and offers highly competitive pricing across various benchmarks. The model demonstrates strong reliability with a 92% success rate, indicating consistent delivery of usable responses. Analysis of benchmark results reveals a mixed performance profile. The model exhibits notable strengths in Ethics, achieving 99.5% accuracy, and performs reasonably well in General Knowledge with 96.8% accuracy. It also shows a commendable ability to acknowledge uncertainty, scoring 84.0% in the Hallucinations test. A standout performance is observed in Email Classification, where it achieves 82.0% accuracy and ranks among the top 3 in speed. However, significant weaknesses are apparent in other areas. The model scored 0.0% accuracy in Instruction Following, indicating a critical area for improvement. Performance in Reasoning (34.0% accuracy) and Coding (38.0% accuracy) is also relatively low, suggesting challenges with complex problem-solving and programming tasks. While its speed and cost efficiency are excellent, the variability in accuracy across different categories highlights a need for further refinement in specific cognitive domains.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.4
Completion $0.4

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | mistralai/mixtral-8x7b-instruct 32K $0.4 / 1M tokens $0.4 / 1M tokens
Together
Together | mistralai/mixtral-8x7b-instruct 32K $0.6 / 1M tokens $0.6 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by mistralai