Mistral: Mistral Small 3

Text input Text output
Author's Description

Mistral Small 3 is a 24B-parameter language model optimized for low-latency performance across common AI tasks. Released under the Apache 2.0 license, it features both pre-trained and instruction-tuned versions designed...

Key Specifications
Cost
$
Context
32K
Parameters
24B
Released
Jan 30, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Frequency Penalty Top P Logprobs Min P Temperature Stop Presence Penalty Max Tokens Logit Bias Top Logprobs
Performance Summary

Mistral Small 3, a 24B-parameter model, demonstrates strong overall performance, particularly excelling in efficiency. It consistently performs among the fastest models, ranking in the 70th percentile for speed across eight benchmarks, and offers highly competitive pricing, placing in the 89th percentile across seven benchmarks. Notably, the model exhibits exceptional reliability with a 100% success rate across all evaluated benchmarks, indicating minimal technical failures. In terms of specific benchmarks, Mistral Small 3 shows a key strength in Instruction Following, achieving perfect 100% accuracy in one instance, making it the most accurate among models of comparable speed. It also performs well in Hallucinations (96.0% accuracy) and General Knowledge (98.0% accuracy). While its Ethics and Email Classification scores are respectable (98.0% and 97.0% respectively), they fall within the mid-range percentile. A notable area for improvement appears to be Reasoning (62.0% accuracy) and Coding (79.0% accuracy), where it ranks in the lower percentiles. Despite these areas, its overall MMLU accuracy of 81% and competitive performance against larger models like Llama 3.3 70B and Qwen 32B, at three times the speed, underscore its efficiency-focused design.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.05
Completion $0.08

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Kluster
Kluster | mistralai/mistral-small-24b-instruct-2501 32K $0.05 / 1M tokens $0.08 / 1M tokens
DeepInfra
DeepInfra | mistralai/mistral-small-24b-instruct-2501 32K $0.05 / 1M tokens $0.08 / 1M tokens
Enfer
Enfer | mistralai/mistral-small-24b-instruct-2501 28K $0.05 / 1M tokens $0.08 / 1M tokens
NextBit
NextBit | mistralai/mistral-small-24b-instruct-2501 32K $0.05 / 1M tokens $0.08 / 1M tokens
Mistral
Mistral | mistralai/mistral-small-24b-instruct-2501 32K $0.05 / 1M tokens $0.08 / 1M tokens
Ubicloud
Ubicloud | mistralai/mistral-small-24b-instruct-2501 32K $0.05 / 1M tokens $0.08 / 1M tokens
Together
Together | mistralai/mistral-small-24b-instruct-2501 32K $0.05 / 1M tokens $0.08 / 1M tokens
Enfer
Enfer | mistralai/mistral-small-24b-instruct-2501 28K $0.05 / 1M tokens $0.08 / 1M tokens
Chutes
Chutes | mistralai/mistral-small-24b-instruct-2501 32K $0.05 / 1M tokens $0.08 / 1M tokens
Chutes
Chutes | mistralai/mistral-small-24b-instruct-2501 32K $0.05 / 1M tokens $0.08 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by mistralai