Mistral: Voxtral Small 24B 2507

Text input Audio input Text output
Author's Description

Voxtral Small is an enhancement of Mistral Small 3, incorporating state-of-the-art audio input capabilities while retaining best-in-class text performance. It excels at speech transcription, translation and audio understanding. Input audio...

Key Specifications
Cost
$$
Context
32K
Parameters
24B
Released
Oct 30, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Tools Frequency Penalty Structured Outputs Top P Response Format Temperature Stop Presence Penalty Tool Choice Max Tokens
Features

This model supports the following features:

Structured Outputs Response Format Tools
Performance Summary

Mistral: Voxtral Small 24B 2507, an enhancement of Mistral Small 3 with advanced audio capabilities, demonstrates strong overall performance. The model consistently ranks among the fastest models, achieving the 89th percentile across seven benchmarks, and offers highly competitive pricing, placing in the 82nd percentile. Notably, it exhibits exceptional reliability with a perfect 100% success rate across all benchmarks, indicating minimal technical failures. In terms of specific capabilities, Voxtral Small excels in General Knowledge (98.5% accuracy) and demonstrates strong performance in Ethics (98.0%) and Email Classification (97.0%). Its unique audio input capabilities for transcription, translation, and understanding are a key differentiator. However, the model shows a notable weakness in handling hallucinations, with only 72.0% accuracy in identifying fictional concepts, suggesting an area for improvement in acknowledging uncertainty. Instruction Following (51.0% accuracy) and Reasoning (64.0% accuracy) also present opportunities for further development. Despite these areas, its speed, cost-effectiveness, and robust reliability make it a compelling option, particularly for applications requiring audio processing.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.1
Completion $0.3
Input Cache Read $0.01

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Mistral
Mistral | mistralai/voxtral-small-24b-2507 32K $0.1 / 1M tokens $0.3 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by mistralai