Mistral: Mistral Nemo

Text input Text output
Author's Description

A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA. The model is multilingual, supporting English, French, German, Spanish, Italian, Portuguese, Chinese, Japanese,...

Key Specifications
Cost
$
Context
131K
Parameters
12B (Rumoured)
Released
Jul 18, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Frequency Penalty Top P Min P Response Format Temperature Stop Presence Penalty Max Tokens
Features

This model supports the following features:

Response Format
Performance Summary

Mistral Nemo, a 12B parameter multilingual model developed by Mistral in collaboration with NVIDIA, demonstrates exceptional speed and competitive pricing. It consistently ranks among the fastest models, achieving an Infinityth percentile across nine benchmarks, and offers highly competitive pricing, placing in the 95th percentile across eight benchmarks. The model also exhibits strong reliability with a 91% success rate across nine benchmarks, indicating consistent and usable responses. In terms of performance across categories, Mistral Nemo achieved perfect accuracy in Ethics, standing out as the most accurate model at its price point and among models of similar speed. It also shows strong performance in Coding (79.0% accuracy) and Email Classification (93.0% accuracy). However, the model struggles significantly with Instruction Following, particularly in one benchmark where it scored 0.0% accuracy, and shows limited proficiency in Mathematics (15.0% accuracy) and Reasoning (28.0% accuracy). Its hallucination rate is moderate at 62.0% accuracy, and general knowledge is fair at 84.2% accuracy, being the most accurate at its price point. The model's multilingual support and function calling capabilities, combined with its Apache 2.0 license, enhance its versatility for various applications.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.02
Completion $0.04

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | mistralai/mistral-nemo 131K $0.02 / 1M tokens $0.04 / 1M tokens
Kluster
Kluster | mistralai/mistral-nemo 131K $0.02 / 1M tokens $0.04 / 1M tokens
Enfer
Enfer | mistralai/mistral-nemo 131K $0.02 / 1M tokens $0.04 / 1M tokens
Parasail
Parasail | mistralai/mistral-nemo 131K $0.02 / 1M tokens $0.04 / 1M tokens
NextBit
NextBit | mistralai/mistral-nemo 128K $0.02 / 1M tokens $0.04 / 1M tokens
InferenceNet
InferenceNet | mistralai/mistral-nemo 16K $0.02 / 1M tokens $0.04 / 1M tokens
Nebius
Nebius | mistralai/mistral-nemo 128K $0.02 / 1M tokens $0.04 / 1M tokens
Novita
Novita | mistralai/mistral-nemo 60K $0.02 / 1M tokens $0.04 / 1M tokens
Atoma
Atoma | mistralai/mistral-nemo 128K $0.02 / 1M tokens $0.04 / 1M tokens
InoCloud
InoCloud | mistralai/mistral-nemo 131K $0.02 / 1M tokens $0.04 / 1M tokens
Mistral
Mistral | mistralai/mistral-nemo 131K $0.15 / 1M tokens $0.15 / 1M tokens
Azure
Azure | mistralai/mistral-nemo 128K $0.02 / 1M tokens $0.04 / 1M tokens
Enfer
Enfer | mistralai/mistral-nemo 131K $0.02 / 1M tokens $0.04 / 1M tokens
Nineteen
Nineteen | mistralai/mistral-nemo 32K $0.02 / 1M tokens $0.04 / 1M tokens
Chutes
Chutes | mistralai/mistral-nemo 131K $0.02 / 1M tokens $0.04 / 1M tokens
Chutes
Chutes | mistralai/mistral-nemo 131K $0.02 / 1M tokens $0.04 / 1M tokens
Novita
Novita | mistralai/mistral-nemo 60K $0.04 / 1M tokens $0.17 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by mistralai