xAI: Grok 3 Beta

Text input Text output
Author's Description

Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in finance, healthcare, law, and science. Excels in structured tasks and benchmarks like GPQA, LCB, and MMLU-Pro where it outperforms Grok 3 Mini even on high thinking. Note: That there are two xAI endpoints for this model. By default when using this model we will always route you to the base endpoint. If you want the fast endpoint you can add `provider: { sort: throughput}`, to sort by throughput instead.

Key Specifications
Cost
$$$$$
Context
131K
Parameters
2.7T (Rumoured)
Released
Apr 09, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Stop Presence Penalty Temperature Seed Structured Outputs Response Format Frequency Penalty Max Tokens Tool Choice Top P Tools Logprobs Top Logprobs
Features

This model supports the following features:

Tools Structured Outputs Response Format
Performance Summary

Grok 3 Beta, xAI's flagship model, demonstrates strong overall performance, particularly excelling in enterprise use cases. It performs among the fastest models, typically ranking in the top tier for speed (66th percentile). While positioned at premium pricing levels (12th percentile), its exceptional reliability is a standout feature, consistently providing usable responses with minimal technical failures (100th percentile). The model shows impressive accuracy across various benchmarks. It achieves high scores in Coding (92.0%), Instruction Following (64.0%), Email Classification (99.0%), and Reasoning (74.0%). Notably, Grok 3 Beta achieved perfect accuracy in both Ethics (100.0%) and General Knowledge (100.0%), often being the most accurate model at its price point and speed for these categories. Its deep domain knowledge in finance, healthcare, law, and science, combined with its strong performance in structured tasks like GPQA, LCB, and MMLU-Pro, underscores its capability for complex applications. The primary weakness is its premium pricing, which may be a consideration for cost-sensitive deployments.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $5
Completion $25
Input Cache Read $1.25

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
xAI
xAI | x-ai/grok-3 131K $3 / 1M tokens $15 / 1M tokens
xAI
xAI | x-ai/grok-3 131K $5 / 1M tokens $25 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by x-ai