xAI: Grok 3 Beta

Text input Text output
Author's Description

Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in finance, healthcare, law, and science. Excels in structured tasks and benchmarks like GPQA, LCB, and MMLU-Pro where it outperforms Grok 3 Mini even on high thinking. Note: That there are two xAI endpoints for this model. By default when using this model we will always route you to the base endpoint. If you want the fast endpoint you can add `provider: { sort: throughput}`, to sort by throughput instead.

Key Specifications
Cost
$$$$$
Context
131K
Parameters
2.7T (Rumoured)
Released
Apr 09, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tool Choice Response Format Seed Top P Temperature Top Logprobs Tools Structured Outputs Logprobs Stop Max Tokens Frequency Penalty Presence Penalty
Features

This model supports the following features:

Tools Response Format Structured Outputs
Performance Summary

Grok 3 Beta, xAI's flagship model, demonstrates competitive response times, performing among the faster models with a 58th percentile speed ranking. However, it is positioned at premium pricing levels, ranking in the 11th percentile for cost. A standout feature is its exceptional reliability, achieving a perfect 100% success rate across all benchmarks, indicating minimal technical failures. The model exhibits perfect accuracy in critical areas such as Hallucinations, General Knowledge, and Ethics, often being the most accurate at its price point and speed. It also performs strongly in Email Classification (99.0% accuracy) and Coding (92.0% accuracy), placing it in the upper percentiles for these categories. While its Instruction Following (64.0% accuracy) and Reasoning (76.0% accuracy) capabilities are solid, they are not as dominant as its knowledge-based and ethical performance. Grok 3 Beta excels in structured tasks and benchmarks like GPQA, LCB, and MMLU-Pro, outperforming Grok 3 Mini even on high-thinking tasks, aligning with its description as a model suited for enterprise use cases like data extraction and text summarization, and possessing deep domain knowledge.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $3
Completion $15
Input Cache Read $0.75

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
xAI
xAI | x-ai/grok-3 131K $3 / 1M tokens $15 / 1M tokens
xAI
xAI | x-ai/grok-3 131K $5 / 1M tokens $25 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by x-ai