xAI: Grok 4.20 Beta

Image input Text input Text output Unavailable
Author's Description

Grok 4.20 Beta is xAI's newest flagship model with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherance, delivering consistently precise and truthful responses. Reasoning can be enabled/disabled using the `reasoning` `enabled` parameter in the API. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#controlling-reasoning-tokens)

Key Specifications
Cost
$$$$$
Context
2M
Released
Mar 12, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Logprobs Tool Choice Include Reasoning Tools Response Format Temperature Max Tokens Reasoning Structured Outputs Top Logprobs Top P Seed
Features

This model supports the following features:

Structured Outputs Tools Reasoning Response Format
Performance Summary

Grok 4.20 Beta, xAI's latest flagship model, demonstrates strong performance across several key metrics. It consistently ranks among the fastest models, placing in the 92nd percentile for speed across eight benchmarks. Its pricing is moderate, positioned in the 22nd percentile, offering a balanced cost-to-performance ratio. The model exhibits exceptional reliability with a 99% success rate, indicating minimal technical failures and consistent response delivery. In terms of benchmark performance, Grok 4.20 Beta excels in Ethics, achieving perfect 100% accuracy, making it the most accurate model at its price point and among models of similar speed. It also shows impressive accuracy in General Knowledge (99%) and Mathematics (94%), often being the most accurate among models of comparable speed. Its Coding capabilities are strong at 92% accuracy. However, its performance in Hallucinations (90% accuracy, 38th percentile) and Instruction Following (57% accuracy, 53rd percentile) suggests areas for improvement, particularly given the description of "lowest hallucination rate on the market" and "strict prompt adherence." While its Reasoning accuracy is moderate at 74%, the model's agentic tool calling capabilities, as highlighted in its description, are not directly reflected in these benchmarks.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $2
Completion $6
Input Cache Read $0.2
Web Search $5000

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
xAI
xAI | x-ai/grok-4.20-beta-20260309 2M $2 / 1M tokens $6 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by x-ai