xAI: Grok 4.20 Multi-Agent Beta

Image input File input Text input Text output Unavailable
Author's Description

Grok 4.20 Multi-Agent Beta is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesize information across complex tasks. Reasoning effort behavior: - low / medium: 4 agents - high / xhigh: 16 agents

Key Specifications
Cost
$$$$$
Context
2M
Released
Mar 12, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Logprobs Include Reasoning Response Format Temperature Max Tokens Reasoning Structured Outputs Top Logprobs Top P Seed
Features

This model supports the following features:

Structured Outputs Reasoning Response Format
Performance Summary

xAI's Grok 4.20 Multi-Agent Beta, released on March 12, 2026, is a specialized variant designed for collaborative, multi-agent workflows, leveraging 4 agents for low/medium reasoning and 16 for high/xhigh reasoning. The model demonstrates competitive response times, performing at the 45th percentile across benchmarks. However, it is positioned at a premium pricing level, ranking in the 1st percentile for cost. A standout feature is its exceptional reliability, achieving a 100% success rate across all benchmarks, indicating minimal technical failures. In terms of benchmark performance, Grok 4.20 Multi-Agent Beta excels in critical areas. It achieved perfect accuracy (100.0%) in the Hallucinations (Baseline) test, effectively acknowledging uncertainty and avoiding fabricated responses. This performance also positions it as the most accurate model at its price point and among models of similar speed for this task. Similarly, it achieved perfect accuracy (100.0%) in the Reasoning (Baseline) benchmark, showcasing strong capabilities in complex problem-solving, logic, and pattern recognition, again being the most accurate at its price and speed. While its Email Classification (Baseline) accuracy was a respectable 98.0%, it ranked in the 49th percentile, suggesting room for improvement compared to top performers in this specific category. Overall, its key strengths lie in its multi-agent architecture enabling high accuracy in reasoning and hallucination avoidance, coupled with outstanding reliability, though its premium pricing is a notable consideration.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $2
Completion $6
Input Cache Read $0.2
Web Search $5000

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
xAI
xAI | x-ai/grok-4.20-multi-agent-beta-20260309 2M $2 / 1M tokens $6 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by x-ai