Google: Gemma 4 26B A4B

Text input Image input Video input Text output Unavailable
Author's Description

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at a fraction of the compute cost. Supports multimodal input including text, images, and video (up to 60s at 1fps). Features a 256K token context window, native function calling, configurable thinking/reasoning mode, and structured output support. Released under Apache 2.0.

Key Specifications
Cost
$$
Context
262K
Parameters
26B
Released
Apr 03, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Structured Outputs Response Format Reasoning Temperature Presence Penalty Include Reasoning Tools Frequency Penalty Top P Stop Tool Choice Max Tokens Logit Bias
Features

This model supports the following features:

Structured Outputs Response Format Tools Reasoning
Performance Summary

Google's Gemma 4 26B A4B IT, an instruction-tuned Mixture-of-Experts model, demonstrates a strong overall performance profile. It performs among the fastest models, typically ranking in the top tier for speed (65th percentile), and offers competitive pricing, generally providing cost-effective solutions (76th percentile). A standout feature is its exceptional reliability, achieving a 100% success rate across all benchmarks, indicating minimal technical failures and consistent response delivery. In terms of specific benchmark performance, the model excels in acknowledging uncertainty, achieving 98.0% accuracy in Hallucinations (Baseline) tests, placing it in the 72nd percentile. This suggests a robust ability to identify and decline to answer questions based on fictional information. Its Instruction Following (Baseline) capability is solid, with 67.3% accuracy (70th percentile), indicating proficiency in handling complex, multi-step instructions. While its Email Classification (Baseline) accuracy is 97.0%, its 45th percentile ranking suggests it performs adequately but not exceptionally compared to other models in this specific task. Key strengths include its high reliability, strong hallucination avoidance, and efficient inference due to its MoE architecture. A potential area for improvement, relative to its other strong performances, could be its comparative standing in classification tasks.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.13
Completion $0.4

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Parasail
Parasail | google/gemma-4-26b-a4b-it-20260403 262K $0.13 / 1M tokens $0.4 / 1M tokens
Novita
Novita | google/gemma-4-26b-a4b-it-20260403 262K $0.13 / 1M tokens $0.4 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by google