Google: Gemma 4 31B

Text input Image input Video input Text output Unavailable
Author's Description

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function calling, and multilingual support across 140+ languages. Strong on coding, reasoning, and document understanding tasks. Apache 2.0 license.

Key Specifications
Cost
$$
Context
262K
Parameters
31B
Released
Apr 02, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Frequency Penalty Top P Reasoning Temperature Stop Presence Penalty Include Reasoning Max Tokens
Features

This model supports the following features:

Reasoning
Performance Summary

Google's Gemma 4 31B Instruct model demonstrates a strong overall performance profile, particularly excelling in reliability and cost-effectiveness. It consistently provides usable responses, achieving a perfect 100% success rate across all benchmarks, indicating exceptional stability. The model offers competitive pricing, ranking in the 73rd percentile for cost across various tasks. While its speed performance is competitive, ranking in the 41st percentile, it is not among the fastest models available. Key strengths include perfect accuracy in Hallucinations (Baseline) and Ethics (Baseline), where it also stands out as the most accurate model at its price point and speed. It shows impressive accuracy in Instruction Following (93rd percentile), Email Classification (90th percentile), and Mathematics (95th percentile), often being the most accurate among models of comparable speed. The model also performs very well in Coding (89th percentile). Its ability to handle a 256K token context window, support multimodal input, and offer configurable reasoning modes further enhances its versatility. While its General Knowledge and Reasoning scores are solid, they are not as high-ranking as its other top performances.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.14
Completion $0.4

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Novita
Novita | google/gemma-4-31b-it-20260402 262K $0.14 / 1M tokens $0.4 / 1M tokens
Parasail
Parasail | google/gemma-4-31b-it-20260402 262K $0.14 / 1M tokens $0.4 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by google