Google: Gemini 2.5 Flash

Text input Image input File input Audio input Text output
Author's Description

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning).

Key Specifications
Cost
$$$$$
Context
1M
Released
Jun 17, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Structured Outputs Tool Choice Reasoning Include Reasoning Response Format Stop Seed Top P Max Tokens Temperature
Features

This model supports the following features:

Tools Reasoning Response Format Structured Outputs
Performance Summary

Google's Gemini 2.5 Flash, a state-of-the-art workhorse model, demonstrates strong overall performance, particularly excelling in reliability and specific accuracy benchmarks. It performs among the fastest models, ranking in the top tier for speed (66th percentile), and offers moderate pricing (34th percentile). Notably, Gemini 2.5 Flash exhibits exceptional reliability with a 100% success rate across all benchmarks, indicating consistent and usable responses. The model shows remarkable accuracy in General Knowledge and Ethics, achieving perfect scores in both categories, and is highlighted as the most accurate at its price point and speed for these tasks. Its Instruction Following capabilities are also very strong at 75.0% accuracy (87th percentile). While its Hallucinations accuracy is 96.0%, suggesting a good ability to acknowledge uncertainty, its duration for this test is notably high. Performance in Reasoning (80.0% accuracy) and Email Classification (99.0% accuracy) is also robust. Mathematics and Coding benchmarks show solid, though not top-tier, performance at 86.0% and 83.0% accuracy respectively. The configurable "max tokens for reasoning" parameter further enhances its utility for complex tasks.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.3
Completion $2.5
Input Cache Read $0.075
Input Cache Write $0.383

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Google
Google | google/gemini-2.5-flash 1M $0.3 / 1M tokens $2.5 / 1M tokens
Google AI Studio
Google AI Studio | google/gemini-2.5-flash 1M $0.3 / 1M tokens $2.5 / 1M tokens
Google
Google | google/gemini-2.5-flash 1M $0.3 / 1M tokens $2.5 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by google