Google: Gemini 2.5 Flash

Audio input File input Text input Image input Text output
Author's Description

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning).

Key Specifications
Cost
$$$$$
Context
1M
Released
Jun 17, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Include Reasoning Stop Tool Choice Top P Temperature Seed Tools Structured Outputs Response Format Reasoning Max Tokens
Features

This model supports the following features:

Tools Reasoning Structured Outputs Response Format
Performance Summary

Google's Gemini 2.5 Flash demonstrates competitive response times, performing among the faster models with a 52nd percentile speed ranking. Its pricing is moderate, positioned at the 30th percentile, offering a balanced cost-efficiency. A standout feature is its exceptional reliability, achieving a perfect 100th percentile, indicating virtually no technical failures and consistent response delivery. The model excels in several key areas. It achieved perfect accuracy in both Ethics and General Knowledge benchmarks, notably being the most accurate model at its price point and speed for these categories. Its Reasoning capabilities are also highly impressive, with 99.0% accuracy, placing it in the 97th percentile. Instruction Following is another strong suit, with 75.0% accuracy, ranking in the 91st percentile. Email Classification also shows strong performance at 99.0% accuracy. While its Coding performance is solid at 83.0% accuracy, it represents a relative area for potential improvement compared to its near-perfect scores in other domains. Overall, Gemini 2.5 Flash is a robust workhorse model, particularly strong in reasoning, knowledge, and ethical considerations, making it highly suitable for complex analytical and content generation tasks.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.3
Completion $2.5
Input Cache Read $0.075
Input Cache Write $0.383

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Google
Google | google/gemini-2.5-flash 1M $0.3 / 1M tokens $2.5 / 1M tokens
Google AI Studio
Google AI Studio | google/gemini-2.5-flash 1M $0.3 / 1M tokens $2.5 / 1M tokens
Google
Google | google/gemini-2.5-flash 1M $0.3 / 1M tokens $2.5 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by google