Google: Gemini 2.5 Flash Preview 05-20

File input Text input Image input Text output Unavailable
Author's Description

Gemini 2.5 Flash May 20th Checkpoint is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Note: This model is available in two variants: thinking and non-thinking. The output pricing varies significantly depending on whether the thinking capability is active. If you select the standard variant (without the ":thinking" suffix), the model will explicitly avoid generating thinking tokens. To utilize the thinking capability and receive thinking tokens, you must choose the ":thinking" variant, which will then incur the higher thinking-output pricing. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning).

Key Specifications
Cost
$$$
Context
1M
Released
May 20, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Include Reasoning Stop Top P Tool Choice Temperature Tools Structured Outputs Response Format Reasoning Max Tokens
Features

This model supports the following features:

Tools Reasoning Structured Outputs Response Format
Performance Summary

Google's Gemini 2.5 Flash Preview 05-20 demonstrates strong overall performance, positioning itself as a robust workhorse model. In terms of speed, it consistently performs among the faster models, typically ranking in the top 75th percentile across various benchmarks. Its pricing is competitive, generally offering cost-effective solutions and ranking in the 64th percentile for affordability. A standout feature is its exceptional reliability, achieving a perfect 100th percentile, indicating minimal technical failures and consistent response delivery. Across benchmarks, Gemini 2.5 Flash exhibits particular strengths in Ethics, achieving perfect accuracy and being noted as the most accurate model at its price point and speed. It also excels in Email Classification and General Knowledge, with 99.0% accuracy in both, showcasing its strong classification and broad knowledge capabilities. While its Coding performance is solid at 83.0% accuracy, it's not its absolute strongest area. Reasoning also shows good performance at 82.0% accuracy. The model's built-in "thinking" capabilities, configurable via "max tokens for reasoning," offer enhanced accuracy and nuanced context handling, though this comes with a higher cost for the ":thinking" variant. Overall, Gemini 2.5 Flash is a highly reliable and accurate model, particularly suited for tasks requiring strong ethical reasoning, classification, and general knowledge, with good performance in coding and complex reasoning.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.15
Completion $0.6
Input Cache Read $0.0375
Input Cache Write $0.233

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Google
Google | google/gemini-2.5-flash-preview-05-20 1M $0.15 / 1M tokens $0.6 / 1M tokens
Google AI Studio
Google AI Studio | google/gemini-2.5-flash-preview-05-20 1M $0.15 / 1M tokens $0.6 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by google