Google: Gemini 2.5 Flash Preview 04-17

Text input Image input File input Text output Unavailable
Author's Description

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Note: This model is available in two variants: thinking and non-thinking. The output pricing varies significantly depending on whether the thinking capability is active. If you select the standard variant (without the ":thinking" suffix), the model will explicitly avoid generating thinking tokens. To utilize the thinking capability and receive thinking tokens, you must choose the ":thinking" variant, which will then incur the higher thinking-output pricing. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning).

Key Specifications
Cost
$$$
Context
1M
Released
Apr 17, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Structured Outputs Tool Choice Reasoning Include Reasoning Response Format Stop Top P Max Tokens Temperature
Features

This model supports the following features:

Tools Reasoning Response Format Structured Outputs
Performance Summary

Google's Gemini 2.5 Flash Preview 04-17 demonstrates strong performance as a state-of-the-art workhorse model. It performs among the fastest models, typically ranking in the top tier for speed (75th percentile). The model also offers competitive pricing, generally providing cost-effective solutions (67th percentile). A standout feature is its exceptional reliability, achieving a perfect 100% success rate across all benchmarks, indicating consistent and dependable operation. In terms of specific benchmarks, Gemini 2.5 Flash excels in classification tasks, achieving perfect 100% accuracy in Email Classification, making it the most accurate model at its price point and among models of similar speed. It also shows strong performance in General Knowledge (99.3% accuracy) and Ethics (99.0% accuracy), demonstrating its advanced reasoning capabilities. While its Coding performance (83.0% accuracy) is solid, it is not as dominant as its classification or knowledge-based results. The model's configurable "thinking" capability, while impacting pricing, allows for enhanced accuracy and nuanced context handling, positioning it as a versatile tool for advanced reasoning, coding, mathematics, and scientific tasks.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.15
Completion $0.6
Input Cache Read $0.0375
Input Cache Write $0.233

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Google
Google | google/gemini-2.5-flash-preview-04-17 1M $0.15 / 1M tokens $0.6 / 1M tokens
Google AI Studio
Google AI Studio | google/gemini-2.5-flash-preview-04-17 1M $0.15 / 1M tokens $0.6 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by google