Google: Gemini 2.5 Flash

Image input Text input File input Text output
Description

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning).

Key Specifications
Context Length

1M

Parameters

Unknown

Created

Jun 17, 2025

Supported Parameters

This model supports the following parameters:

Top P Structured Outputs Temperature Stop Tools Tool Choice Response Format Max Tokens
Features

This model supports the following features:

Structured Outputs Response Format Tools
Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.3
Completion $2.5
Input Cache Read $0.075
Input Cache Write $0.383

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Google
Google | google/gemini-2.5-flash 1M $0.3 / 1M tokens $2.5 / 1M tokens
Google AI Studio
Google AI Studio | google/gemini-2.5-flash 1M $0.3 / 1M tokens $2.5 / 1M tokens
Benchmark Performance Summary
Benchmark Category Reasoning Free Executions Accuracy Cost Duration