Google: Gemini 2.5 Flash

Video input Image input File input Text input Audio input Text output
Author's Description

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

Key Specifications
Cost
$$$$
Context
1M
Released
Jun 17, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Response Format Tool Choice Include Reasoning Temperature Tools Max Tokens Reasoning Stop Structured Outputs Seed Top P
Features

This model supports the following features:

Structured Outputs Reasoning Tools Response Format
Performance Summary

Google's Gemini 2.5 Flash demonstrates strong overall performance, particularly excelling in reliability with a perfect 100% success rate across all benchmarks, indicating exceptional stability and consistent response delivery. In terms of speed, it performs among the faster models, typically ranking in the top tier (68th percentile). Its pricing is moderate, positioned in the 37th percentile. The model showcases remarkable accuracy in specific domains, achieving perfect scores in both General Knowledge and Ethics benchmarks, often at competitive price points and speeds. It also performs very well in Instruction Following (82nd percentile accuracy) and Email Classification (99% accuracy, 82nd percentile). While its Hallucinations accuracy is respectable at 96%, its duration for this test is notably high. Performance in Reasoning, Mathematics, and Coding is solid, though not top-tier, with accuracies of 80%, 86%, and 83% respectively, placing it in the 61st, 46th, and 46th percentiles. The built-in "thinking" capabilities and configurable "max tokens for reasoning" parameter likely contribute to its advanced reasoning and nuanced context handling, making it a robust choice for complex tasks.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.3
Completion $2.5
Input Cache Read $0.03
Input Cache Write $0.0833
Internal Reasoning $2.5
Web Search $14000

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Google
Google | google/gemini-2.5-flash 1M $0.3 / 1M tokens $2.5 / 1M tokens
Google AI Studio
Google AI Studio | google/gemini-2.5-flash 1M $0.3 / 1M tokens $2.5 / 1M tokens
Google
Google | google/gemini-2.5-flash 1M $0.3 / 1M tokens $2.5 / 1M tokens
Google
Google | google/gemini-2.5-flash 1M $0.3 / 1M tokens $2.5 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by google