Google: Gemini 2.5 Flash

File input Text input Image input Audio input Video input Text output
Author's Description

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

Key Specifications
Cost
$$$$
Context
1M
Released
Jun 17, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Tools Structured Outputs Top P Response Format Reasoning Temperature Stop Include Reasoning Tool Choice Max Tokens
Features

This model supports the following features:

Structured Outputs Response Format Tools Reasoning
Performance Summary

Google's Gemini 2.5 Flash demonstrates strong overall performance, particularly excelling in reliability with a perfect 100% success rate across all benchmarks, indicating exceptional stability and consistent response delivery. In terms of speed, it performs among the faster models, typically ranking in the top tier (68th percentile). Its pricing is moderate, positioned in the 37th percentile. The model showcases remarkable accuracy in specific domains, achieving perfect scores in both General Knowledge and Ethics benchmarks, often at competitive price points and speeds. It also performs very well in Instruction Following (82nd percentile accuracy) and Email Classification (99% accuracy, 82nd percentile). While its Hallucinations accuracy is respectable at 96%, its duration for this test is notably high. Performance in Reasoning, Mathematics, and Coding is solid, though not top-tier, with accuracies of 80%, 86%, and 83% respectively, placing it in the 61st, 46th, and 46th percentiles. The built-in "thinking" capabilities and configurable "max tokens for reasoning" parameter likely contribute to its advanced reasoning and nuanced context handling, making it a robust choice for complex tasks.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.3
Completion $2.5
Input Cache Read $0.03
Input Cache Write $0.0833
Internal Reasoning $2.5

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Google
Google | google/gemini-2.5-flash 1M $0.3 / 1M tokens $2.5 / 1M tokens
Google AI Studio
Google AI Studio | google/gemini-2.5-flash 1M $0.3 / 1M tokens $2.5 / 1M tokens
Google
Google | google/gemini-2.5-flash 1M $0.3 / 1M tokens $2.5 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by google