Author's Description
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the [Reasoning API parameter](https://openrouter.ai/docs/use-cases/reasoning-tokens) to selectively trade off cost for intelligence.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Gemini 2.5 Flash Lite consistently performs among the fastest models, ranking in the 86th percentile across six benchmarks, and offers competitive pricing, typically providing cost-effective solutions in the 68th percentile. Notably, it demonstrates exceptional reliability with a 100% success rate across all evaluated benchmarks, indicating minimal technical failures. In terms of specific performance, the model achieved perfect accuracy in both Ethics and Email Classification, standing out as the most accurate model at its price point and among models of comparable speed in these categories. It also showed strong performance in General Knowledge with 99% accuracy, again being the most accurate among models of similar speed. However, its performance in Instruction Following (58% accuracy), Coding (79% accuracy), and Reasoning (60% accuracy) was more moderate, suggesting areas where its "lite" nature, particularly with disabled multi-pass reasoning by default, might impact complex task execution. Its strength lies in rapid, accurate responses for well-defined tasks, making it highly suitable for applications prioritizing speed and cost efficiency.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.1 |
Completion | $0.4 |
Input Cache Read | $0.025 |
Input Cache Write | $0.183 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Google
|
Google | google/gemini-2.5-flash-lite | 1M | $0.1 / 1M tokens | $0.4 / 1M tokens |
Google AI Studio
|
Google AI Studio | google/gemini-2.5-flash-lite | 1M | $0.1 / 1M tokens | $0.4 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by google
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Google: Gemini 2.5 Flash Lite Preview 06-17 | Jun 17, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★★ | ★★★ | $$$ |
Google: Gemini 2.5 Flash | Jun 17, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★ | ★★★★★ | $$$$$ |
Google: Gemini 2.5 Pro | Jun 17, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemini 2.5 Pro Preview 06-05 | Jun 05, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemma 1 2B Unavailable | May 28, 2025 | 2B | 8K |
Text input
Text output
|
— | — | $$ |
Google: Gemma 3n 4B | May 20, 2025 | 4B | 32K |
Text input
Text output
|
★★★ | ★★★ | $ |
Google: Gemini 2.5 Flash Preview 05-20 Unavailable | May 20, 2025 | — | 1M |
File input
Text input
Image input
Text output
|
★★★★ | ★★★★★ | $$$ |
Google: Gemini 2.5 Pro Preview 05-06 | May 06, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemini 2.5 Flash Preview 04-17 Unavailable | Apr 17, 2025 | — | 1M |
File input
Text input
Image input
Text output
|
★★★★ | ★★★★★ | $$$ |
Google: Gemini 2.5 Pro Experimental | Mar 25, 2025 | — | 1M |
File input
Text input
Image input
Text output
|
— | — | $ |
Google: Gemma 3 4B | Mar 13, 2025 | 4B | 131K |
Text input
Image input
Text output
|
★★★ | ★★ | $ |
Google: Gemma 3 12B | Mar 13, 2025 | 12B | 131K |
Text input
Image input
Text output
|
★★ | ★★ | $$ |
Google: Gemma 3 27B | Mar 11, 2025 | 27B | 131K |
Text input
Image input
Text output
|
★★ | ★★★ | $$ |
Google: Gemini 2.0 Flash Lite | Feb 25, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★★★ | ★★ | $$ |
Google: Gemini 2.0 Flash | Feb 05, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$ |
Google: Gemini 1.5 Flash 8B | Oct 02, 2024 | ~500B | 1M |
Text input
Image input
Text output
|
★★★★★ | ★★ | $$ |
Google: Gemma 2 27B | Jul 12, 2024 | 27B | 8K |
Text input
Text output
|
★★★★★ | ★★ | $$$$ |
Google: Gemma 2 9B | Jun 27, 2024 | 9B | 8K |
Text input
Text output
|
★★★★★ | ★ | $$ |
Google: Gemini 1.5 Flash | May 13, 2024 | ~500B | 1M |
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$ |
Google: Gemini 1.5 Pro | Apr 08, 2024 | ~1T | 2M |
Text input
Image input
Text output
|
★★★ | ★★★ | $$$$ |