Author's Description
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the [Reasoning API parameter](https://openrouter.ai/docs/use-cases/reasoning-tokens) to selectively trade off cost for intelligence.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Gemini 2.5 Flash Lite Preview 06-17 demonstrates strong overall performance, particularly excelling in speed and reliability. It consistently performs among the fastest models, ranking in the 77th percentile for speed across various benchmarks. The model also offers competitive pricing, placing in the 66th percentile for cost-effectiveness. Notably, its reliability is exceptional, achieving a perfect 100th percentile, indicating a highly stable and dependable service with minimal technical failures. In terms of benchmark performance, Gemini 2.5 Flash Lite shows particular strength in classification and knowledge-based tasks, achieving 99.0% accuracy in Email Classification and 98.5% in General Knowledge. It also performs well in Ethics (99.0% accuracy) and Instruction Following (60.0% accuracy), demonstrating good precision and adherence to complex directives. While its Reasoning accuracy is solid at 68.0%, its performance in Coding (78.5% accuracy) is more moderate compared to other categories. The model's design prioritizes speed, with "thinking" disabled by default, which contributes to its rapid token generation and throughput, making it ideal for latency-sensitive applications. Developers can, however, enable multi-pass reasoning for tasks requiring higher intelligence at a trade-off in cost.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.1 |
Completion | $0.4 |
Input Cache Read | $0.025 |
Input Cache Write | $0.183 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Google
|
Google | google/gemini-2.5-flash-lite-preview-06-17 | 1M | $0.1 / 1M tokens | $0.4 / 1M tokens |
Google AI Studio
|
Google AI Studio | google/gemini-2.5-flash-lite-preview-06-17 | 1M | $0.1 / 1M tokens | $0.4 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by google
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Google: Gemini 2.5 Flash Lite | Jul 22, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$$ |
Google: Gemini 2.5 Flash | Jun 17, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★ | ★★★★★ | $$$$$ |
Google: Gemini 2.5 Pro | Jun 17, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemini 2.5 Pro Preview 06-05 | Jun 05, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemma 1 2B Unavailable | May 28, 2025 | 2B | 8K |
Text input
Text output
|
— | — | $$ |
Google: Gemma 3n 4B | May 20, 2025 | 4B | 32K |
Text input
Text output
|
★★★ | ★★★ | $ |
Google: Gemini 2.5 Flash Preview 05-20 Unavailable | May 20, 2025 | — | 1M |
File input
Text input
Image input
Text output
|
★★★★ | ★★★★★ | $$$ |
Google: Gemini 2.5 Pro Preview 05-06 | May 06, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemini 2.5 Flash Preview 04-17 Unavailable | Apr 17, 2025 | — | 1M |
File input
Text input
Image input
Text output
|
★★★★ | ★★★★★ | $$$ |
Google: Gemini 2.5 Pro Experimental | Mar 25, 2025 | — | 1M |
File input
Text input
Image input
Text output
|
— | — | $ |
Google: Gemma 3 4B | Mar 13, 2025 | 4B | 131K |
Text input
Image input
Text output
|
★★★ | ★★ | $ |
Google: Gemma 3 12B | Mar 13, 2025 | 12B | 131K |
Text input
Image input
Text output
|
★★ | ★★ | $$ |
Google: Gemma 3 27B | Mar 11, 2025 | 27B | 131K |
Text input
Image input
Text output
|
★★ | ★★★ | $$ |
Google: Gemini 2.0 Flash Lite | Feb 25, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★★★ | ★★ | $$ |
Google: Gemini 2.0 Flash | Feb 05, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$ |
Google: Gemini 1.5 Flash 8B | Oct 02, 2024 | ~500B | 1M |
Text input
Image input
Text output
|
★★★★★ | ★★ | $$ |
Google: Gemma 2 27B | Jul 12, 2024 | 27B | 8K |
Text input
Text output
|
★★★★★ | ★★ | $$$$ |
Google: Gemma 2 9B | Jun 27, 2024 | 9B | 8K |
Text input
Text output
|
★★★★★ | ★ | $$ |
Google: Gemini 1.5 Flash | May 13, 2024 | ~500B | 1M |
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$ |
Google: Gemini 1.5 Pro | Apr 08, 2024 | ~1T | 2M |
Text input
Image input
Text output
|
★★★ | ★★★ | $$$$ |