Author's Description
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the [Reasoning API parameter](https://openrouter.ai/docs/use-cases/reasoning-tokens) to selectively trade off cost for intelligence.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Google's Gemini 2.5 Flash Lite demonstrates strong performance as a lightweight reasoning model, excelling in speed and reliability. It consistently ranks among the fastest models, achieving the 89th percentile across benchmarks, and offers competitive pricing, typically falling within the 69th percentile. Notably, its reliability is exceptional, with a 100% success rate across all evaluated benchmarks, indicating minimal technical failures. The model exhibits perfect accuracy in Email Classification and Ethics, with both benchmarks also highlighting its efficiency as the most accurate model at its price point and among models of comparable speed. It shows strong general knowledge (99.0% accuracy), performing as the most accurate among models of similar speed. While its Instruction Following (58.0% accuracy) and Reasoning (62.0% accuracy) capabilities are moderate, its Hallucinations score (90.0% accuracy) suggests a good ability to acknowledge uncertainty. Coding (79.0% accuracy) and Mathematics (77.0% accuracy) show room for improvement compared to top-tier models. Its primary strength lies in its combination of high speed, cost-effectiveness, and perfect reliability, making it suitable for latency-sensitive and budget-conscious applications.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.1 |
Completion | $0.4 |
Input Cache Read | $0.025 |
Input Cache Write | $0.183 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Google
|
Google | google/gemini-2.5-flash-lite | 1M | $0.1 / 1M tokens | $0.4 / 1M tokens |
Google AI Studio
|
Google AI Studio | google/gemini-2.5-flash-lite | 1M | $0.1 / 1M tokens | $0.4 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by google
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Google: Gemini 2.5 Flash Preview 09-2025 | Sep 25, 2025 | — | 1M |
Text input
Image input
File input
Text output
|
★★★★ | ★★★★★ | $$$$ |
Google: Gemini 2.5 Flash Lite Preview 09-2025 | Sep 25, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★★★★★ | ★★★★ | $$$ |
Google: Gemini 2.5 Flash Image Preview (Nano Banana) | Aug 26, 2025 | — | 32K |
Text input
Image input
Text output
Image output
|
— | — | $$$$$ |
Google: Gemini 2.5 Flash Lite Preview 06-17 | Jun 17, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★★★★ | ★★★ | $$$ |
Google: Gemini 2.5 Flash | Jun 17, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★★★★ | ★★★★ | $$$$$ |
Google: Gemini 2.5 Pro | Jun 17, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemini 2.5 Pro Preview 06-05 | Jun 05, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemma 1 2B Unavailable | May 28, 2025 | 2B | 8K |
Text input
Text output
|
— | — | $$ |
Google: Gemma 3n 4B | May 20, 2025 | 4B | 32K |
Text input
Text output
|
★★★ | ★★★ | $ |
Google: Gemini 2.5 Flash Preview 05-20 Unavailable | May 20, 2025 | — | 1M |
Text input
Image input
File input
Text output
|
★★★★ | ★★★★ | $$$ |
Google: Gemini 2.5 Pro Preview 05-06 | May 06, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemini 2.5 Flash Preview 04-17 Unavailable | Apr 17, 2025 | — | 1M |
Text input
Image input
File input
Text output
|
★★★★★ | ★★★★★ | $$$ |
Google: Gemini 2.5 Pro Experimental Unavailable | Mar 25, 2025 | — | 1M |
Text input
Image input
File input
Text output
|
— | — | $ |
Google: Gemma 3 4B | Mar 13, 2025 | 4B | 131K |
Text input
Image input
Text output
|
★★★ | ★★ | $$ |
Google: Gemma 3 12B | Mar 13, 2025 | 12B | 131K |
Text input
Image input
Text output
|
★★★ | ★★ | $$ |
Google: Gemma 3 27B | Mar 11, 2025 | 27B | 131K |
Text input
Image input
Text output
|
★★★ | ★★★ | $$ |
Google: Gemini 2.0 Flash Lite | Feb 25, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★★★★★ | ★★ | $$ |
Google: Gemini 2.0 Flash | Feb 05, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★★★★★ | ★★★ | $$ |
Google: Gemini 1.5 Flash 8B Unavailable | Oct 02, 2024 | ~500B | 1M |
Text input
Image input
Text output
|
★★★★★ | ★★ | $ |
Google: Gemma 2 27B | Jul 12, 2024 | 27B | 8K |
Text input
Text output
|
★★★★ | ★★ | $$$$ |
Google: Gemma 2 9B | Jun 27, 2024 | 9B | 8K |
Text input
Text output
|
★★★ | ★ | $$ |
Google: Gemini 1.5 Flash Unavailable | May 13, 2024 | ~500B | 1M |
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$ |
Google: Gemini 1.5 Pro Unavailable | Apr 08, 2024 | ~1T | 2M |
Text input
Image input
Text output
|
★★★★ | ★★★ | $$$$ |