Author's Description
Gemini Flash 1.5 8B is optimized for speed and efficiency, offering enhanced performance in small prompt tasks like chat, transcription, and translation. With reduced latency, it is highly effective for real-time and large-scale operations. This model focuses on cost-effective solutions while maintaining high-quality results. [Click here to learn more about this model](https://developers.googleblog.com/en/gemini-15-flash-8b-is-now-generally-available-for-use/). Usage of Gemini is subject to Google's [Gemini Terms of Use](https://ai.google.dev/terms).
Performance Summary
Google's Gemini 1.5 Flash 8B consistently ranks among the fastest models and is priced very competitively across the benchmarks listed here. It also achieved a 100% success rate, with minimal technical failures. The model is optimized for speed and efficiency, so it suits tasks where low latency and low cost matter most.

Its strongest benchmark results are in Hallucinations (98.0% accuracy), General Knowledge (97.0% accuracy), Ethics (98.0% accuracy), and Email Classification (98.0% accuracy). It is among the fastest models on the Hallucinations, Mathematics, and Reasoning benchmarks, and it achieves the best accuracy-to-cost ratio in Mathematics.

The model has one significant weakness: Instruction Following, where it scored 0.0% accuracy. Compared with its other results, its performance is moderate in Coding (80.0% accuracy), Mathematics (58.0% accuracy), and Reasoning (42.0% accuracy), so its speed on the latter two benchmarks does not come with high accuracy.

Overall, its core strengths are speed, cost-efficiency, and reliability, which make it well suited to real-time and large-scale operations where those factors are critical.
Model Pricing
Current Pricing
| Feature | Price (per 1M tokens) |
|---|---|
| Prompt | $0.0375 |
| Completion | $0.15 |
| Input Cache Read | $0.01 |
| Input Cache Write | $0.0583 |
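To make the per-million-token prices concrete, here is a minimal sketch that estimates the cost of a single request from its token counts. The function name `estimate_cost` and its parameters are hypothetical, chosen for illustration. The sketch also assumes that cached tokens are billed at the cache rates separately from ordinary prompt tokens, which this page does not confirm.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the pricing table above.
PRICE_PER_MILLION = {
    "prompt": Decimal("0.0375"),
    "completion": Decimal("0.15"),
    "cache_read": Decimal("0.01"),
    "cache_write": Decimal("0.0583"),
}


def estimate_cost(prompt_tokens=0, completion_tokens=0,
                  cache_read_tokens=0, cache_write_tokens=0):
    """Return the estimated cost in USD as a Decimal.

    Hypothetical helper: assumes each token category is billed
    independently at its listed per-million rate.
    """
    counts = {
        "prompt": prompt_tokens,
        "completion": completion_tokens,
        "cache_read": cache_read_tokens,
        "cache_write": cache_write_tokens,
    }
    # Multiply each count by its rate, then scale from per-million to per-token.
    total = sum(PRICE_PER_MILLION[kind] * n for kind, n in counts.items())
    return total / Decimal(1_000_000)


if __name__ == "__main__":
    # A short chat turn: 2,000 prompt tokens and 500 completion tokens.
    print(estimate_cost(prompt_tokens=2_000, completion_tokens=500))
```

`Decimal` is used instead of `float` so that very small per-request amounts do not pick up binary rounding noise when summed over many requests.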
Available Endpoints
| Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
|---|---|---|---|---|
| Google AI Studio | google/gemini-flash-1.5-8b | 1M | $0.0375 / 1M tokens | $0.15 / 1M tokens |
Benchmark Results
| Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
|---|---|---|---|---|---|---|---|---|
Other Models by Google
| Model | Released | Params | Context | Modalities | Speed | Ability | Cost |
|---|---|---|---|---|---|---|---|
| Google: Gemini 2.5 Flash Image (Nano Banana) | Oct 07, 2025 | — | 32K | Image input, Text input, Image output, Text output | — | — | $$$$ |
| Google: Gemini 2.5 Flash Preview 09-2025 | Sep 25, 2025 | — | 1M | Image input, Audio input, Video input, Text input, File input, Text output | ★★★★ | ★★★★★ | $$$$ |
| Google: Gemini 2.5 Flash Lite Preview 09-2025 | Sep 25, 2025 | — | 1M | Image input, Audio input, Video input, Text input, File input, Text output | ★★★★★ | ★★★★ | $$$ |
| Google: Gemini 2.5 Flash Image Preview (Nano Banana) | Aug 26, 2025 | — | 32K | Image input, Text input, Image output, Text output | — | — | $$$$ |
| Google: Gemini 2.5 Flash Lite | Jul 22, 2025 | — | 1M | Image input, Audio input, Video input, Text input, File input, Text output | ★★★★★ | ★★★ | $$$ |
| Google: Gemini 2.5 Flash Lite Preview 06-17 | Jun 17, 2025 | — | 1M | Image input, Audio input, Text input, File input, Text output | ★★★★ | ★★★ | $$$ |
| Google: Gemini 2.5 Flash | Jun 17, 2025 | — | 1M | Image input, Audio input, Video input, Text input, File input, Text output | ★★★★ | ★★★★ | $$$$$ |
| Google: Gemini 2.5 Pro | Jun 17, 2025 | — | 1M | Image input, Audio input, Video input, Text input, File input, Text output | ★ | ★★★★★ | $$$$$ |
| Google: Gemini 2.5 Pro Preview 06-05 | Jun 05, 2025 | — | 1M | Image input, Audio input, Text input, File input, Text output | ★ | ★★★★★ | $$$$$ |
| Google: Gemma 1 2B (Unavailable) | May 28, 2025 | 2B | 8K | Text input, Text output | — | — | $$ |
| Google: Gemma 3n 4B | May 20, 2025 | 4B | 32K | Text input, Text output | ★★★ | ★★★ | $ |
| Google: Gemini 2.5 Flash Preview 05-20 (Unavailable) | May 20, 2025 | — | 1M | Image input, Text input, File input, Text output | ★★★★ | ★★★★ | $$$ |
| Google: Gemini 2.5 Pro Preview 05-06 | May 06, 2025 | — | 1M | Image input, Audio input, Video input, Text input, File input, Text output | ★ | ★★★★★ | $$$$$ |
| Google: Gemini 2.5 Flash Preview 04-17 (Unavailable) | Apr 17, 2025 | — | 1M | Image input, Text input, File input, Text output | ★★★★★ | ★★★★★ | $$$ |
| Google: Gemini 2.5 Pro Experimental (Unavailable) | Mar 25, 2025 | — | 1M | Image input, Text input, File input, Text output | — | — | $ |
| Google: Gemma 3 4B | Mar 13, 2025 | 4B | 131K | Image input, Text input, Text output | ★★★ | ★★ | $$ |
| Google: Gemma 3 12B | Mar 13, 2025 | 12B | 131K | Image input, Text input, Text output | ★★★ | ★★ | $$ |
| Google: Gemma 3 27B | Mar 11, 2025 | 27B | 131K | Image input, Text input, Text output | ★★★ | ★★★ | $$ |
| Google: Gemini 2.0 Flash Lite | Feb 25, 2025 | — | 1M | Image input, Audio input, Text input, File input, Text output | ★★★★★ | ★★ | $$ |
| Google: Gemini 2.0 Flash | Feb 05, 2025 | — | 1M | Image input, Audio input, Text input, File input, Text output | ★★★★★ | ★★★ | $$ |
| Google: Gemma 2 27B | Jul 12, 2024 | 27B | 8K | Text input, Text output | ★★★★ | ★★ | $$$$ |
| Google: Gemma 2 9B | Jun 27, 2024 | 9B | 8K | Text input, Text output | ★★★ | ★ | $$ |
| Google: Gemini 1.5 Flash (Unavailable) | May 13, 2024 | ~500B | 1M | Image input, Text input, Text output | ★★★★★ | ★★★ | $$ |
| Google: Gemini 1.5 Pro (Unavailable) | Apr 08, 2024 | ~1T | 2M | Image input, Text input, Text output | ★★★★ | ★★★ | $$$$ |