Author's Description
Gemini Flash 1.5 8B is optimized for speed and efficiency, offering enhanced performance in small prompt tasks like chat, transcription, and translation. With reduced latency, it is highly effective for real-time and large-scale operations. This model focuses on cost-effective solutions while maintaining high-quality results. [Click here to learn more about this model](https://developers.googleblog.com/en/gemini-15-flash-8b-is-now-generally-available-for-use/). Usage of Gemini is subject to Google's [Gemini Terms of Use](https://ai.google.dev/terms).
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Google's Gemini 1.5 Flash 8B consistently ranks among the fastest models available, demonstrating exceptional speed across various benchmarks. It also offers highly competitive pricing, making it a cost-effective solution for a wide range of applications. The model exhibits outstanding reliability with a 100% success rate across all evaluated benchmarks, indicating minimal technical failures and consistent response delivery. In terms of specific performance, Gemini 1.5 Flash 8B shows strong capabilities in Email Classification (98% accuracy) and Ethics (98% accuracy), performing well within the top percentiles for these categories. Its General Knowledge performance is also commendable at 97% accuracy, notably being the most accurate model at its price point and ranking among the top three in speed for this category. While its Coding (Baseline) accuracy is moderate at 80%, its Reasoning (Baseline) accuracy is lower at 45%, suggesting an area for potential improvement in complex logical problem-solving. A significant weakness is observed in Instruction Following, where it achieved 0% accuracy, indicating a limitation in handling multi-layered or highly complex instructions. Despite this, its overall speed and cost-efficiency, coupled with high reliability, position it as a strong contender for real-time and large-scale operations, particularly for tasks like chat, transcription, and translation.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.0375 |
Completion | $0.15 |
Input Cache Read | $0.01 |
Input Cache Write | $0.0583 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Google AI Studio
|
Google AI Studio | google/gemini-flash-1.5-8b | 1M | $0.0375 / 1M tokens | $0.15 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by google
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Google: Gemini 2.5 Flash Lite | Jul 22, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$$ |
Google: Gemini 2.5 Flash Lite Preview 06-17 | Jun 17, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★★ | ★★★ | $$$ |
Google: Gemini 2.5 Flash | Jun 17, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★ | ★★★★★ | $$$$$ |
Google: Gemini 2.5 Pro | Jun 17, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemini 2.5 Pro Preview 06-05 | Jun 05, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemma 1 2B Unavailable | May 28, 2025 | 2B | 8K |
Text input
Text output
|
— | — | $$ |
Google: Gemma 3n 4B | May 20, 2025 | 4B | 32K |
Text input
Text output
|
★★★ | ★★★ | $ |
Google: Gemini 2.5 Flash Preview 05-20 Unavailable | May 20, 2025 | — | 1M |
File input
Text input
Image input
Text output
|
★★★★ | ★★★★★ | $$$ |
Google: Gemini 2.5 Pro Preview 05-06 | May 06, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemini 2.5 Flash Preview 04-17 Unavailable | Apr 17, 2025 | — | 1M |
File input
Text input
Image input
Text output
|
★★★★ | ★★★★★ | $$$ |
Google: Gemini 2.5 Pro Experimental | Mar 25, 2025 | — | 1M |
File input
Text input
Image input
Text output
|
— | — | $ |
Google: Gemma 3 4B | Mar 13, 2025 | 4B | 131K |
Text input
Image input
Text output
|
★★★ | ★★ | $ |
Google: Gemma 3 12B | Mar 13, 2025 | 12B | 131K |
Text input
Image input
Text output
|
★★ | ★★ | $$ |
Google: Gemma 3 27B | Mar 11, 2025 | 27B | 131K |
Text input
Image input
Text output
|
★★ | ★★★ | $$ |
Google: Gemini 2.0 Flash Lite | Feb 25, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★★★ | ★★ | $$ |
Google: Gemini 2.0 Flash | Feb 05, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$ |
Google: Gemma 2 27B | Jul 12, 2024 | 27B | 8K |
Text input
Text output
|
★★★★★ | ★★ | $$$$ |
Google: Gemma 2 9B | Jun 27, 2024 | 9B | 8K |
Text input
Text output
|
★★★★★ | ★ | $$ |
Google: Gemini 1.5 Flash | May 13, 2024 | ~500B | 1M |
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$ |
Google: Gemini 1.5 Pro | Apr 08, 2024 | ~1T | 2M |
Text input
Image input
Text output
|
★★★ | ★★★ | $$$$ |