Google: Gemini 2.0 Flash

Text input Image input File input Audio input Text output
Author's Description

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5). It introduces notable enhancements in multimodal understanding, coding capabilities, complex instruction following, and function calling. These advancements come together to deliver more seamless and robust agentic experiences.

Key Specifications
Cost
$$
Context
1M
Released
Feb 05, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Structured Outputs Tool Choice Response Format Stop Seed Top P Max Tokens Temperature
Features

This model supports the following features:

Tools Response Format Structured Outputs
Performance Summary

Google's Gemini 2.0 Flash model demonstrates exceptional performance across several key metrics. It consistently ranks among the fastest models, achieving the 88th percentile across 8 benchmarks, and offers competitive pricing, typically providing cost-effective solutions at the 78th percentile. Notably, its reliability is outstanding, with a 100% success rate across all benchmarks, indicating minimal technical failures. In terms of specific benchmark results, Gemini 2.0 Flash exhibits perfect accuracy in General Knowledge and Ethics, often being the most accurate model at its price point and speed. It also performs strongly in Mathematics (88.0% accuracy) and Reasoning (78.0% accuracy). While its Hallucinations benchmark shows 90.0% accuracy, suggesting some instances of not acknowledging uncertainty, its Email Classification is robust at 98.0%. The model's Coding capabilities are solid at 81.0% accuracy. However, a notable weakness is observed in Instruction Following, where it achieves 38.4% accuracy, indicating room for improvement in handling complex, multi-layered instructions. Overall, Gemini 2.0 Flash excels in speed, cost-efficiency, and reliability, making it a strong contender for applications requiring rapid, dependable responses, particularly in knowledge-based and ethical reasoning tasks.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.1
Completion $0.4
Input Cache Read $0.025
Input Cache Write $0.183

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Google AI Studio
Google AI Studio | google/gemini-2.0-flash-001 1M $0.1 / 1M tokens $0.4 / 1M tokens
Google
Google | google/gemini-2.0-flash-001 1M $0.15 / 1M tokens $0.6 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by google