Google: Gemini 2.0 Flash Lite

File input Text input Image input Audio input Video input Text output
Author's Description

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5),...

Key Specifications
Cost
$$
Context
1M
Released
Feb 25, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Tools Structured Outputs Top P Response Format Temperature Stop Tool Choice Max Tokens
Features

This model supports the following features:

Structured Outputs Response Format Tools
Performance Summary

Google's Gemini 2.0 Flash Lite, created on February 25, 2025, is designed for speed and cost-efficiency, offering a significantly faster time to first token (TTFT) than its predecessor, Gemini Flash 1.5, while aiming for quality comparable to Gemini Pro 1.5. The model consistently ranks among the fastest, achieving an Infinityth percentile across 8 benchmarks, and offers highly competitive pricing, also at an Infinityth percentile. Its reliability is exceptional, demonstrating a 100% success rate across all 8 benchmarks. In terms of performance, Gemini 2.0 Flash Lite excels in Ethics, achieving perfect accuracy and being the most accurate model at its price point and among models of similar speed. It also shows strong performance in Email Classification (99.0% accuracy, 92nd percentile) and General Knowledge (99.0% accuracy, 66th percentile), where it is noted as the most accurate among models of comparable speed. While its Hallucinations accuracy is 92.0% (46th percentile), indicating a reasonable ability to acknowledge uncertainty, a notable weakness is its 0.0% accuracy in Instruction Following, suggesting significant limitations in handling complex, multi-layered instructions. Performance in Mathematics (82.0% accuracy, 38th percentile), Reasoning (74.0% accuracy, 53rd percentile), and Coding (84.0% accuracy, 51st percentile) is moderate.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.075
Completion $0.3

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Google AI Studio
Google AI Studio | google/gemini-2.0-flash-lite-001 1M $0.075 / 1M tokens $0.3 / 1M tokens
Google
Google | google/gemini-2.0-flash-lite-001 1M $0.075 / 1M tokens $0.3 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by google