Google: Gemini 2.0 Flash Lite

Text input Image input File input Audio input Text output
Author's Description

Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5), all at extremely economical token prices.

Key Specifications
Cost
$$
Context
1M
Released
Feb 25, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Structured Outputs Tool Choice Response Format Stop Seed Top P Max Tokens Temperature
Features

This model supports the following features:

Tools Response Format Structured Outputs
Performance Summary

Google's Gemini 2.0 Flash Lite demonstrates exceptional performance across several key metrics. It consistently ranks among the fastest models available, achieving an Infinityth percentile in speed across all 8 benchmarks, indicating a significantly faster time to first token (TTFT). Furthermore, its pricing is highly competitive, also securing an Infinityth percentile ranking across all benchmarks, making it an extremely economical option. The model exhibits outstanding reliability, with a 100% success rate across all benchmarks, ensuring consistent and usable responses. In terms of specific benchmark performance, Gemini 2.0 Flash Lite shows particular strength in Ethics, achieving perfect 100% accuracy and being the most accurate model at its price point and among models of comparable speed. It also performs very well in General Knowledge (99.0% accuracy), notably being the most accurate among models of its speed, and Email Classification (99.0% accuracy). While its Hallucinations accuracy is 92.0%, it is important to note that this benchmark specifically tests for acknowledging uncertainty, and a perfect score would mean always selecting "I don't know" for fictional concepts. A significant weakness is observed in Instruction Following, where it scored 0.0% accuracy, indicating a need for improvement in complex, multi-layered instruction adherence. Its Mathematics (82.0% accuracy), Reasoning (74.0% accuracy), and Coding (84.0% accuracy) performances are moderate, suggesting areas for further refinement.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.075
Completion $0.3

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Google AI Studio
Google AI Studio | google/gemini-2.0-flash-lite-001 1M $0.075 / 1M tokens $0.3 / 1M tokens
Google
Google | google/gemini-2.0-flash-lite-001 1M $0.075 / 1M tokens $0.3 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by google