Google: Gemini 3.1 Flash Lite Preview

File input Audio input Text input Image input Video input Text output
Author's Description

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across...

Key Specifications
Cost
$$$
Context
1M
Released
Mar 02, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Tool Choice Temperature Include Reasoning Reasoning Max Tokens Seed Structured Outputs Response Format Top P Stop
Features

This model supports the following features:

Structured Outputs Reasoning Response Format Tools
Performance Summary

Google's Gemini 3.1 Flash Lite Preview, created on March 2, 2026, is a high-efficiency model designed for high-volume use cases, building upon Gemini 2.5 Flash Lite with improved overall quality and approaching Gemini 2.5 Flash performance. It consistently ranks among the fastest models, achieving the 90th percentile across 8 benchmarks, and offers competitive pricing, typically providing cost-effective solutions at the 62nd percentile. Demonstrating exceptional reliability, it boasts a 100% success rate across all benchmarks, indicating minimal technical failures. The model exhibits perfect accuracy in critical areas such as Hallucinations (100%), General Knowledge (100%), and Ethics (100%), often being the most accurate at its price point and speed. It also shows strong performance in Mathematics (96% accuracy, 95th percentile) and Email Classification (99% accuracy, 88th percentile). While its Instruction Following (76% accuracy, 83rd percentile) and Reasoning (82% accuracy, 65th percentile) capabilities are solid, its Coding performance (87% accuracy, 56th percentile) is more moderate compared to other benchmarks. Key strengths include its high accuracy in knowledge-based and ethical tasks, coupled with its speed and reliability, making it a robust option for various applications.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.25
Completion $1.5
Input Cache Read $0.025
Input Cache Write $0.0833
Internal Reasoning $1.5

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Google
Google | google/gemini-3.1-flash-lite-preview-20260303 1M $0.25 / 1M tokens $1.5 / 1M tokens
Google AI Studio
Google AI Studio | google/gemini-3.1-flash-lite-preview-20260303 1M $0.25 / 1M tokens $1.5 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by google