Google: Gemini 3.5 Flash

Video input Audio input Text input File input Image input Text output
Author's Description

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...

Key Specifications
Cost
$$$$$
Context
1M
Released
May 19, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Top P Include Reasoning Max Tokens Response Format Tool Choice Stop Tools Temperature Structured Outputs Reasoning
Features

This model supports the following features:

Response Format Tools Structured Outputs Reasoning
Performance Summary

Google's Gemini 3.5 Flash, a high-efficiency multimodal model, demonstrates competitive response times, ranking in the 52nd percentile across seven benchmarks. However, its pricing tends to be at premium levels, placing it in the 6th percentile. A standout feature is its exceptional reliability, achieving a perfect 100% success rate across all benchmarks, indicating minimal technical failures. The model exhibits remarkable accuracy across several critical categories. It achieved perfect 100% accuracy in Hallucinations (Baseline), General Knowledge (Baseline), Reasoning (Baseline), and Ethics (Baseline), often being the most accurate model at its price point and speed for these tasks. Its coding proficiency is strong, with 97.0% accuracy in the Coding (Baseline) benchmark, placing it in the 98th percentile. Mathematics (Baseline) also shows high performance at 95.0% accuracy (85th percentile). While Email Classification (Baseline) is solid at 98.0% accuracy, its percentile ranking (49th) suggests more models perform similarly in this area. Key strengths include its near-perfect accuracy in knowledge, reasoning, and ethical tasks, coupled with high coding proficiency and unparalleled reliability. Its primary area for improvement lies in its premium pricing structure.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1.5
Completion $9
Input Cache Read $0.15
Input Cache Write $0.0833
Internal Reasoning $9
Web Search $14000

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Google AI Studio
Google AI Studio | google/gemini-3.5-flash-20260519 1M $1.5 / 1M tokens $9 / 1M tokens
Google
Google | google/gemini-3.5-flash-20260519 1M $1.5 / 1M tokens $9 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by google