Google: Gemini 3.5 Flash

Video input Image input File input Text input Audio input Text output
Author's Description

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...

Key Specifications
Cost
$$$$$
Context
1M
Released
May 19, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Response Format Tool Choice Include Reasoning Temperature Tools Max Tokens Reasoning Stop Structured Outputs Seed Top P
Features

This model supports the following features:

Structured Outputs Reasoning Tools Response Format
Performance Summary

Google's Gemini 3.5 Flash, a high-efficiency multimodal model, demonstrates competitive response times, ranking in the 52nd percentile across seven benchmarks. However, its pricing tends to be at premium levels, placing it in the 6th percentile. A standout feature is its exceptional reliability, achieving a perfect 100% success rate across all benchmarks, indicating minimal technical failures. The model exhibits remarkable accuracy across several critical categories. It achieved perfect 100% accuracy in Hallucinations (Baseline), General Knowledge (Baseline), Reasoning (Baseline), and Ethics (Baseline), often being the most accurate model at its price point and speed for these tasks. Its coding proficiency is strong, with 97.0% accuracy in the Coding (Baseline) benchmark, placing it in the 98th percentile. Mathematics (Baseline) also shows high performance at 95.0% accuracy (85th percentile). While Email Classification (Baseline) is solid at 98.0% accuracy, its percentile ranking (49th) suggests more models perform similarly in this area. Key strengths include its near-perfect accuracy in knowledge, reasoning, and ethical tasks, coupled with high coding proficiency and unparalleled reliability. Its primary area for improvement lies in its premium pricing structure.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1.5
Completion $9
Input Cache Read $0.15
Input Cache Write $0.0833
Internal Reasoning $9
Web Search $14000

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Google AI Studio
Google AI Studio | google/gemini-3.5-flash-20260519 1M $1.5 / 1M tokens $9 / 1M tokens
Google
Google | google/gemini-3.5-flash-20260519 1M $1.5 / 1M tokens $9 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by google