Author's Description
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities, including structured outputs and function calling.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Google's Gemma 3 4B demonstrates competitive response times, performing among the faster models with a 59th percentile speed ranking. It stands out for its exceptional price competitiveness, consistently offering among the most affordable options, ranking in the 92nd percentile. The model also exhibits strong reliability, with a 91% success rate across benchmarks, indicating consistent operational stability. In terms of performance across categories, Gemma 3 4B shows a mixed profile. Its primary strength lies in Ethics, achieving 96.0% accuracy, and it also performs reasonably well in General Knowledge (69.8%) and Coding (66.0%). However, a significant weakness is its high hallucination rate, with only 2.0% accuracy in the Hallucinations (Baseline) test, suggesting a tendency to generate incorrect information rather than acknowledge uncertainty. Performance in Instruction Following (10.1% accuracy), Email Classification (77.0% accuracy), and Reasoning (36.0% accuracy) is also relatively low. While its Mathematics accuracy is 68.0%, this places it in the 33rd percentile, indicating room for improvement compared to other models. The model's multimodality, supporting vision-language input, and advanced features like structured outputs and function calling, are notable capabilities not directly reflected in these specific benchmarks.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.04 |
Completion | $0.08 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
DeepInfra
|
DeepInfra | google/gemma-3-4b-it | 131K | $0.04 / 1M tokens | $0.08 / 1M tokens |
NextBit
|
NextBit | google/gemma-3-4b-it | 131K | $0.04 / 1M tokens | $0.08 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by google
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Google: Gemini 2.5 Flash Preview 09-2025 | Sep 25, 2025 | — | 1M |
Text input
Image input
File input
Text output
|
★★★★ | ★★★★★ | $$$$ |
Google: Gemini 2.5 Flash Lite Preview 09-2025 | Sep 25, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★★★★★ | ★★★★ | $$$ |
Google: Gemini 2.5 Flash Image Preview (Nano Banana) | Aug 26, 2025 | — | 32K |
Text input
Image input
Text output
Image output
|
— | — | $$$$$ |
Google: Gemini 2.5 Flash Lite | Jul 22, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★★★★★ | ★★★ | $$$ |
Google: Gemini 2.5 Flash Lite Preview 06-17 | Jun 17, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★★★★ | ★★★ | $$$ |
Google: Gemini 2.5 Flash | Jun 17, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★★★★ | ★★★★ | $$$$$ |
Google: Gemini 2.5 Pro | Jun 17, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemini 2.5 Pro Preview 06-05 | Jun 05, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemma 1 2B Unavailable | May 28, 2025 | 2B | 8K |
Text input
Text output
|
— | — | $$ |
Google: Gemma 3n 4B | May 20, 2025 | 4B | 32K |
Text input
Text output
|
★★★ | ★★★ | $ |
Google: Gemini 2.5 Flash Preview 05-20 Unavailable | May 20, 2025 | — | 1M |
Text input
Image input
File input
Text output
|
★★★★ | ★★★★ | $$$ |
Google: Gemini 2.5 Pro Preview 05-06 | May 06, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemini 2.5 Flash Preview 04-17 Unavailable | Apr 17, 2025 | — | 1M |
Text input
Image input
File input
Text output
|
★★★★★ | ★★★★★ | $$$ |
Google: Gemini 2.5 Pro Experimental Unavailable | Mar 25, 2025 | — | 1M |
Text input
Image input
File input
Text output
|
— | — | $ |
Google: Gemma 3 12B | Mar 13, 2025 | 12B | 131K |
Text input
Image input
Text output
|
★★★ | ★★ | $$ |
Google: Gemma 3 27B | Mar 11, 2025 | 27B | 131K |
Text input
Image input
Text output
|
★★★ | ★★★ | $$ |
Google: Gemini 2.0 Flash Lite | Feb 25, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★★★★★ | ★★ | $$ |
Google: Gemini 2.0 Flash | Feb 05, 2025 | — | 1M |
Text input
Image input
File input
Audio input
Text output
|
★★★★★ | ★★★ | $$ |
Google: Gemini 1.5 Flash 8B Unavailable | Oct 02, 2024 | ~500B | 1M |
Text input
Image input
Text output
|
★★★★★ | ★★ | $ |
Google: Gemma 2 27B | Jul 12, 2024 | 27B | 8K |
Text input
Text output
|
★★★★ | ★★ | $$$$ |
Google: Gemma 2 9B | Jun 27, 2024 | 9B | 8K |
Text input
Text output
|
★★★ | ★ | $$ |
Google: Gemini 1.5 Flash Unavailable | May 13, 2024 | ~500B | 1M |
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$ |
Google: Gemini 1.5 Pro Unavailable | Apr 08, 2024 | ~1T | 2M |
Text input
Image input
Text output
|
★★★★ | ★★★ | $$$$ |