Google: Gemma 3 4B

Text input Image input Text output Free Option
Author's Description

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

Key Specifications
Cost
$$
Context
131K
Parameters
4B
Released
Mar 13, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Frequency Penalty Top P Min P Response Format Temperature Stop Presence Penalty Max Tokens
Features

This model supports the following features:

Response Format
Performance Summary

Google's Gemma 3 4B demonstrates strong performance in terms of operational efficiency. It performs among the fastest models, ranking in the 61st percentile for speed across various benchmarks. Furthermore, it consistently offers highly competitive pricing, placing in the 91st percentile. The model also exhibits strong reliability, with a 91% success rate, indicating consistent and usable responses. In terms of capabilities, Gemma 3 4B shows notable strengths in cost-efficiency across most benchmarks, particularly in General Knowledge, Ethics, and Email Classification. Its accuracy in Ethics is commendable at 96.0%. However, the model struggles significantly with Hallucinations, achieving only 2.0% accuracy, suggesting a tendency to generate information rather than acknowledge uncertainty. Similarly, its performance in Instruction Following (10.1% accuracy) and Reasoning (36.0% accuracy) indicates areas for improvement in complex task execution and logical deduction. While its General Knowledge (69.8%), Mathematics (68.0%), and Coding (66.0%) scores are moderate, they are generally below the top tiers. The model's multimodality and support for 140+ languages are key features, though not directly assessed in these benchmarks.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.04
Completion $0.08

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | google/gemma-3-4b-it 131K $0.04 / 1M tokens $0.08 / 1M tokens
NextBit
NextBit | google/gemma-3-4b-it 131K $0.04 / 1M tokens $0.08 / 1M tokens
Chutes
Chutes | google/gemma-3-4b-it 96K $0.04 / 1M tokens $0.08 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by google