Author's Description
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities, including structured outputs and function calling. Gemma 3 12B is the second largest in the family of Gemma 3 models after [Gemma 3 27B](google/gemma-3-27b-it)
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Gemma 3 12B, Google's latest multimodal AI model, demonstrates exceptional speed, consistently ranking among the fastest models across various benchmarks. Its pricing is highly competitive, placing it in the 87th percentile for cost-efficiency. Furthermore, the model exhibits outstanding reliability with a 99% success rate, indicating minimal technical failures and consistent response delivery. In terms of performance across categories, Gemma 3 12B shows strong capabilities in Classification, achieving 99% accuracy in Email Classification, notably being the most accurate among models of comparable speed. It also performs well in General Knowledge and Ethics, scoring 97% and 98% accuracy respectively, indicating a broad understanding and adherence to ethical principles. While its Coding performance is solid at 85% accuracy, its Instruction Following capabilities appear to be a notable weakness, with one benchmark showing 0% accuracy and another at 19%. Reasoning performance is moderate at 60.4% accuracy. Overall, Gemma 3 12B stands out for its speed, cost-effectiveness, and reliability, making it a strong contender for tasks requiring efficient and dependable classification and knowledge retrieval, despite some limitations in complex instruction following.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.05 |
Completion | $0.1 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
DeepInfra
|
DeepInfra | google/gemma-3-12b-it | 131K | $0.05 / 1M tokens | $0.1 / 1M tokens |
Cloudflare
|
Cloudflare | google/gemma-3-12b-it | 80K | $0.35 / 1M tokens | $0.56 / 1M tokens |
Chutes
|
Chutes | google/gemma-3-12b-it | 96K | $0.0481 / 1M tokens | $0.193 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by google
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Google: Gemini 2.5 Flash Lite | Jul 22, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$$ |
Google: Gemini 2.5 Flash Lite Preview 06-17 | Jun 17, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★★ | ★★★ | $$$ |
Google: Gemini 2.5 Flash | Jun 17, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★ | ★★★★★ | $$$$$ |
Google: Gemini 2.5 Pro | Jun 17, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemini 2.5 Pro Preview 06-05 | Jun 05, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemma 1 2B Unavailable | May 28, 2025 | 2B | 8K |
Text input
Text output
|
— | — | $$ |
Google: Gemma 3n 4B | May 20, 2025 | 4B | 32K |
Text input
Text output
|
★★★ | ★★★ | $ |
Google: Gemini 2.5 Flash Preview 05-20 Unavailable | May 20, 2025 | — | 1M |
File input
Text input
Image input
Text output
|
★★★★ | ★★★★★ | $$$ |
Google: Gemini 2.5 Pro Preview 05-06 | May 06, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★ | ★★★★★ | $$$$$ |
Google: Gemini 2.5 Flash Preview 04-17 Unavailable | Apr 17, 2025 | — | 1M |
File input
Text input
Image input
Text output
|
★★★★ | ★★★★★ | $$$ |
Google: Gemini 2.5 Pro Experimental | Mar 25, 2025 | — | 1M |
File input
Text input
Image input
Text output
|
— | — | $ |
Google: Gemma 3 4B | Mar 13, 2025 | 4B | 131K |
Text input
Image input
Text output
|
★★★ | ★★ | $ |
Google: Gemma 3 27B | Mar 11, 2025 | 27B | 131K |
Text input
Image input
Text output
|
★★ | ★★★ | $$ |
Google: Gemini 2.0 Flash Lite | Feb 25, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★★★ | ★★ | $$ |
Google: Gemini 2.0 Flash | Feb 05, 2025 | — | 1M |
Audio input
File input
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$ |
Google: Gemini 1.5 Flash 8B | Oct 02, 2024 | ~500B | 1M |
Text input
Image input
Text output
|
★★★★★ | ★★ | $$ |
Google: Gemma 2 27B | Jul 12, 2024 | 27B | 8K |
Text input
Text output
|
★★★★★ | ★★ | $$$$ |
Google: Gemma 2 9B | Jun 27, 2024 | 9B | 8K |
Text input
Text output
|
★★★★★ | ★ | $$ |
Google: Gemini 1.5 Flash | May 13, 2024 | ~500B | 1M |
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$ |
Google: Gemini 1.5 Pro | Apr 08, 2024 | ~1T | 2M |
Text input
Image input
Text output
|
★★★ | ★★★ | $$$$ |