Google: Gemma 3 12B

Text input Image input Text output Free Option
Author's Description

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

Key Specifications
Cost
$$
Context
131K
Parameters
12B
Released
Mar 13, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Frequency Penalty Top P Min P Response Format Temperature Stop Presence Penalty Max Tokens
Features

This model supports the following features:

Response Format
Performance Summary

Google's Gemma 3 12B, a multimodal AI model supporting vision-language input and text outputs, demonstrates strong overall performance with notable strengths in speed, pricing, and reliability. It consistently ranks among the fastest models, achieving an Infinityth percentile across 8 benchmarks, and offers highly competitive pricing, placing in the 87th percentile across 7 benchmarks. The model exhibits exceptional reliability with a 97% success rate, indicating minimal technical failures. In terms of specific benchmarks, Gemma 3 12B excels in Email Classification, achieving 99.0% accuracy and being the most accurate among models of its speed. It also performs well in General Knowledge (97.0% accuracy) and Ethics (98.0% accuracy), though these fall within the mid-range percentile for accuracy. Its Coding capabilities are solid at 85.0% accuracy. However, the model shows significant weaknesses in Instruction Following, with one benchmark yielding 0.0% accuracy and another at 19.0%. Mathematics and Reasoning also present areas for improvement, with accuracies of 67.0% and 59.2% respectively, placing them in lower percentiles. The model's ability to handle a 128k token context window and support over 140 languages are key features, alongside improved math, reasoning, and chat capabilities, including structured outputs and function calling.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.04
Completion $0.13

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | google/gemma-3-12b-it 131K $0.04 / 1M tokens $0.13 / 1M tokens
Cloudflare
Cloudflare | google/gemma-3-12b-it 80K $0.35 / 1M tokens $0.56 / 1M tokens
Chutes
Chutes | google/gemma-3-12b-it 131K $0.04 / 1M tokens $0.13 / 1M tokens
Novita
Novita | google/gemma-3-12b-it 131K $0.04 / 1M tokens $0.13 / 1M tokens
NCompass
NCompass | google/gemma-3-12b-it 128K $0.04 / 1M tokens $0.13 / 1M tokens
Crusoe
Crusoe | google/gemma-3-12b-it 131K $0.08 / 1M tokens $0.3 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by google