Google: Gemma 3 12B

Text input Image input Text output Free Option
Author's Description

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities, including structured outputs and function calling. Gemma 3 12B is the second largest in the family of Gemma 3 models after [Gemma 3 27B](google/gemma-3-27b-it)

Key Specifications
Cost
$$
Context
131K
Parameters
12B
Released
Mar 13, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Response Format Stop Seed Min P Top P Max Tokens Frequency Penalty Temperature Presence Penalty
Features

This model supports the following features:

Response Format
Performance Summary

Google's Gemma 3 12B model demonstrates strong overall performance, particularly excelling in speed and cost-efficiency. It consistently ranks among the fastest models and offers highly competitive pricing, making it an attractive option for cost-sensitive applications. The model also exhibits exceptional reliability with a 97% success rate, indicating minimal technical failures. In terms of specific capabilities, Gemma 3 12B shows a notable strength in Email Classification, achieving 99.0% accuracy and being the most accurate among models of comparable speed. It also performs well in Coding (85.0% accuracy) and General Knowledge (97.0% accuracy). However, the model struggles significantly with Instruction Following, showing 0.0% accuracy in one benchmark and only 19.0% in another, suggesting a key area for improvement. Its performance in Mathematics (67.0% accuracy) and Reasoning (59.2% accuracy) is moderate, indicating room for enhancement in complex problem-solving. The model's multimodality, supporting vision-language input, and advanced features like structured outputs and function calling, position it as a versatile tool despite its current limitations in instruction adherence.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.04
Completion $0.13

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | google/gemma-3-12b-it 131K $0.04 / 1M tokens $0.13 / 1M tokens
Cloudflare
Cloudflare | google/gemma-3-12b-it 80K $0.35 / 1M tokens $0.56 / 1M tokens
Chutes
Chutes | google/gemma-3-12b-it 96K $0.04 / 1M tokens $0.14 / 1M tokens
Novita
Novita | google/gemma-3-12b-it 131K $0.05 / 1M tokens $0.1 / 1M tokens
NCompass
NCompass | google/gemma-3-12b-it 128K $0.04 / 1M tokens $0.13 / 1M tokens
Crusoe
Crusoe | google/gemma-3-12b-it 131K $0.05 / 1M tokens $0.1 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by google