xAI: Grok 2 Vision 1212

Text input Image input Text output Unavailable
Author's Description

Grok 2 Vision 1212 advances image-based AI with stronger visual comprehension, refined instruction-following, and multilingual support. From object recognition to style analysis, it empowers developers to build more intuitive, visually aware applications. Its enhanced steerability and reasoning establish a robust foundation for next-generation image solutions. To read more about this model, check out [xAI's announcement](https://x.ai/blog/grok-1212).

Key Specifications
Cost
$$$$$
Context
32K
Released
Dec 14, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Top Logprobs Logprobs Response Format Stop Seed Top P Max Tokens Frequency Penalty Temperature Presence Penalty
Features

This model supports the following features:

Response Format
Performance Summary

xAI: Grok 2 Vision 1212, created on December 14, 2024, demonstrates strong overall performance, particularly in its operational efficiency. The model performs among the fastest models, ranking in the top tier for speed (72nd percentile across 7 benchmarks). It offers competitive pricing, with moderate costs (20th percentile across 7 benchmarks). Notably, Grok 2 Vision 1212 exhibits exceptional reliability, achieving a 100% success rate across all benchmarks, indicating minimal technical failures. In terms of specific capabilities, the model shows robust performance in several key areas. It achieves high accuracy in General Knowledge (99.0%), Ethics (99.0%), and Email Classification (99.0%), placing it in the 66th, 51st, and 79th percentiles respectively. Its Instruction Following (67.0% accuracy, 75th percentile) and Coding (91.0% accuracy, 75th percentile) capabilities are also strong. While its Hallucinations accuracy is respectable at 96.0%, its Reasoning performance (58.0% accuracy, 42nd percentile) is a relative weakness. The model's enhanced steerability and reasoning, as highlighted in its description, are foundational for next-generation image solutions, and its strong visual comprehension and refined instruction-following are evident in its benchmark results.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $2
Completion $10

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
xAI
xAI | x-ai/grok-2-vision-1212 32K $2 / 1M tokens $10 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by x-ai