xAI: Grok Vision Beta

Text input Image input Text output
Author's Description

Grok Vision Beta is xAI's experimental language model with vision capability.

Key Specifications
Cost
$$$$$
Context
8K
Released
Nov 18, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Stop Presence Penalty Top P Temperature Seed Response Format Frequency Penalty Logprobs Max Tokens Top Logprobs
Features

This model supports the following features:

Response Format
Performance Summary

Grok Vision Beta, xAI's experimental language model with vision capabilities, demonstrates exceptional speed, consistently ranking among the fastest models available. While its pricing tends to be at premium levels, it offers outstanding reliability, with minimal technical failures. Performance across benchmarks reveals a mixed but generally strong profile. Grok Vision Beta achieved perfect accuracy in both Email Classification and Ethics, notably being the most accurate model at its price point and speed in these categories. It also performed very well in General Knowledge, scoring 99.5% accuracy, placing it in the 79th percentile. In Coding (Baseline), it achieved a respectable 87.0% accuracy. However, a significant weakness is evident in Instruction Following, where it scored 0.0% accuracy, indicating a critical area for improvement. Its Reasoning capabilities are moderate, with 56.0% accuracy. Overall, Grok Vision Beta excels in classification, ethical reasoning, and general knowledge, while its speed and reliability are significant strengths. The primary area for development is its instruction following ability.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $5
Completion $15

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
xAI
xAI | x-ai/grok-vision-beta 8K $5 / 1M tokens $15 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by x-ai