IBM: Granite 4.1 8B

Text input Text output
Author's Description

Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks...

Key Specifications
Cost
$$
Context
131K
Parameters
8B
Released
Apr 30, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tool Choice Tools Response Format Temperature Max Tokens Structured Outputs Presence Penalty Stop Top P Frequency Penalty Seed
Features

This model supports the following features:

Structured Outputs Tools Response Format
Performance Summary

IBM Granite 4.1 8B demonstrates competitive response times, ranking in the 55th percentile across benchmarks, and consistently offers highly competitive pricing, placing in the 88th percentile. The model exhibits exceptional reliability with a 98% success rate, indicating minimal technical failures. In terms of performance across categories, Granite 4.1 8B shows strong capabilities in Hallucinations (98.0% accuracy), Ethics (99.0% accuracy), and Email Classification (95.0% accuracy). Notably, it achieved a top 3 ranking in speed for Email Classification and was the most accurate among models with comparable speed. However, the model struggles significantly with Instruction Following (16.0% accuracy) and Reasoning (46.0% accuracy), indicating these areas as key weaknesses. Its performance in General Knowledge (93.0%), Mathematics (71.0%), and Coding (78.0%) is moderate, falling within the lower to mid-range percentiles for accuracy. Overall, Granite 4.1 8B is a cost-effective and reliable model, particularly strong in tasks requiring ethical judgment and avoiding factual errors, but requires further development in complex instruction following and abstract reasoning.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.05
Completion $0.1
Input Cache Read $0.05

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
WandB
WandB | ibm-granite/granite-4.1-8b-20260429 131K $0.05 / 1M tokens $0.1 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by ibm-granite