OpenAI: gpt-oss-20b

Text input Text output Free Option
Author's Description

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for lower-latency inference and deployability on consumer or single-GPU hardware. The model is trained in OpenAI’s Harmony response format and supports reasoning level configuration, fine-tuning, and agentic capabilities including function calling, tool use, and structured outputs.

Key Specifications
Cost
$$
Context
131K
Parameters
20B
Released
Aug 05, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Structured Outputs Response Format Reasoning Temperature Presence Penalty Include Reasoning Tools Frequency Penalty Top P Stop Tool Choice Max Tokens Logit Bias
Features

This model supports the following features:

Structured Outputs Response Format Tools Reasoning
Performance Summary

The OpenAI gpt-oss-20b model, an open-weight 21B parameter Mixture-of-Experts (MoE) model, demonstrates exceptional speed and competitive pricing. It consistently ranks among the fastest models, achieving an Infinityth percentile across 17 benchmarks, and offers among the most competitive pricing, also at an Infinityth percentile across 9 benchmarks. This optimization for lower-latency inference and deployability on consumer or single-GPU hardware is evident in its performance. In terms of specific benchmarks, gpt-oss-20b shows strong performance in several areas. It achieves high accuracy in Ethics (99.0%), General Knowledge (99.0%), and Coding (92.0%), indicating robust understanding in these domains. Its Instruction Following capabilities are also solid at 66.0% accuracy, and Reasoning at 89.8%. Notably, it achieved perfect accuracy in two instances of Keyword Topic Relevance Classification, with one being the fastest recorded duration. However, the model exhibits significant weaknesses in other Keyword Topic Relevance Classification benchmarks, scoring 0.0% accuracy in multiple instances, suggesting inconsistency or sensitivity to specific test variations. Its Mathematics performance is moderate at 66.7% accuracy, but with a very high duration. The model also shows a tendency to hallucinate, with 70.0% accuracy in the Hallucinations benchmark, indicating it doesn't always correctly identify fictional concepts.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.07
Completion $0.3
Input Cache Read $0.035

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Fireworks
Fireworks | openai/gpt-oss-20b 131K $0.07 / 1M tokens $0.3 / 1M tokens
Groq
Groq | openai/gpt-oss-20b 131K $0.075 / 1M tokens $0.3 / 1M tokens
Novita
Novita | openai/gpt-oss-20b 131K $0.03 / 1M tokens $0.11 / 1M tokens
Nebius
Nebius | openai/gpt-oss-20b 131K $0.05 / 1M tokens $0.2 / 1M tokens
DeepInfra
DeepInfra | openai/gpt-oss-20b 131K $0.03 / 1M tokens $0.11 / 1M tokens
NCompass
NCompass | openai/gpt-oss-20b 131K $0.03 / 1M tokens $0.11 / 1M tokens
Phala
Phala | openai/gpt-oss-20b 131K $0.03 / 1M tokens $0.11 / 1M tokens
Together
Together | openai/gpt-oss-20b 131K $0.05 / 1M tokens $0.2 / 1M tokens
WandB
WandB | openai/gpt-oss-20b 131K $0.05 / 1M tokens $0.2 / 1M tokens
Hyperbolic
Hyperbolic | openai/gpt-oss-20b 131K $0.03 / 1M tokens $0.11 / 1M tokens
NextBit
NextBit | openai/gpt-oss-20b 131K $0.1 / 1M tokens $0.45 / 1M tokens
InferenceNet
InferenceNet | openai/gpt-oss-20b 100K $0.03 / 1M tokens $0.11 / 1M tokens
Google
Google | openai/gpt-oss-20b 131K $0.07 / 1M tokens $0.25 / 1M tokens
SiliconFlow
SiliconFlow | openai/gpt-oss-20b 131K $0.03 / 1M tokens $0.11 / 1M tokens
Novita
Novita | openai/gpt-oss-20b 131K $0.03 / 1M tokens $0.11 / 1M tokens
Parasail
Parasail | openai/gpt-oss-20b 131K $0.04 / 1M tokens $0.2 / 1M tokens
Amazon Bedrock
Amazon Bedrock | openai/gpt-oss-20b 131K $0.07 / 1M tokens $0.15 / 1M tokens
Clarifai
Clarifai | openai/gpt-oss-20b 131K $0.045 / 1M tokens $0.18 / 1M tokens
GMICloud
GMICloud | openai/gpt-oss-20b 131K $0.03 / 1M tokens $0.11 / 1M tokens
Novita
Novita | openai/gpt-oss-20b 131K $0.04 / 1M tokens $0.15 / 1M tokens
Chutes
Chutes | openai/gpt-oss-20b 131K $0.03 / 1M tokens $0.11 / 1M tokens
DeepInfra
DeepInfra | openai/gpt-oss-20b 131K $0.03 / 1M tokens $0.14 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by openai