OpenAI: gpt-oss-safeguard-20b

Text input Text output
Author's Description

gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This open-weight, 21B-parameter Mixture-of-Experts (MoE) model offers lower latency for safety tasks like content classification, LLM filtering, and trust...

Key Specifications
Cost
$$$
Context
131K
Parameters
20B
Released
Oct 29, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tool Choice Include Reasoning Tools Response Format Temperature Max Tokens Reasoning Stop Top P Seed
Features

This model supports the following features:

Tools Reasoning Response Format
Performance Summary

OpenAI's gpt-oss-safeguard-20b, a 21B-parameter Mixture-of-Experts model, demonstrates strong performance as a safety reasoning model. It consistently ranks among the fastest models, placing in the 84th percentile across benchmarks, and offers competitive pricing, typically falling within the 66th percentile. The model exhibits exceptional reliability with a 99% success rate, indicating minimal technical failures. In terms of specific benchmarks, gpt-oss-safeguard-20b excels in several areas. It achieves high accuracy in Coding (95%, 94th percentile), Email Classification (99%, 89th percentile), and Reasoning (94%, 81st percentile), highlighting its proficiency in structured tasks and logical problem-solving. Its Instruction Following capability is also robust at 69% accuracy (73rd percentile). While its General Knowledge (97% accuracy) and Ethics (98% accuracy) scores are high, their percentile rankings (44th and 40th respectively) suggest a competitive landscape in these domains. A notable weakness is its performance in Hallucinations, where its 86% accuracy places it in the 32nd percentile, indicating room for improvement in acknowledging uncertainty. Despite this, its overall speed, cost-effectiveness, and high reliability make it a compelling choice for safety-critical applications.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.075
Completion $0.3
Input Cache Read $0.037

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Groq
Groq | openai/gpt-oss-safeguard-20b 131K $0.075 / 1M tokens $0.3 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by openai