Qwen: Qwen3 Coder 480B A35B (exacto)

Text input Text output
Author's Description

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over repositories. The model features 480 billion total parameters, with 35 billion active per forward pass (8 out of 160 experts). Pricing for the Alibaba endpoints varies by context length. Once a request is greater than 128k input tokens, the higher pricing is used.

Key Specifications
Cost
$$$$
Context
262K
Parameters
480B
Released
Jul 22, 2025
Supported Parameters

This model supports the following parameters:

Reasoning Stop Structured Outputs Response Format Temperature Max Tokens Tool Choice Tools
Features

This model supports the following features:

Response Format Tools Reasoning Structured Outputs
Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.38
Completion $1.53

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
BaseTen
BaseTen | qwen/qwen3-coder-480b-a35b-07-25:exacto 262K $0.38 / 1M tokens $1.53 / 1M tokens
Google
Google | qwen/qwen3-coder-480b-a35b-07-25:exacto 262K $1 / 1M tokens $4 / 1M tokens
Cerebras
Cerebras | qwen/qwen3-coder-480b-a35b-07-25:exacto 131K $2 / 1M tokens $2 / 1M tokens
Other Models by qwen