MoonshotAI: Kimi K2

Text input Text output Free Option
Author's Description

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It is optimized for agentic capabilities, including advanced tool use, reasoning, and code synthesis. Kimi K2 excels across a broad range of benchmarks, particularly in coding (LiveCodeBench, SWE-bench), reasoning (ZebraLogic, GPQA), and tool-use (Tau2, AceBench) tasks. It supports long-context inference up to 128K tokens and is designed with a novel training stack that includes the MuonClip optimizer for stable large-scale MoE training.

Key Specifications
Cost
$$
Context
131K
Parameters
1T (Rumoured)
Released
Jul 11, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Stop Presence Penalty Logit Bias Top P Tool Choice Temperature Seed Min P Tools Frequency Penalty Max Tokens
Features

This model supports the following features:

Tools
Performance Summary

MoonshotAI: Kimi K2, a large-scale MoE model with 1 trillion parameters, demonstrates competitive performance across various metrics. Its speed ranking places it in the 52nd percentile, indicating generally competitive response times, while its price ranking at the 53rd percentile suggests competitive pricing. Notably, Kimi K2 exhibits exceptional reliability, achieving a 100% success rate across all 8 benchmarks, signifying consistent and dependable operation. The model excels in Instruction Following, achieving perfect accuracy in one instance and near-perfect accuracy with top speed in another, highlighting its precision and efficiency. It also shows elite performance in Keyword Topic Relevance Classification with unmatched cost efficiency. Kimi K2 demonstrates strong capabilities in Coding (93% accuracy) and General Knowledge (99.5% accuracy), performing well within the top percentiles. Its perfect accuracy in Ethics is also a significant strength. While its Reasoning performance is solid at 70% accuracy, it is not a standout area compared to its other strengths. The model's design for agentic capabilities, including advanced tool use, reasoning, and code synthesis, is well-reflected in its benchmark results, particularly in coding and instruction following.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.14
Completion $2.49

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Novita
Novita | moonshotai/kimi-k2 131K $0.14 / 1M tokens $2.49 / 1M tokens
Parasail
Parasail | moonshotai/kimi-k2 131K $0.99 / 1M tokens $2.99 / 1M tokens
Targon
Targon | moonshotai/kimi-k2 63K $0.14 / 1M tokens $2.49 / 1M tokens
Moonshot AI
Moonshot AI | moonshotai/kimi-k2 131K $0.14 / 1M tokens $2.49 / 1M tokens
Novita
Novita | moonshotai/kimi-k2 131K $0.57 / 1M tokens $2.3 / 1M tokens
Together
Together | moonshotai/kimi-k2 131K $1 / 1M tokens $3 / 1M tokens
DeepInfra
DeepInfra | moonshotai/kimi-k2 120K $0.14 / 1M tokens $2.49 / 1M tokens
Groq
Groq | moonshotai/kimi-k2 131K $1 / 1M tokens $3 / 1M tokens
BaseTen
BaseTen | moonshotai/kimi-k2 131K $0.6 / 1M tokens $2.5 / 1M tokens
Chutes
Chutes | moonshotai/kimi-k2 75K $0.148 / 1M tokens $0.593 / 1M tokens
BaseTen
BaseTen | moonshotai/kimi-k2 131K $0.14 / 1M tokens $2.49 / 1M tokens
DeepInfra
DeepInfra | moonshotai/kimi-k2 131K $0.55 / 1M tokens $2.2 / 1M tokens
GMICloud
GMICloud | moonshotai/kimi-k2 131K $1 / 1M tokens $3 / 1M tokens
Moonshot AI
Moonshot AI | moonshotai/kimi-k2 131K $0.6 / 1M tokens $2.5 / 1M tokens
Fireworks
Fireworks | moonshotai/kimi-k2 131K $0.6 / 1M tokens $2.5 / 1M tokens
Chutes
Chutes | moonshotai/kimi-k2 131K $0.14 / 1M tokens $2.49 / 1M tokens
AtlasCloud
AtlasCloud | moonshotai/kimi-k2 131K $0.7 / 1M tokens $2.5 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by moonshotai