Author's Description
Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It is optimized for agentic capabilities, including advanced tool use, reasoning, and code synthesis. Kimi K2 excels across a broad range of benchmarks, particularly in coding (LiveCodeBench, SWE-bench), reasoning (ZebraLogic, GPQA), and tool-use (Tau2, AceBench) tasks. It supports long-context inference up to 128K tokens and is designed with a novel training stack that includes the MuonClip optimizer for stable large-scale MoE training.
Performance Summary
MoonshotAI: Kimi K2, a large-scale MoE model with 1 trillion total parameters, sits near the middle of the field on speed (52nd percentile) and price (53rd percentile). Its reliability is its clearest strength: it achieved a 100% success rate across all 8 benchmarks. The model excels in Instruction Following, with perfect accuracy on one benchmark and near-perfect accuracy at the top speed on another. It also shows elite performance in Keyword Topic Relevance Classification, where it had the best cost efficiency. Kimi K2 scores 93% accuracy in Coding and 99.5% in General Knowledge, both within the top percentiles, and reaches perfect accuracy in Ethics. Reasoning is solid at 70% accuracy but is not a standout area compared with the model's other strengths. Overall, the benchmark results reflect the model's agentic design goals most clearly in coding and instruction following.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.14 |
Completion | $2.49 |
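
As a rough illustration of how the per-token prices above translate into a per-request cost, the sketch below multiplies token counts by the listed rates. The function name `estimate_cost` and the example token counts are hypothetical and not part of any provider API; actual billing may differ by provider.

```python
from decimal import Decimal

# Prices copied from the Current Pricing table (USD per 1M tokens).
PROMPT_PRICE_PER_M = Decimal("0.14")
COMPLETION_PRICE_PER_M = Decimal("2.49")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    prompt_cost = Decimal(prompt_tokens) * PROMPT_PRICE_PER_M / TOKENS_PER_M
    completion_cost = (
        Decimal(completion_tokens) * COMPLETION_PRICE_PER_M / TOKENS_PER_M
    )
    return prompt_cost + completion_cost


if __name__ == "__main__":
    # Example: a 10,000-token prompt with a 2,000-token completion.
    print(estimate_cost(10_000, 2_000))
```

Because completion tokens cost roughly 18 times as much as prompt tokens at these rates, output length tends to dominate the bill for generation-heavy workloads.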
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Novita | moonshotai/kimi-k2 | 131K | $0.14 / 1M tokens | $2.49 / 1M tokens |
Parasail | moonshotai/kimi-k2 | 131K | $0.99 / 1M tokens | $2.99 / 1M tokens |
Targon | moonshotai/kimi-k2 | 63K | $0.14 / 1M tokens | $2.49 / 1M tokens |
Moonshot AI | moonshotai/kimi-k2 | 131K | $0.14 / 1M tokens | $2.49 / 1M tokens |
Novita | moonshotai/kimi-k2 | 131K | $0.57 / 1M tokens | $2.3 / 1M tokens |
Together | moonshotai/kimi-k2 | 131K | $1 / 1M tokens | $3 / 1M tokens |
DeepInfra | moonshotai/kimi-k2 | 120K | $0.14 / 1M tokens | $2.49 / 1M tokens |
Groq | moonshotai/kimi-k2 | 131K | $1 / 1M tokens | $3 / 1M tokens |
BaseTen | moonshotai/kimi-k2 | 131K | $0.6 / 1M tokens | $2.5 / 1M tokens |
Chutes | moonshotai/kimi-k2 | 75K | $0.148 / 1M tokens | $0.593 / 1M tokens |
BaseTen | moonshotai/kimi-k2 | 131K | $0.14 / 1M tokens | $2.49 / 1M tokens |
DeepInfra | moonshotai/kimi-k2 | 131K | $0.55 / 1M tokens | $2.2 / 1M tokens |
GMICloud | moonshotai/kimi-k2 | 131K | $1 / 1M tokens | $3 / 1M tokens |
Moonshot AI | moonshotai/kimi-k2 | 131K | $0.6 / 1M tokens | $2.5 / 1M tokens |
Fireworks | moonshotai/kimi-k2 | 131K | $0.6 / 1M tokens | $2.5 / 1M tokens |
Chutes | moonshotai/kimi-k2 | 131K | $0.14 / 1M tokens | $2.49 / 1M tokens |
AtlasCloud | moonshotai/kimi-k2 | 131K | $0.7 / 1M tokens | $2.5 / 1M tokens |
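
Since the endpoints differ in both context length and price, one way to compare them is to filter by the context a workload needs and then rank by estimated cost. The sketch below does this with the rows listed above. The `Endpoint` type and `cheapest_endpoint` function are hypothetical names for illustration, and the code assumes that the displayed "K" means thousands of tokens and that the prompt and completion must fit in the context window together.

```python
from typing import NamedTuple, Optional, Sequence


class Endpoint(NamedTuple):
    provider: str
    context_k: int       # context length as displayed, in thousands of tokens
    input_price: float   # USD per 1M prompt tokens
    output_price: float  # USD per 1M completion tokens


# Rows copied from the Available Endpoints table, in listed order.
ENDPOINTS = [
    Endpoint("Novita", 131, 0.14, 2.49),
    Endpoint("Parasail", 131, 0.99, 2.99),
    Endpoint("Targon", 63, 0.14, 2.49),
    Endpoint("Moonshot AI", 131, 0.14, 2.49),
    Endpoint("Novita", 131, 0.57, 2.3),
    Endpoint("Together", 131, 1.0, 3.0),
    Endpoint("DeepInfra", 120, 0.14, 2.49),
    Endpoint("Groq", 131, 1.0, 3.0),
    Endpoint("BaseTen", 131, 0.6, 2.5),
    Endpoint("Chutes", 75, 0.148, 0.593),
    Endpoint("BaseTen", 131, 0.14, 2.49),
    Endpoint("DeepInfra", 131, 0.55, 2.2),
    Endpoint("GMICloud", 131, 1.0, 3.0),
    Endpoint("Moonshot AI", 131, 0.6, 2.5),
    Endpoint("Fireworks", 131, 0.6, 2.5),
    Endpoint("Chutes", 131, 0.14, 2.49),
    Endpoint("AtlasCloud", 131, 0.7, 2.5),
]


def cheapest_endpoint(
    prompt_tokens: int,
    completion_tokens: int,
    endpoints: Sequence[Endpoint] = ENDPOINTS,
) -> Optional[Endpoint]:
    """Return the lowest-cost endpoint whose context fits the request.

    Returns None if no endpoint has a large enough context window.
    Ties are resolved in favor of the endpoint listed first.
    """
    needed_k = (prompt_tokens + completion_tokens) / 1000
    candidates = [e for e in endpoints if e.context_k >= needed_k]
    if not candidates:
        return None

    def cost(e: Endpoint) -> float:
        return (
            prompt_tokens * e.input_price + completion_tokens * e.output_price
        ) / 1_000_000

    return min(candidates, key=cost)


if __name__ == "__main__":
    print(cheapest_endpoint(50_000, 10_000))
    print(cheapest_endpoint(100_000, 10_000))
```

Price is only one axis here; the sketch ignores throughput, latency, and availability, which the table does not report per endpoint.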
Benchmark Results
Benchmark | Category | Reasoning | Free | Executions | Accuracy | Cost | Duration |
---|---|---|---|---|---|---|---|
Other Models by moonshotai
Model | Released | Params | Context | Modalities | Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
MoonshotAI: Kimi VL A3B Thinking | Apr 10, 2025 | 3B | 131K | Text input, Image input, Text output | ★ | ★★ | $$ |