MoonshotAI: Kimi Dev 72B

Text input Text output Free Option
Author's Description

Kimi-Dev-72B is an open-source large language model fine-tuned for software engineering and issue resolution tasks. Based on Qwen2.5-72B, it is optimized using large-scale reinforcement learning that applies code patches in real repositories and validates them via full test suite execution—rewarding only correct, robust completions. The model achieves 60.4% on SWE-bench Verified, setting a new benchmark among open-source models for software bug fixing and code reasoning.

Key Specifications
Cost
$$$$
Context
131K
Parameters
72B
Released
Jun 16, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Include Reasoning Response Format Top P Frequency Penalty Reasoning Structured Outputs Temperature
Features

This model supports the following features:

Reasoning Structured Outputs Response Format
Performance Summary

MoonshotAI: Kimi Dev 72B, a large language model optimized for software engineering tasks, demonstrates exceptional speed and competitive pricing, consistently ranking among the fastest and most cost-effective models across various benchmarks. Its reliability is strong, achieving an 89% success rate. The model excels in specialized areas, particularly in Email Classification (99.0% accuracy) and General Knowledge (99.0% accuracy), indicating a robust understanding of diverse information and contextual nuances. Its primary strength lies in its intended application, achieving a remarkable 60.4% on SWE-bench Verified, setting a new open-source benchmark for software bug fixing. However, the model exhibits significant weaknesses in Instruction Following, scoring 0.0% accuracy, suggesting challenges with complex multi-step directives. Hallucinations are also a concern, with a 68.0% accuracy in identifying fictional concepts. While its Coding performance is moderate at 63.8%, and Reasoning at 60.0%, these areas could benefit from further refinement. Mathematics and Ethics show decent performance at 80.8% and 93.0% respectively, though the Ethics percentile ranking is lower than expected for its accuracy.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.29
Completion $1.15

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
SiliconFlow
SiliconFlow | moonshotai/kimi-dev-72b 131K $0.29 / 1M tokens $1.15 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by moonshotai