MoonshotAI: Kimi VL A3B Thinking

Name: MoonshotAI: Kimi VL A3B Thinking
Brand: moonshotai
Availability: OutOfStock
Rating: 1.5 (6 reviews)

Back

Image input Text input Text output Unavailable

Author's Description

Kimi-VL is a lightweight Mixture-of-Experts vision-language model that activates only 2.8B parameters per step while delivering strong performance on multimodal reasoning and long-context tasks. The Kimi-VL-A3B-Thinking variant, fine-tuned with chain-of-thought and reinforcement learning, excels in math and visual reasoning benchmarks like MathVision, MMMU, and MathVista, rivaling much larger models such as Qwen2.5-VL-7B and Gemma-3-12B. It supports 128K context and high-resolution input via its MoonViT encoder.

Key Specifications

Cost

Context

131K

Parameters

Released

Apr 10, 2025

Speed

★★

Ability

★

Reliability

★★★★

Hugging Face

Supported Parameters

This model supports the following parameters:

Stop Max Tokens Seed Reasoning Top P Frequency Penalty Presence Penalty Temperature Top Logprobs Include Reasoning Logprobs Logit Bias Min P

Features

This model supports the following features:

Reasoning

Performance Summary

MoonshotAI's Kimi VL A3B Thinking model, a lightweight Mixture-of-Experts vision-language model, demonstrates exceptional speed and competitive pricing. It consistently ranks among the fastest models across all evaluated benchmarks and offers among the most competitive pricing across five benchmarks. With a 98% success rate across six benchmarks, its reliability is notably high, indicating consistent operational stability. In terms of performance, the model exhibits a significant strength in Ethics, achieving perfect accuracy and being highlighted as the most accurate model at its price point and among models of similar speed. Its General Knowledge is solid at 85% accuracy, placing it in the 26th percentile, while its Reasoning capabilities are moderate at 64% accuracy (51st percentile). Coding performance is fair at 78% accuracy (40th percentile). A notable weakness is observed in Instruction Following, where it scored 0.0% accuracy in both instances, suggesting a critical area for improvement in understanding and executing complex, multi-layered instructions. Despite this, its fine-tuning with chain-of-thought and reinforcement learning appears to contribute to its strong performance in specific reasoning and ethical tasks.

Model Pricing

Current Pricing

Feature	Price (per 1M tokens)
Prompt	$0.02
Completion	$0.08

Price History

Available Endpoints

Provider	Endpoint Name	Context Length	Pricing (Input)	Pricing (Output)
Chutes	Chutes \| moonshotai/kimi-vl-a3b-thinking	131K	$0.02 / 1M tokens	$0.08 / 1M tokens
Chutes	Chutes \| moonshotai/kimi-vl-a3b-thinking	131K	$0.02 / 1M tokens	$0.08 / 1M tokens

Benchmark Results

Benchmark	Category	Reasoning	Strategy	Free	Executions	Accuracy	Cost	Duration

Other Models by moonshotai

	Released	Params	Context	Filter by Modalities All Modalities	Speed	Ability	Cost
MoonshotAI: Kimi K2.7 Code	Jun 12, 2026	—	262K	Image input Text input Text output	★★	★★★	$$$$$
MoonshotAI: Kimi K2.6 Unavailable	Apr 20, 2026	—	262K	Image input Text input Text output	★	★★★★★	$$$$$
MoonshotAI: Kimi K2.6	Apr 20, 2026	—	262K	Image input Text input Text output	★	★★★★★	$$$$$
MoonshotAI: Kimi K2.5	Jan 26, 2026	—	262K	Image input Text input Text output	★	★★★★★	$$$$$
MoonshotAI: Kimi Linear 48B A3B Instruct Unavailable	Nov 07, 2025	48B	1M	Text input Text output	★★★★	★	$$$
MoonshotAI: Kimi Linear 48B A3B Instruct Unavailable	Nov 07, 2025	48B	1M	Text input Text output	—	★	$$$$
MoonshotAI: Kimi K2 Thinking	Nov 06, 2025	~1T	262K	Text input Text output	★	★★★★★	$$$$$
MoonshotAI: Kimi K2 0905	Sep 04, 2025	~32B	262K	Text input Text output	★★	★★★	$$$$
MoonshotAI: Kimi K2 0905 (exacto) Unavailable	Sep 04, 2025	~1T	262K	Text input Text output	—	—	$$$$$
MoonshotAI: Kimi K2 0711	Jul 11, 2025	~1T	131K	Text input Text output	★★★★	★★★★★	$$
MoonshotAI: Kimi Dev 72B Unavailable	Jun 16, 2025	72B	131K	Text input Text output	★	★★	$$$$