Author's Description
Hunyuan-A13B is a 13B active parameter Mixture-of-Experts (MoE) language model developed by Tencent, with a total parameter count of 80B and support for reasoning via Chain-of-Thought. It offers competitive benchmark performance across mathematics, science, coding, and multi-turn reasoning tasks, while maintaining high inference efficiency via Grouped Query Attention (GQA) and quantization support (FP8, GPTQ, etc.).
Performance Summary
Tencent's Hunyuan A13B Instruct, an MoE model with 13B active and 80B total parameters, ranks among the fastest models across six benchmarks and among the most competitively priced across five. It completed every evaluated benchmark with a 100% success rate, indicating minimal technical failures.
Accuracy is mixed. The model is strong in Reasoning (94.0%) and General Knowledge (97.5%), and its Reasoning score is the highest among models at its price point. Coding is solid at 90.0%. Instruction Following is a critical weakness: the model scored 0.0% in both evaluated instances, which suggests a significant limitation in handling complex or multi-layered directives. Email Classification is also comparatively low at 91.0% (20th percentile). Overall, Hunyuan A13B Instruct is fast and inexpensive and performs well on reasoning and general knowledge, but it needs significant improvement in instruction following.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.03 |
Completion | $0.03 |
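As a rough illustration of what these rates mean per request, the sketch below multiplies token counts by the listed prices. The helper name `estimate_cost` and the example token counts are hypothetical, and the calculation assumes billing is strictly proportional to token count with no minimum charge or rounding, which the page does not confirm.

```python
from decimal import Decimal

# Prices copied from the pricing table, in US dollars per 1M tokens.
PROMPT_PRICE_PER_M = Decimal("0.03")
COMPLETION_PRICE_PER_M = Decimal("0.03")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes cost scales linearly with token count; real billing may
    round or apply minimums that are not documented here.
    """
    prompt_cost = Decimal(prompt_tokens) * PROMPT_PRICE_PER_M / TOKENS_PER_M
    completion_cost = (
        Decimal(completion_tokens) * COMPLETION_PRICE_PER_M / TOKENS_PER_M
    )
    return prompt_cost + completion_cost


# Example: a 20,000-token prompt with a 2,000-token completion.
cost = estimate_cost(20_000, 2_000)
print(f"${cost:.6f}")  # prints "$0.000660"
```

`Decimal` is used instead of `float` so that very small per-request amounts add up exactly when summed over many requests.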
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Chutes | tencent/hunyuan-a13b-instruct | 32K | $0.03 / 1M tokens | $0.03 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Free | Executions | Accuracy | Cost | Duration |
---|---|---|---|---|---|---|---|