Author's Description
Virtuoso‑Medium‑v2 is a 32 B model distilled from DeepSeek‑v3 logits and merged back onto a Qwen 2.5 backbone, yielding a sharper, more factual successor to the original Virtuoso Medium. The team harvested ~1.1 B logit tokens and applied "fusion‑merging" plus DPO alignment, which pushed scores past Arcee‑Nova 2024 and many 40 B‑plus peers on MMLU‑Pro, MATH and HumanEval. With a 128 k context and aggressive quantization options (from BF16 down to 4‑bit GGUF), it balances capability with deployability on single‑GPU nodes. Typical use cases include enterprise chat assistants, technical writing aids and medium‑complexity code drafting where Virtuoso‑Large would be overkill.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Arcee AI's Virtuoso Medium V2, a 32B model leveraging DeepSeek-v3 logits and a Qwen 2.5 backbone, demonstrates a strong balance of capability and deployability. It consistently ranks among the fastest models, placing in the 92nd percentile across five benchmarks, and offers competitive pricing, ranking in the 54th percentile. Notably, the model exhibits exceptional reliability with a 100% success rate across all benchmarks, indicating minimal technical failures. In terms of performance across categories, Virtuoso Medium V2 achieved perfect accuracy in Ethics (100%), making it the most accurate model at its price point and among models of comparable speed. It also performed strongly in General Knowledge (99.0% accuracy) and Coding (83.0% accuracy). While its Email Classification accuracy was 97.0%, its Instruction Following score of 61.0% suggests a potential area for improvement, though it still ranks in the 66th percentile for this category. Its key strengths lie in its speed, reliability, and ethical reasoning capabilities, making it well-suited for enterprise chat assistants and technical writing aids where high accuracy and consistent performance are crucial.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.5 |
Completion | $0.8 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Together
|
Together | arcee-ai/virtuoso-medium-v2 | 131K | $0.5 / 1M tokens | $0.8 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by arcee-ai
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Arcee AI: AFM 4.5B | Sep 16, 2025 | 5B | 65K |
Text input
Text output
|
★ | ★★ | $$$ |
Arcee AI: Caller Large Unavailable | May 05, 2025 | — | 32K |
Text input
Text output
|
★★★★★ | ★★★ | $$$$ |
Arcee AI: Spotlight | May 05, 2025 | ~7B | 131K |
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$ |
Arcee AI: Maestro Reasoning | May 05, 2025 | ~32B | 131K |
Text input
Text output
|
★★ | ★★★★ | $$$$$ |
Arcee AI: Virtuoso Large | May 05, 2025 | ~72B | 131K |
Text input
Text output
|
★★★★★ | ★★★★ | $$$$ |
Arcee AI: Coder Large | May 05, 2025 | ~32B | 32K |
Text input
Text output
|
★★★★★ | ★★★★ | $$$ |
Arcee AI: Arcee Blitz Unavailable | May 05, 2025 | ~24B | 32K |
Text input
Text output
|
★★★★ | ★★★★ | $$$ |