Author's Description
Virtuoso‑Medium‑v2 is a 32 B model distilled from DeepSeek‑v3 logits and merged back onto a Qwen 2.5 backbone, yielding a sharper, more factual successor to the original Virtuoso Medium. The team harvested ~1.1 B logit tokens and applied "fusion‑merging" plus DPO alignment, which pushed scores past Arcee‑Nova 2024 and many 40 B‑plus peers on MMLU‑Pro, MATH and HumanEval. With a 128 k context and aggressive quantization options (from BF16 down to 4‑bit GGUF), it balances capability with deployability on single‑GPU nodes. Typical use cases include enterprise chat assistants, technical writing aids and medium‑complexity code drafting where Virtuoso‑Large would be overkill.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Arcee AI's Virtuoso-Medium-v2, a 32B model distilled from DeepSeek-v3 and merged onto a Qwen 2.5 backbone, demonstrates a strong balance of capability and deployability. Created on May 5, 2025, it consistently ranks among the fastest models, achieving the 91st percentile across six benchmarks, and offers competitive pricing, placing in the 55th percentile. Its reliability is exceptional, boasting a perfect 100th percentile, indicating minimal technical failures and consistent evaluable responses. Across benchmarks, Virtuoso-Medium-v2 shows particular strength in Ethics, achieving perfect 100% accuracy, making it the most accurate model at its price point and among models of similar speed. It also excels in General Knowledge with 99% accuracy and strong Instruction Following at 61% accuracy (70th percentile). While its Email Classification accuracy is high at 97%, its percentile ranking (50th) suggests this is a common performance level for models in this category. Its Coding (83%) and Reasoning (68%) performances are solid, placing it in the 62nd and 64th percentiles respectively. The model's aggressive quantization options and 128k context length make it suitable for enterprise chat, technical writing, and medium-complexity code drafting, offering a sharp, factual successor to its predecessor.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.5 |
Completion | $0.8 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Together
|
Together | arcee-ai/virtuoso-medium-v2 | 131K | $0.5 / 1M tokens | $0.8 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by arcee-ai
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Arcee AI: Caller Large Unavailable | May 05, 2025 | — | 32K |
Text input
Text output
|
★★★★ | ★★★ | $$$$ |
Arcee AI: Spotlight | May 05, 2025 | ~7B | 131K |
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$$ |
Arcee AI: Maestro Reasoning | May 05, 2025 | ~32B | 131K |
Text input
Text output
|
★★ | ★★★★ | $$$$$ |
Arcee AI: Virtuoso Large | May 05, 2025 | ~72B | 131K |
Text input
Text output
|
★★★★★ | ★★★★ | $$$$ |
Arcee AI: Coder Large | May 05, 2025 | ~32B | 32K |
Text input
Text output
|
★★★★★ | ★★★★ | $$$ |
Arcee AI: Arcee Blitz Unavailable | May 05, 2025 | ~24B | 32K |
Text input
Text output
|
★★★★ | ★★★★ | $$$ |