Arcee AI: Virtuoso Medium V2

Text input Text output Unavailable
Author's Description

Virtuoso‑Medium‑v2 is a 32 B model distilled from DeepSeek‑v3 logits and merged back onto a Qwen 2.5 backbone, yielding a sharper, more factual successor to the original Virtuoso Medium. The team harvested ~1.1 B logit tokens and applied "fusion‑merging" plus DPO alignment, which pushed scores past Arcee‑Nova 2024 and many 40 B‑plus peers on MMLU‑Pro, MATH and HumanEval. With a 128 k context and aggressive quantization options (from BF16 down to 4‑bit GGUF), it balances capability with deployability on single‑GPU nodes. Typical use cases include enterprise chat assistants, technical writing aids and medium‑complexity code drafting where Virtuoso‑Large would be overkill.

Key Specifications
Cost
$$$
Context
131K
Parameters
32B (Rumoured)
Released
May 05, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Stop Presence Penalty Logit Bias Top P Tool Choice Temperature Min P Tools Response Format Frequency Penalty Max Tokens
Features

This model supports the following features:

Tools Response Format
Performance Summary

Arcee AI's Virtuoso-Medium-v2, a 32B model distilled from DeepSeek-v3 and merged onto a Qwen 2.5 backbone, demonstrates a strong balance of capability and deployability. Created on May 5, 2025, it consistently ranks among the fastest models, achieving the 91st percentile across six benchmarks, and offers competitive pricing, placing in the 55th percentile. Its reliability is exceptional, boasting a perfect 100th percentile, indicating minimal technical failures and consistent evaluable responses. Across benchmarks, Virtuoso-Medium-v2 shows particular strength in Ethics, achieving perfect 100% accuracy, making it the most accurate model at its price point and among models of similar speed. It also excels in General Knowledge with 99% accuracy and strong Instruction Following at 61% accuracy (70th percentile). While its Email Classification accuracy is high at 97%, its percentile ranking (50th) suggests this is a common performance level for models in this category. Its Coding (83%) and Reasoning (68%) performances are solid, placing it in the 62nd and 64th percentiles respectively. The model's aggressive quantization options and 128k context length make it suitable for enterprise chat, technical writing, and medium-complexity code drafting, offering a sharp, factual successor to its predecessor.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.5
Completion $0.8

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Together
Together | arcee-ai/virtuoso-medium-v2 131K $0.5 / 1M tokens $0.8 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by arcee-ai