Arcee AI: Virtuoso Medium V2

Text input Text output Unavailable
Author's Description

Virtuoso‑Medium‑v2 is a 32 B model distilled from DeepSeek‑v3 logits and merged back onto a Qwen 2.5 backbone, yielding a sharper, more factual successor to the original Virtuoso Medium. The team harvested ~1.1 B logit tokens and applied "fusion‑merging" plus DPO alignment, which pushed scores past Arcee‑Nova 2024 and many 40 B‑plus peers on MMLU‑Pro, MATH and HumanEval. With a 128 k context and aggressive quantization options (from BF16 down to 4‑bit GGUF), it balances capability with deployability on single‑GPU nodes. Typical use cases include enterprise chat assistants, technical writing aids and medium‑complexity code drafting where Virtuoso‑Large would be overkill.

Key Specifications
Cost
$$$
Context
131K
Parameters
32B (Rumoured)
Released
May 05, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Logit Bias Tool Choice Response Format Stop Min P Top P Max Tokens Frequency Penalty Temperature Presence Penalty
Features

This model supports the following features:

Tools Response Format
Performance Summary

Arcee AI's Virtuoso Medium V2, a 32B model leveraging DeepSeek-v3 logits and a Qwen 2.5 backbone, demonstrates a strong balance of capability and deployability. It consistently ranks among the fastest models, placing in the 92nd percentile across five benchmarks, and offers competitive pricing, ranking in the 54th percentile. Notably, the model exhibits exceptional reliability with a 100% success rate across all benchmarks, indicating minimal technical failures. In terms of performance across categories, Virtuoso Medium V2 achieved perfect accuracy in Ethics (100%), making it the most accurate model at its price point and among models of comparable speed. It also performed strongly in General Knowledge (99.0% accuracy) and Coding (83.0% accuracy). While its Email Classification accuracy was 97.0%, its Instruction Following score of 61.0% suggests a potential area for improvement, though it still ranks in the 66th percentile for this category. Its key strengths lie in its speed, reliability, and ethical reasoning capabilities, making it well-suited for enterprise chat assistants and technical writing aids where high accuracy and consistent performance are crucial.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.5
Completion $0.8

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Together
Together | arcee-ai/virtuoso-medium-v2 131K $0.5 / 1M tokens $0.8 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by arcee-ai