NeverSleep: Llama 3 Lumimaid 8B

Text input Text output Unavailable
Author's Description

The NeverSleep team is back, with a Llama 3 8B finetune trained on their curated roleplay data. Striking a balance between eRP and RP, Lumimaid was designed to be serious, yet uncensored when necessary. To enhance it's overall intelligence and chat capability, roughly 40% of the training data was not roleplay. This provides a breadth of knowledge to access, while still keeping roleplay as the primary strength. Usage of this model is subject to [Meta's Acceptable Use Policy](https://llama.meta.com/llama3/use-policy/).

Key Specifications
Cost
$$$
Context
24K
Parameters
8B
Released
May 03, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Logit Bias Stop Seed Min P Top P Max Tokens Frequency Penalty Temperature Presence Penalty
Performance Summary

NeverSleep: Llama 3 Lumimaid 8B demonstrates competitive response times, ranking in the 54th percentile across various benchmarks. It also typically provides cost-effective solutions, placing in the 63rd percentile for pricing. The model's reliability information was not provided, so no specific comment can be made on this aspect. In terms of benchmark performance, Lumimaid 8B shows a mixed profile. Its highest accuracy was observed in Email Classification at 93.0%, though this still places it in the 25th percentile for that category. Performance in General Knowledge (66.5% accuracy, 19th percentile) and Ethics (61.0% accuracy, 17th percentile) indicates a need for improvement in these foundational areas. The model struggled most with Coding, achieving only 45.0% accuracy, placing it in the 21st percentile. A key strength appears to be its cost-efficiency, particularly in Coding where it achieved a 70th percentile for cost, and in Email Classification (63rd percentile). Its primary weakness lies in its overall accuracy across all tested categories, consistently ranking in the lower percentiles. While designed for roleplay, its general intelligence and chat capabilities, which comprise 40% of its training data, do not translate into strong benchmark performance in knowledge, ethics, or coding.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.8
Completion $1.2

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Mancer 2
Mancer 2 | neversleep/llama-3-lumimaid-8b 24K $0.8 / 1M tokens $1.2 / 1M tokens
Featherless
Featherless | neversleep/llama-3-lumimaid-8b 8K $0.8 / 1M tokens $1.2 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by neversleep