NeverSleep: Llama 3 Lumimaid 70B

Text input Text output
Author's Description

The NeverSleep team is back, with a Llama 3 70B finetune trained on their curated roleplay data. Striking a balance between eRP and RP, Lumimaid was designed to be serious, yet uncensored when necessary. To enhance it's overall intelligence and chat capability, roughly 40% of the training data was not roleplay. This provides a breadth of knowledge to access, while still keeping roleplay as the primary strength. Usage of this model is subject to [Meta's Acceptable Use Policy](https://llama.meta.com/llama3/use-policy/).

Key Specifications
Cost
$$$$$
Context
8K
Parameters
70B
Released
May 15, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Stop Presence Penalty Top P Temperature Min P Seed Frequency Penalty Max Tokens
Performance Summary

NeverSleep: Llama 3 Lumimaid 70B demonstrates exceptional performance in terms of operational efficiency, consistently ranking among the fastest models available and offering highly competitive pricing across all benchmarks. This makes it a cost-effective and responsive solution for various applications. In terms of specific capabilities, Lumimaid shows a mixed performance. Its primary strength lies in Classification, achieving 94.0% accuracy in Email Classification, indicating strong contextual understanding for categorization tasks. Instruction Following is moderate at 45.0% accuracy, suggesting room for improvement in complex multi-step directives. However, the model exhibits significant weaknesses in more abstract and knowledge-based domains. It scored 0.0% accuracy in Coding, Ethics, and General Knowledge benchmarks, indicating a lack of foundational understanding or an inability to apply knowledge in these areas. Reasoning also presents a challenge with 48.0% accuracy. While designed with a focus on roleplay, the benchmark results suggest that the 40% non-roleplay training data has not yet translated into broad general intelligence or specialized knowledge in these critical areas. Its reliability is not explicitly stated as high or low, but the provided data suggests it consistently provides measurable responses.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $4
Completion $6

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Featherless
Featherless | neversleep/llama-3-lumimaid-70b 8K $4 / 1M tokens $6 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by neversleep