Sao10K: Llama 3 8B Lunaris

Text input Text output
Author's Description

Lunaris 8B is a versatile generalist and roleplaying model based on Llama 3. It's a strategic merge of multiple models, designed to balance creativity with improved logic and general knowledge. Created by [Sao10k](https://huggingface.co/Sao10k), this model aims to offer an improved experience over Stheno v3.2, with enhanced creativity and logical reasoning. For best results, use with Llama 3 Instruct context template, temperature 1.4, and min_p 0.1.

Key Specifications
Cost
$
Context
8K
Parameters
300B
Released
Aug 12, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Stop Presence Penalty Top P Temperature Seed Min P Response Format Frequency Penalty Max Tokens
Features

This model supports the following features:

Response Format
Performance Summary

Sao10K: Llama 3 8B Lunaris, created on August 12, 2024, demonstrates strong performance in terms of operational efficiency. It consistently ranks in the top tier for speed, performing in the 79th percentile across six benchmarks, indicating it is among the fastest models available. Furthermore, its pricing is highly competitive, placing it in the 97th percentile, making it one of the most cost-effective options. While excelling in speed and cost, the model exhibits varied performance across different benchmark categories. It shows notable strengths in Instruction Following, Email Classification, and General Knowledge, achieving 41%, 89%, and 69% accuracy respectively, with particularly high efficiency in duration and cost for these tasks. However, its performance in Coding (4.0% accuracy) and Reasoning (34.7% accuracy) is a significant weakness, placing it in the lower percentiles for these categories. Ethics performance is moderate at 68.5%. The model's reliability is not explicitly detailed in the provided data, but its consistent benchmark completion suggests a functional operational state. Overall, Lunaris 8B is a highly economical and fast model, well-suited for tasks requiring general knowledge, instruction adherence, and classification, but less effective for complex coding or advanced reasoning challenges.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.02
Completion $0.05

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | sao10k/l3-lunaris-8b 8K $0.02 / 1M tokens $0.05 / 1M tokens
Novita
Novita | sao10k/l3-lunaris-8b 8K $0.05 / 1M tokens $0.05 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by sao10k