Author's Description
A large LLM created by combining two fine-tuned Llama 70B models into one 120B model. Combines Xwin and Euryale. Credits to - [@chargoddard](https://huggingface.co/chargoddard) for developing the framework used to merge the model - [mergekit](https://github.com/cg123/mergekit). - [@Undi95](https://huggingface.co/Undi95) for helping with the merge ratios. #merge
Key Specifications
Supported Parameters
This model supports the following parameters:
Performance Summary
Goliath 120B, an alpindale model created by merging two fine-tuned Llama 70B models (Xwin and Euryale), consistently ranks among the fastest models and offers highly competitive pricing across various benchmarks. Its speed and cost-efficiency are significant advantages. In terms of performance, Goliath 120B demonstrates a notable strength in classification tasks, achieving 95.0% accuracy in Email Classification, placing it in the 34th percentile for this category. This suggests strong capabilities in discerning context and categorizing information. However, the model exhibits significant weaknesses in other areas. Its performance in Instruction Following is particularly concerning, with 0.0% accuracy, indicating a fundamental limitation in processing and executing complex multi-step instructions. Similarly, its accuracy in Coding (22.0%), Reasoning (39.0%), and Ethics (44.0%) benchmarks is relatively low, placing it in the lower percentiles for these categories. While its cost performance is generally strong across all benchmarks, the low accuracy in critical areas like coding and reasoning suggests that while it's cost-effective, its utility for tasks requiring high precision in these domains may be limited.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $5 |
Completion | $6.25 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Mancer 2
|
Mancer 2 | alpindale/goliath-120b | 6K | $5 / 1M tokens | $6.25 / 1M tokens |
NextBit
|
NextBit | alpindale/goliath-120b | 6K | $9 / 1M tokens | $11 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by alpindale
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Magnum 72B Unavailable | Jul 10, 2024 | 72B | 16K |
Text input
Text output
|
★★ | ★★★★ | $$$$$ |