Goliath 120B

Text input Text output
Author's Description

A large LLM created by combining two fine-tuned Llama 70B models into one 120B model. Combines Xwin and Euryale. Credits to - [@chargoddard](https://huggingface.co/chargoddard) for developing the framework used to merge the model - [mergekit](https://github.com/cg123/mergekit). - [@Undi95](https://huggingface.co/Undi95) for helping with the merge ratios. #merge

Key Specifications
Cost
$$$$$
Context
6K
Parameters
120B
Released
Nov 09, 2023
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Stop Presence Penalty Logit Bias Top P Temperature Min P Seed Frequency Penalty Max Tokens
Performance Summary

Goliath 120B, an alpindale model created by merging two fine-tuned Llama 70B models (Xwin and Euryale), consistently ranks among the fastest models and offers highly competitive pricing across various benchmarks. Its speed and cost-efficiency are significant advantages. In terms of performance, Goliath 120B demonstrates a notable strength in classification tasks, achieving 95.0% accuracy in Email Classification, placing it in the 34th percentile for this category. This suggests strong capabilities in discerning context and categorizing information. However, the model exhibits significant weaknesses in other areas. Its performance in Instruction Following is particularly concerning, with 0.0% accuracy, indicating a fundamental limitation in processing and executing complex multi-step instructions. Similarly, its accuracy in Coding (22.0%), Reasoning (39.0%), and Ethics (44.0%) benchmarks is relatively low, placing it in the lower percentiles for these categories. While its cost performance is generally strong across all benchmarks, the low accuracy in critical areas like coding and reasoning suggests that while it's cost-effective, its utility for tasks requiring high precision in these domains may be limited.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $5
Completion $6.25

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Mancer 2
Mancer 2 | alpindale/goliath-120b 6K $5 / 1M tokens $6.25 / 1M tokens
NextBit
NextBit | alpindale/goliath-120b 6K $9 / 1M tokens $11 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by alpindale