Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.
14.5M Pulls Updated 3 weeks ago
Updated 3 weeks ago
3 weeks ago
46e0c10c039e · 4.9GB
Readme
Meta Llama 3.1
Llama 3.1 family of models available:
- 8B
- 70B
- 405B
Llama 3.1 405B is the first openly available model that rivals the top AI models when it comes to state-of-the-art capabilities in general knowledge, steerability, math, tool use, and multilingual translation.
The upgraded versions of the 8B and 70B models are multilingual and have a significantly longer context length of 128K, state-of-the-art tool use, and overall stronger reasoning capabilities. This enables Meta’s latest models to support advanced use cases, such as long-form text summarization, multilingual conversational agents, and coding assistants.
Meta also has made changes to their license, allowing developers to use the outputs from Llama models, including the 405B model, to improve other models.
Model evaluations
For this release, Meta has evaluation the performance on over 150 benchmark datasets that span a wide range of languages. In addition, Meta performed extensive human evaluations that compare Llama 3.1 with competing models in real-world scenarios. Meta’s experimental evaluation suggests that our flagship model is competitive with leading foundation models across a range of tasks, including GPT-4, GPT-4o, and Claude 3.5 Sonnet. Additionally, Meta’s smaller models are competitive with closed and open models that have a similar number of parameters.