The Evolution of Llama: From Llama 1 to Llama 3.1 | by Luís Roque | Sep, 2024
data:image/s3,"s3://crabby-images/4e869/4e86997237a6998194ca7409c2aac8f1f2a4b298" alt=""
[ad_1]
A Comprehensive Guide to the Advancements and Innovations in the Family of Llama Models from Meta AI
This post was co-authored with Rafael Guedes.
Meta has released three major versions of its large language model (LLM), Llama, along with a minor (if we can call it that) update (version 3.1). The initial release of Llama in early 2023 marked a significant step forward for the open-source community in natural language processing (NLP). Meta has consistently contributed to this community by sharing its latest LLM versions.
To ensure correctness, we should distinguish between open and open-source LLMs. Open-source software traditionally makes its source code available under specific public use and modification licenses. In the context of LLMs, open LLMs typically disclose model weights and initial code. At the same time, open-source LLMs would also share the entire training process, including training data, with a permissive license. Most models today, including Meta’s Llama, fall under the open LLMs category since they do not release the datasets used for training.
Llama has undergone three key architectural iterations. Version 1 introduced several enhancements to the original Transformer architecture. Version 2 implemented Grouped-Query Attention (GQA) in larger models. Version 3 extended GQA to smaller models, introduced a more efficient…
[ad_2]