LLaMA

An LLM released by Meta, with up to 65B parameters

Categories

Llama 2 - Meta releases its next-generation language model, Llama 2
Llama 3 - Meta releases its next-generation language model, Llama 3
Llama 3.1
Llama 3.2 - Revolutionizing edge AI and vision with open, customizable models (Meta has released the first large multimodal model (LMM) in the 'Llama' series that understands both images and text)
Llama 3.3 - At 70B, a GPT-4-class model can now be run on a laptop
Llama 4 - The open-source model most friendly to the Korean language.
nano-llama31 - A nanoGPT-style version of Llama 3.1
ntransformer - An NVMe-to-GPU inference engine that runs Llama 3.1 70B on a single RTX 3090

Linux:

curl -o- https://raw.githubusercontent.com/shawwn/llama-dl/56f50b96072f42fb2520b1ad5a1d6ef30351f23c/llama.sh | bash

For reference, when I downloaded it on answerver, it took about 157 minutes and occupied 220 GB of disk space.
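
Before starting a download of this size, it can help to check free disk space and estimate the average transfer rate implied by the figures above. The following is a minimal sketch, not part of the llama-dl script: the function names `average_throughput_mb_s` and `has_enough_space` are hypothetical, and it assumes the reported 220 GB is in decimal units (1 GB = 1000 MB).

```python
import shutil

# Figures reported in the note above; treated as decimal gigabytes (an assumption).
REPORTED_SIZE_GB = 220
REPORTED_MINUTES = 157


def average_throughput_mb_s(size_gb: float, minutes: float) -> float:
    """Average download rate in MB/s, assuming 1 GB = 1000 MB."""
    return size_gb * 1000 / (minutes * 60)


def has_enough_space(path: str, required_gb: float) -> bool:
    """Return True if the filesystem holding `path` has at least `required_gb` free."""
    free_bytes = shutil.disk_usage(path).free
    return free_bytes >= required_gb * 1000**3


if __name__ == "__main__":
    rate = average_throughput_mb_s(REPORTED_SIZE_GB, REPORTED_MINUTES)
    print(f"average throughput: {rate:.1f} MB/s")
    print("enough space here:", has_enough_space(".", REPORTED_SIZE_GB))
```

Under the decimal-unit assumption, the reported figures work out to roughly 23 MB/s on average. Actual download time depends on the network and the host, so this is only a rough guide.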

Introducing LLaMA: A foundational, 65-billion-parameter language model
LLMs are having their Stable Diffusion moment | GeekNews
- [Original] Large language models are having their Stable Diffusion moment
A brief history of LLaMA models | GeekNews
- [Original] A brief history of LLaMA models - AGI Sphere
- LLaMA - (7B, 13B, 33B, 65B), CommonCrawl/C4/GitHub/Wikipedia/Gutenberg & Books3/ArXiv/StackExchange
- Alpaca - 52k GPT-3 instructions
- Vicuna - 70k ChatGPT conversations
- Koala - 117k cleaned ChatGPT conversations
- GPT4-x-Alpaca - 20k GPT4 instructions
- WizardLM - 70k instructions synthesized with ChatGPT/GPT-3
- OpenAssistant - 600k human interactions (OpenAssistant Conversations)