Meta veröffentlicht mit der Llama-Reihe Open-Weight-Modelle, die in der Open-Source-Community zur Standardgrundlage geworden sind.
10aktive Modelle
18,4Top-Quality (Meta: Llama 4 Maverick)
28. April 2026Letzte Synchronisation
#1
Meta: Llama 4 Maverick
meta-llama/llama-4-maverick
18,4
Quality
Speed110,7 t/s
Latency707 ms
Input$0.35
Output$0.85
Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...
#2
Meta: Llama 3.3 70B Instruct
meta-llama/llama-3.3-70b-instruct
14,5
Quality
Speed89,9 t/s
Latency562 ms
Input$0.585
Output$0.71
The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model...
#3
Meta: Llama 4 Scout
meta-llama/llama-4-scout
13,5
Quality
Speed122,2 t/s
Latency501 ms
Input$0.17
Output$0.66
Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input...
#4
Meta: Llama 3.1 70B Instruct
meta-llama/llama-3.1-70b-instruct
12,5
Quality
Speed31,2 t/s
Latency658 ms
Input$0.56
Output$0.56
Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 70B instruct-tuned version is optimized for high quality dialogue usecases. It has demonstrated strong...
#5
Meta: Llama 3.1 8B Instruct
meta-llama/llama-3.1-8b-instruct
11,8
Quality
Speed166,5 t/s
Latency460 ms
Input$0.10
Output$0.10
Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...
#6
Meta: Llama 3.2 3B Instruct
meta-llama/llama-3.2-3b-instruct
9,7
Quality
Speed51,8 t/s
Latency612 ms
Input$0.15
Output$0.15
Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it...
#7
Meta: Llama 3 70B Instruct
meta-llama/llama-3-70b-instruct
8,9
Quality
Speed45,4 t/s
Latency687 ms
Input$0.65
Output$2.75
Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 70B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong...
#8
Meta: Llama 3.2 11B Vision Instruct
meta-llama/llama-3.2-11b-vision-instruct
8,7
Quality
Speed77,4 t/s
Latency437 ms
Input$0.245
Output$0.245
Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...
#9
Meta: Llama 3 8B Instruct
meta-llama/llama-3-8b-instruct
6,4
Quality
Speed82,0 t/s
Latency520 ms
Input$0.045
Output$0.145
Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong...
#10
Meta: Llama 3.2 1B Instruct
meta-llama/llama-3.2-1b-instruct
6,3
Quality
Speed191,2 t/s
Latency628 ms
Input$0.10
Output$0.10
Llama 3.2 1B is a 1-billion-parameter language model focused on efficiently performing natural language tasks, such as summarization, dialogue, and multilingual text analysis. Its smaller size allows it to operate...