PromptLoop
News Analyse Werkstatt Generative Medien Originals Glossar KI-Modelle

Meta — KI-Modelle im Überblick

Meta veröffentlicht mit der Llama-Reihe Open-Weight-Modelle, die in der Open-Source-Community zur Standardgrundlage geworden sind.

10aktive Modelle
18,4Top-Quality (Meta: Llama 4 Maverick)
28. April 2026Letzte Synchronisation
#1

Meta: Llama 4 Maverick

meta-llama/llama-4-maverick
18,4
Quality
Speed110,7 t/s
Latency707 ms
Input$0.35
Output$0.85

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...

#2

Meta: Llama 3.3 70B Instruct

meta-llama/llama-3.3-70b-instruct
14,5
Quality
Speed89,9 t/s
Latency562 ms
Input$0.585
Output$0.71

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model...

#3

Meta: Llama 4 Scout

meta-llama/llama-4-scout
13,5
Quality
Speed122,2 t/s
Latency501 ms
Input$0.17
Output$0.66

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input...

#4

Meta: Llama 3.1 70B Instruct

meta-llama/llama-3.1-70b-instruct
12,5
Quality
Speed31,2 t/s
Latency658 ms
Input$0.56
Output$0.56

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 70B instruct-tuned version is optimized for high quality dialogue usecases. It has demonstrated strong...

#5

Meta: Llama 3.1 8B Instruct

meta-llama/llama-3.1-8b-instruct
11,8
Quality
Speed166,5 t/s
Latency460 ms
Input$0.10
Output$0.10

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...

#6

Meta: Llama 3.2 3B Instruct

meta-llama/llama-3.2-3b-instruct
9,7
Quality
Speed51,8 t/s
Latency612 ms
Input$0.15
Output$0.15

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it...

#7

Meta: Llama 3 70B Instruct

meta-llama/llama-3-70b-instruct
8,9
Quality
Speed45,4 t/s
Latency687 ms
Input$0.65
Output$2.75

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 70B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong...

#8

Meta: Llama 3.2 11B Vision Instruct

meta-llama/llama-3.2-11b-vision-instruct
8,7
Quality
Speed77,4 t/s
Latency437 ms
Input$0.245
Output$0.245

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

#9

Meta: Llama 3 8B Instruct

meta-llama/llama-3-8b-instruct
6,4
Quality
Speed82,0 t/s
Latency520 ms
Input$0.045
Output$0.145

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong...

#10

Meta: Llama 3.2 1B Instruct

meta-llama/llama-3.2-1b-instruct
6,3
Quality
Speed191,2 t/s
Latency628 ms
Input$0.10
Output$0.10

Llama 3.2 1B is a 1-billion-parameter language model focused on efficiently performing natural language tasks, such as summarization, dialogue, and multilingual text analysis. Its smaller size allows it to operate...

📬 KI-News direkt ins Postfach