Meta Llama 4 in Amazon Bedrock
Introducing Llama 4
The Meta Llama 4 models, which provide the most scalable iteration of Llama, usher in a new age for the Llama ecosystem. Llama 4 is designed to meet a variety of application needs to its inherent multimodality, mixture-of-experts architecture, extended context windows, notable performance enhancements, and optimized computational efficiency. The Llama 4 variants are versatile for a range of use cases since they are available in sizes that are simple to deploy.
Llama 4 Maverick 17B
Meta Llama 4 Maverick is a low-cost, natively multimodal model for text and visual comprehension with sophisticated intelligence and quick reactions.
Llama 4 Scout 17B
A naturally multimodal model, Llama 4 Scout combines powerful visual and textual intelligence with effective processing powers. The model’s substantial context handling allows for complex data processing, robust codebase reasoning, and thorough multi-document analysis.
What is Meta Llama 4
Meta’s new Llama 4 family of large language models is open-source and was made available in early April 2025. Using a mixture-of-experts (MoE) architecture, it consists of three models: Llama 4 Scout, Llama 4 Maverick, and Llama 4 Behemoth. Because these models are made for multimodal tasks, they can absorb and comprehend inputs that include text, images, and video.
Advantages
More efficient and customized
Llama 3.2 provides on-device processing for a more customized AI experience. Because of their increased efficiency, lower latency, and enhanced performance, the Llama 3.2 models are appropriate for a variety of applications.
Context window for 128K tokens
Llama can capture even more subtle correlations in data to its 128K context length.
Over 15 trillion tokens were used during pretraining
To better understand linguistic nuances, Llama models are trained on more than 15 trillion tokens from publicly available web data sources.
Multilingual support
English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai are among the eight languages that Llama 3.2 supports.
Absence of infrastructure management
Using Llama models has never been simpler to the Amazon Bedrock controlled API. Llama’s power is available to organizations of all sizes without requiring them to worry about the supporting infrastructure. You can safely integrate and use Llama’s generative AI capabilities into your applications using the AWS services you are already familiar with because Amazon Bedrock is serverless, which eliminates the need for infrastructure management. This implies that you may concentrate on developing your AI applications, which is what you do best.
Meet Llama
In the last ten years, Meta has concentrated on providing developers with tools and encouraging cooperation and progress amongst developers, researchers, and organisations. Because Llama models come in a variety of parameter sizes, developers can choose the one that best suits their requirements and inference budget. Because Amazon Bedrock’s Llama models eliminate the need for developers to deal about infrastructure management and scalability, they create a world of opportunities. Developers can start using Llama with Amazon Bedrock, which is a turnkey solution.
Use cases
Language nuances, contextual understanding, image comprehension and visual reasoning, and difficult tasks like visual data analysis, image captioning, conversation production, and translation are all areas in which Llama models thrive. They can also handle multistep tasks with ease. Other applications Sentiment analysis and nuance reasoning, language modelling, dialogue systems, code generation, visual grounding, document visual question answering, image-text retrieval, sophisticated visual reasoning and understanding, text summarization and accuracy, text classification, and following instructions are all areas in which Llama models excel.
Model versions
| Model | Parameters | Modalities | Max Tokens | Languages | Fine-tuning Supported | Main Use Cases |
|---|---|---|---|---|---|---|
| Llama 4 Maverick 17B | 400B total, 128 experts | Text + Image | 1M | English, French, German, Hindi, Italian, Portuguese, Spanish, Thai, Arabic, Indonesian, Tagalog, Vietnamese (Image: English) | No | Multilingual chat, assistant, image understanding, coding assistance, document understanding, research |
| Llama 4 Scout 17B | 109B total, 16 experts | Text + Image | 3.5M (10M soon) | Same as Maverick | No | Multilingual chat, coding, document intelligence, customer support, research, image analysis |
| Llama 3.3 70B | 70B | Text-only | 128K | English, German, French, Italian, Portuguese, Spanish, Thai | No | Content creation, enterprise apps, research, summarization, classification, code generation |
| Llama 3.2 90B | 90B | Text + Image | 128K | English, German, French, Italian, Portuguese, Hindi, Spanish, Thai | Yes | Visual reasoning, multimodal interaction, image QA, captioning, document processing |
| Llama 3.2 11B | 11B | Text + Image | 128K | Same as 90B | Yes | Visual reasoning, multimodal interaction, document understanding |
| Llama 3.2 3B | 3B | Text-only | 128K | Same as 90B | Yes | Low-latency text generation, summarization, sentiment analysis, mobile AI |
| Llama 3.2 1B | 1B | Text-only | 128K | Same as 90B | Yes | Fast, multilingual dialogue, knowledge retrieval, edge devices |
| Llama 3.1 405B | 405B | Text-only | 128K | Same as 90B | No | Enterprise AI, R&D, synthetic data, multilingual translation, reasoning, creativity |
| Llama 3.1 70B | 70B | Text-only | 128K | Same as 90B | Yes | Content creation, conversational AI, summarization, translation |
| Llama 3.1 8B | 8B | Text-only | 128K | Same as 90B | Yes | Lightweight summarization, classification, translation |
| Llama 3 70B | 70B | Text-only | 8K | English | No | Summarization, sentiment analysis, dialogue systems, code generation |
| Llama 3 8B | 8B | Text-only | 8K | English | No | Lightweight summarization, classification, sentiment analysis |
| Llama 2 70B | 70B | Text-only | 4K | English | Yes | Assistant-like chat, large-scale text generation |
| Llama 2 13B | 13B | Text-only | 4K | English | Yes | Assistant-like chat, text classification, translation |

