Meta: Llama 4 Scout

meta-llama/llama-4-scout

Created Apr 5, 2025131,072 context
$0.08/M input tokens$0.45/M output tokens$0.5013/K input imgs

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input (text and image) and multilingual output (text and code) across 12 supported languages. Designed for assistant-style interaction and visual reasoning, Scout uses 16 experts per forward pass and features a context length of 10 million tokens, with a training corpus of ~40 trillion tokens.

Built for high efficiency and local or commercial deployment, Llama 4 Scout incorporates early fusion for seamless modality integration. It is instruction-tuned for use in multilingual chat, captioning, and image understanding tasks. Released under the Llama 4 Community License, it was last trained on data up to August 2024 and launched publicly on April 5, 2025.

Providers for Llama 4 Scout

OpenRouter routes requests to the best providers that are able to handle your prompt size and parameters, with fallbacks to maximize uptime.

Context
131K
Max Output
131K
Input
$0.08
Output
$0.45
US
bf16
Context
353K
Max Output
353K
Input
$0.09
Output
$0.48
US
bf16
Context
328K
Max Output
8K
Input
$0.1
Output
$0.3
US
fp8
Context
1.05M
Max Output
1.05M
Input
$0.1
Output
$0.3
Context
131K
Max Output
131K
Input
$0.1
Output
$0.5
Context
131K
Max Output
8K
Input
$0.11
Output
$0.34

Throughput

Latency

Apps using Llama 4 Scout

Top public apps this week using this model

1.
Cline
Autonomous coding agent right in your IDE
122Mtokens
2.
Roo Code
A whole dev team of AI agents in your editor
103Mtokens
3.
tyan.ai
new
84.6Mtokens
4.
shapes inc
General purpose social agents
77.6Mtokens
5.
FastLLM
High-performance parallel LLM API request tool
75.4Mtokens
6.
Secret Desires AI
Build your perfect partner
70.7Mtokens
7.
OpenRouter: Chatroom
Chat with multiple LLMs at once
50.2Mtokens
8.
liteLLM
Open-source library to simplify LLM calls
30.7Mtokens
9.
Aider
AI pair programming in your terminal
29.7Mtokens
10.
Open WebUI
Extensible, self-hosted AI interface
22.6Mtokens
11.
SillyTavern
LLM frontend for power users
20.5Mtokens
12.
Vectal
new
13.2Mtokens
13.
pusoy-optimizer.com
new
11.9Mtokens
14.
big-AGI
Precision AI for professionals
6.98Mtokens
15.
Mantella
Skyrim & Fallout 4 mod, naturally speak to NPCs
6.32Mtokens
16.
Chub AI
GenAI for everyone
3.13Mtokens
17.
AI Writing Companion
new
2.43Mtokens
18.
Kortex
new
2.03Mtokens
19.
AnythingLLM
new
2Mtokens
20.
ChatWise
Chatbot with artifacts and search
1.97Mtokens

Recent activity on Llama 4 Scout

Tokens processed per day

Apr 5Apr 6Apr 7Apr 8Apr 9Apr 10Apr 110250M500M750M1B

Uptime stats for Llama 4 Scout

Uptime stats for Llama 4 Scout across all providers

When an error occurs in an upstream provider, we can recover by routing to another healthy provider, if your request filters allow it.

Learn more about our load balancing and customization options.

Sample code and API for Llama 4 Scout

OpenRouter normalizes requests and responses across providers for you.

OpenRouter provides an OpenAI-compatible completion API to 300+ models & providers that you can call directly, or using the OpenAI SDK. Additionally, some third-party SDKs are available.

In the examples below, the OpenRouter-specific headers are optional. Setting them allows your app to appear on the OpenRouter leaderboards.

from openai import OpenAI

client = OpenAI(
  base_url="https://openrouter.ai/api/v1",
  api_key="<OPENROUTER_API_KEY>",
)

completion = client.chat.completions.create(
  extra_headers={
    "HTTP-Referer": "<YOUR_SITE_URL>", # Optional. Site URL for rankings on openrouter.ai.
    "X-Title": "<YOUR_SITE_NAME>", # Optional. Site title for rankings on openrouter.ai.
  },
  extra_body={},
  model="meta-llama/llama-4-scout",
  messages=[
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "What is in this image?"
        },
        {
          "type": "image_url",
          "image_url": {
            "url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg"
          }
        }
      ]
    }
  ]
)
print(completion.choices[0].message.content)

Using third-party SDKs

For information about using third-party SDKs and frameworks with OpenRouter, please see our frameworks documentation.

See the Request docs for all possible fields, and Parameters for explanations of specific sampling parameters.

More models from Meta Llama

    Llama 4 Scout - API, Providers, Stats | OpenRouter