Last refreshed 2026-05-19. Next refresh: weekly.
Why use Llama 3.2 1B on Fireworks AI?
Fireworks AI serves Llama 3.2 1B with pay-as-you-go pricing at $0.10 per 1M input tokens and $0.10 per 1M output tokens. The platform is a generative AI service focused on rapid product iteration and cost-efficient AI deployment.
Setup recipe
Python
pip install openai
export FIREWORKS_API_KEY=...

import os
from openai import OpenAI
client = OpenAI(
    api_key=os.environ["FIREWORKS_API_KEY"],
    base_url="https://api.fireworks.ai/inference/v1",
)
Model ID: llama-3.2-1b
Request example
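import os
from openai import OpenAI

# Point the OpenAI SDK at Fireworks AI's OpenAI-compatible endpoint.
client = OpenAI(
    api_key=os.environ["FIREWORKS_API_KEY"],
    base_url="https://api.fireworks.ai/inference/v1",
)

# Model ID as listed on this page; Fireworks may require the full
# "accounts/fireworks/models/{model-name}" form (see Gotchas).
response = client.chat.completions.create(
    model="llama-3.2-1b",
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)
Gotchas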
- Fireworks model IDs use "accounts/fireworks/models/{model-name}" format, e.g. "accounts/fireworks/models/llama4-scout-instruct-basic" or "accounts/fireworks/models/deepseek-r1".
- The examples expect FIREWORKS_API_KEY; rename it only if your application config maps the new variable.
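The two gotchas above can be handled with a pair of small helpers. This is a minimal sketch, not part of the Fireworks or OpenAI SDKs: `to_fireworks_model_id` and `read_api_key` are hypothetical names, and the rule that any ID already starting with "accounts/" is left untouched is an assumption.

```python
import os

# Prefix taken from the model ID format described in the Gotchas list.
FIREWORKS_MODEL_PREFIX = "accounts/fireworks/models/"


def to_fireworks_model_id(name: str) -> str:
    """Return the account-scoped model ID for a short model name.

    Assumption: anything already starting with "accounts/" is a full ID
    and is returned unchanged.
    """
    if name.startswith("accounts/"):
        return name
    return FIREWORKS_MODEL_PREFIX + name


def read_api_key(env_var: str = "FIREWORKS_API_KEY") -> str:
    """Read the API key from env_var and fail clearly if it is unset."""
    key = os.environ.get(env_var)
    if not key:
        raise RuntimeError(f"Set {env_var} before creating the client")
    return key
```

Passing a different variable name to `read_api_key` keeps a renamed key in one place instead of scattering `os.environ[...]` lookups through the application.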
Pricing
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.10 |
| Output tokens | $0.10 |
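
As a rough illustration of how the table translates into a bill, the sketch below multiplies token counts by the listed rates. The function name `estimate_cost_usd` is hypothetical, and the rates are copied from the table above, so they may be out of date.

```python
from decimal import Decimal

# Rates from the pricing table: USD per 1M tokens.
INPUT_PRICE_PER_1M = Decimal("0.10")
OUTPUT_PRICE_PER_1M = Decimal("0.10")
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of a request from its token counts."""
    input_cost = INPUT_PRICE_PER_1M * input_tokens / TOKENS_PER_UNIT
    output_cost = OUTPUT_PRICE_PER_1M * output_tokens / TOKENS_PER_UNIT
    return input_cost + output_cost


if __name__ == "__main__":
    # 1M input tokens plus 500K output tokens.
    print(estimate_cost_usd(1_000_000, 500_000))
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift.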
Capabilities
No model capability flags are currently sourced.
About Llama 3.2 1B
Llama 3.2 1B is the 1B-parameter model in Meta's Llama 3.2 family. It offers a 128K-token context window, has openly available weights for self-hosting, and scores 28.1 on HumanEval.
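
A quick pre-flight check against the context window might look like the sketch below. Everything in it is an assumption: "128K" is read as 128,000 tokens, the four-characters-per-token ratio is a crude heuristic rather than the Llama tokenizer, and the helper names are hypothetical.

```python
# Assumption: "128K" is interpreted as 128,000 tokens.
CONTEXT_WINDOW_TOKENS = 128_000
# Assumption: rough average for English text, not a real tokenizer.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Approximate the token count of text using ceiling division."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(prompt: str, max_output_tokens: int) -> bool:
    """Check whether the prompt plus the reserved output budget fits."""
    return estimate_tokens(prompt) + max_output_tokens <= CONTEXT_WINDOW_TOKENS
```

For anything close to the limit, counting with the model's actual tokenizer is the safer choice.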
FAQ
What does Llama 3.2 1B cost on Fireworks AI?
On Fireworks AI, Llama 3.2 1B costs $0.10 per 1M input tokens and $0.10 per 1M output tokens.
What is the context window for Llama 3.2 1B on Fireworks AI?
Llama 3.2 1B supports a 128K-token context window on Fireworks AI.
Who created Llama 3.2 1B?
Llama 3.2 1B was created by AI at Meta as part of the Llama 3.2 model family.
Is Llama 3.2 1B open source?
Llama 3.2 1B has open weights, released under Meta's Llama 3.2 Community License. Open weights do not by themselves imply an OSI-approved open-source license.