What is Llama 3 used for?

Llama 3 is used for agent workflows and tool use, structured outputs, and coding. The family description and listed model capabilities point to those workloads as the best fit.

How does Llama 3 compare to MOSS-Audio?

Llama 3 by AI at Meta is strongest where you need agent workflows and tool use, while MOSS-Audio by MOSI AI is the closest related family to check for multimodal. Llama 3 has 11 listed variants and reaches up to 8k context, so compare the specs and pricing tables before choosing a production model.

Which Llama 3 model should I use?

For the lowest listed input price, start with Llama 3 8B Instruct through DeepInfra at $0.02/1M input tokens. For the most capable/latest local choice, evaluate Together AI - Llama 3 8B Lite with 8k context and tool use, function calling, and structured outputs.

Llama 3 Models by AI at Meta

AI at MetaLlama 3 CommunityOpen weightsHighlightOpen Source

11 models2024–2025Up to 8k ctxFrom $0.02/1M input

Details

ResearcherAI at Meta

LicenseLlama 3 Community

Commercial useCommercial use: conditional

Models11

Released2024–2025

Max context8k

Capabilities

Function Calling1 of 11 models

Tool Use1 of 11 models

Structured Outputs7 of 11 models

Links

Website HuggingFace

About

Llama 3, developed by Meta AI and released in April 2024, represents a significant advancement in large language models (LLMs). Available in two configurations—8 billion and 70 billion parameters—the models offer both pretrained and instruction-tuned versions, enhancing their adaptability and effectiveness in dialogue scenarios. Llama 3 sets itself apart by being trained on over 15 trillion tokens of publicly available data, a massive expansion over its predecessor, Llama 2, and includes a substantial increase in code data. The models not only excel in performance but also incorporate robust safety features like Llama Guard 2 and Code Shield, underscoring Meta's focus on responsible AI use. Llama 3 models are accessible on platforms such as AWS, Google Cloud, and Hugging Face, with plans for future updates that will expand their capabilities to include multimodal functionalities and multilingual support.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

11 in view

Together AI - Llama 3 8B LiteCurrent

Use when the workload needs 8k context, 8B parameters, and tool use.

2025-078k context8B parameterstool use

Llama 3 Taiwan 70B InstructCurrent

Use when the workload needs 8k context and 70B parameters.

2024-078k context70B parameters

Llama 3 70B InstructCurrent

Use when the workload needs 8k context, 70B parameters, and structured outputs.

2024-048k context70B parametersstructured outputs

Llama 3 8B InstructCurrent

Use when the workload needs 8k context, 8B parameters, and structured outputs.

2024-048k context8B parametersstructured outputs

Llama 3 70BCurrent

Use when the workload needs 8k context and 70B parameters.

2024-048k context70B parameters

Llama 3 8BCurrent

Use when the workload needs 8k context and 8B parameters.

2024-048k context8B parameters

Together AI Llama-3-8B-InstructCurrent

Use when the workload needs 8k context, 8B parameters, and structured outputs.

2024-048k context8B parametersstructured outputs

Together AI Llama-3-70B-InstructCurrent

Use when the workload needs 8k context, 70B parameters, and structured outputs.

2024-048k context70B parametersstructured outputs

DeepInfra Llama 3 8B InstructCurrent

Use when the workload needs 8k context, 8B parameters, and structured outputs.

2024-048k context8B parametersstructured outputs

DeepInfra Llama 3 70B InstructCurrent

Use when the workload needs 8k context, 70B parameters, and structured outputs.

2024-048k context70B parametersstructured outputs

Fireworks Llama-3-8B-InstructCurrent

Use when the workload needs 8k context and 8B parameters.

2024-048k context8B parameters

Current Llama 3 variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
Together AI - Llama 3 8B Lite	Use when the workload needs 8k context, 8B parameters, and tool use.	2025-07	8k context8B parameterstool use	Current
Llama 3 Taiwan 70B Instruct	Use when the workload needs 8k context and 70B parameters.	2024-07	8k context70B parameters	Current
Llama 3 70B Instruct	Use when the workload needs 8k context, 70B parameters, and structured outputs.	2024-04	8k context70B parametersstructured outputs	Current
Llama 3 8B Instruct	Use when the workload needs 8k context, 8B parameters, and structured outputs.	2024-04	8k context8B parametersstructured outputs	Current
Llama 3 70B	Use when the workload needs 8k context and 70B parameters.	2024-04	8k context70B parameters	Current
Llama 3 8B	Use when the workload needs 8k context and 8B parameters.	2024-04	8k context8B parameters	Current
Together AI Llama-3-8B-Instruct	Use when the workload needs 8k context, 8B parameters, and structured outputs.	2024-04	8k context8B parametersstructured outputs	Current
Together AI Llama-3-70B-Instruct	Use when the workload needs 8k context, 70B parameters, and structured outputs.	2024-04	8k context70B parametersstructured outputs	Current
DeepInfra Llama 3 8B Instruct	Use when the workload needs 8k context, 8B parameters, and structured outputs.	2024-04	8k context8B parametersstructured outputs	Current
DeepInfra Llama 3 70B Instruct	Use when the workload needs 8k context, 70B parameters, and structured outputs.	2024-04	8k context70B parametersstructured outputs	Current
Fireworks Llama-3-8B-Instruct	Use when the workload needs 8k context and 8B parameters.	2024-04	8k context8B parameters	Current

Release Timeline

3 release groups

2025-07

1 current

Together AI - Llama 3 8B Lite

8k context8B parameterstool use

Current

2024-07

1 current

Llama 3 Taiwan 70B Instruct

8k context70B parameters

Current

2024-04

9 current

DeepInfra Llama 3 70B Instruct

8k context70B parametersstructured outputs

Current

DeepInfra Llama 3 8B Instruct

8k context8B parametersstructured outputs

Current

Fireworks Llama-3-8B-Instruct

8k context8B parameters

Current

Llama 3 70B

8k context70B parameters

Current

Llama 3 70B Instruct

8k context70B parametersstructured outputs

Current

Llama 3 8B

8k context8B parameters

Current

Llama 3 8B Instruct

8k context8B parametersstructured outputs

Current

Together AI Llama-3-70B-Instruct

8k context70B parametersstructured outputs

Current

Together AI Llama-3-8B-Instruct

8k context8B parametersstructured outputs

Current

Specifications(11 models)

Llama 3 model specifications comparison
Model	Released	Context	Parameters	Fn Calling	Tool Use	Structured Outputs
Together AI - Llama 3 8B Lite	2025-07	8k	8B	Yes	Yes	Yes
Llama 3 Taiwan 70B Instruct	2024-07	8k	70B	No	No	No
Llama 3 70B Instruct	2024-04	8k	70B	No	No	Yes
Llama 3 8B Instruct	2024-04	8k	8B	No	No	Yes
Llama 3 70B	2024-04	8k	70B	No	No	No
Llama 3 8B	2024-04	8k	8B	No	No	No
Together AI Llama-3-8B-Instruct	2024-04	8k	8B	No	No	Yes
Together AI Llama-3-70B-Instruct	2024-04	8k	70B	No	No	Yes
DeepInfra Llama 3 8B Instruct	2024-04	8k	8B	No	No	Yes
DeepInfra Llama 3 70B Instruct	2024-04	8k	70B	No	No	Yes
Fireworks Llama-3-8B-Instruct	2024-04	8k	8B	No	No	No

Available From(20 providers)

Alibaba Cloud PAI-EAS

AWS Bedrock

Baseten API

Cloudflare Workers AI

Databricks Foundation Model Serving

DeepInfra

Fireworks AI

GCP Vertex AI +12 more

Pricing

Llama 3 model pricing by provider
Model	Provider	Input / 1M	Output / 1M	Type
Llama 3 8B Instruct	DeepInfra	$0.02	$0.05	Serverless
Llama 3 8B Instruct	OpenRouter	$0.03	$0.04	Serverless
Llama 3 8B Instruct	Novita AI	$0.04	$0.04	Serverless
DeepInfra Llama 3 8B Instruct	DeepInfra	$0.05	$0.15	Serverless
Llama 3 8B Instruct	Replicate API	$0.05	$0.25	Serverless
Llama 3 8B	Replicate API	$0.05	$0.25	Serverless
Llama 3 8B Instruct	Lepton AI API	$0.07	$0.07	Serverless
Together AI - Llama 3 8B Lite	Together AI	$0.1	$0.1	Serverless
Llama 3 8B Instruct	GCP Vertex AI	$0.12	$0.36	Serverless
Fireworks Llama-3-8B-Instruct	Fireworks AI	$0.15	$0.15	Serverless
Llama 3 8B Instruct	Together AI	$0.18	$0.18	Serverless
Llama 3 8B Instruct	Fireworks AI	$0.2	$0.2	Serverless
Together AI Llama-3-8B-Instruct	Together AI	$0.2	$0.2	Serverless
Llama 3 8B	Fireworks AI	$0.2	$0.2	Serverless
Llama 3 8B Instruct	AWS Bedrock	$0.3	$0.6	Serverless
Llama 3 8B Instruct	Microsoft Foundry	$0.37	$1.1	Serverless
Llama 3 70B Instruct	Hyperbolic AI Inference	$0.4	$0.4	Serverless
Llama 3 70B Instruct	DeepInfra	$0.45	$0.65	Serverless
DeepInfra Llama 3 70B Instruct	DeepInfra	$0.45	$0.65	Serverless
Llama 3 70B Instruct	OpenRouter	$0.51	$0.74	Serverless
Llama 3 70B Instruct	Novita AI	$0.51	$0.74	Serverless
Llama 3 8B Instruct	IBM watsonx	$0.6	$0.6	Serverless
Together AI Llama-3-70B-Instruct	Together AI	$0.6	$0.75	Serverless
Llama 3 70B Instruct	Replicate API	$0.65	$2.75	Serverless
Llama 3 70B	Replicate API	$0.65	$2.75	Serverless
Llama 3 70B Instruct	Lepton AI API	$0.8	$0.8	Serverless
Llama 3 70B Instruct	Together AI	$0.88	$0.88	Serverless
Llama 3 70B Instruct	Fireworks AI	$0.9	$0.9	Serverless
Llama 3 70B Instruct	AWS Bedrock	$0.99	$0.99	Serverless
Llama 3 70B Instruct	Databricks Foundation Model Serving	$1	$3	Serverless
Llama 3 70B Instruct	GCP Vertex AI	$1.2	$3.6	Serverless
Llama 3 70B Instruct	IBM watsonx	$1.8	$1.8	Serverless
Llama 3 70B Instruct	Microsoft Foundry	$3.78	$11.34	Serverless

Popular comparisons in this family

Frequently Asked Questions

What is Llama 3 used for?: Llama 3 is used for agent workflows and tool use, structured outputs, and coding. The family description and listed model capabilities point to those workloads as the best fit.
How does Llama 3 compare to MOSS-Audio?: Llama 3 by AI at Meta is strongest where you need agent workflows and tool use, while MOSS-Audio by MOSI AI is the closest related family to check for multimodal. Llama 3 has 11 listed variants and reaches up to 8k context, so compare the specs and pricing tables before choosing a production model.
Which Llama 3 model should I use?: For the lowest listed input price, start with Llama 3 8B Instruct through DeepInfra at $0.02/1M input tokens. For the most capable/latest local choice, evaluate Together AI - Llama 3 8B Lite with 8k context and tool use, function calling, and structured outputs.

Models(11)

Together AI - Llama 3 8B Lite

2025-078k8B1 provider

Open Weights

Llama 3 Taiwan 70B Instruct

2024-078k70B1 provider

Open Weights

Llama 3 70B Instruct

2024-048k70B18 providers

Open Weights

Llama 3 8B Instruct

2024-048k8B17 providers

Open Weights

Llama 3 70B

2024-048k70B1 provider

Open Weights

Llama 3 8B

2024-048k8B2 providers

Open Weights

Together AI Llama-3-8B-Instruct

2024-048k8B1 provider

Open Weights

Together AI Llama-3-70B-Instruct

2024-048k70B1 provider

Open Weights

DeepInfra Llama 3 8B Instruct

2024-048k8B1 provider

Open Weights

DeepInfra Llama 3 70B Instruct

2024-048k70B1 provider

Open Weights

Fireworks Llama-3-8B-Instruct

2024-048k8B1 provider

Open Weights

Llama 3 Models by AI at Meta

Details

Capabilities

Links

About

Current Variants

Release Timeline

Specifications(11 models)

Available From(20 providers)

Pricing

Popular comparisons in this family

Frequently Asked Questions

Related Model Families

Models(11)