What is Llama 2 used for?

Llama 2 is used for structured outputs and chatbot and role-playing use cases. The family description and listed model capabilities point to those workloads as the best fit.

How does Llama 2 compare to MOSS-Audio?

Llama 2 by AI at Meta is strongest where you need structured outputs, while MOSS-Audio by MOSI AI is the closest related family to check for multimodal. Llama 2 has 14 listed variants and reaches up to 4k context, so compare the specs and pricing tables before choosing a production model.

Which Llama 2 model should I use?

For the lowest listed input price, start with Llama 2 7B Chat through Replicate API at $0.05/1M input tokens. For the most capable/latest local choice, evaluate Together AI Llama-2-7B-chat with 4k context and structured outputs.

Llama 2 Models by AI at Meta

AI at MetaLlama 2 CommunityOpen weightsOpen SourceHighlight

14 models2023Up to 4k ctxFrom $0.05/1M input

Details

ResearcherAI at Meta

LicenseLlama 2 Community

Commercial useCommercial use: conditional

Models14

Released2023

Max context4k

Capabilities

Structured Outputs7 of 14 models

Links

Website HuggingFace

About

Llama 2, developed by Meta AI and released in July 2023, is a prominent family of large language models designed as an open-source alternative to proprietary chatbots. Its models, available in multiple sizes from 7 billion to 70 billion parameters, provide varying levels of accuracy balanced with computational efficiency. Llama 2 supports both research and commercial use, fostering greater accessibility and innovation in the AI community. Emphasizing safety and usefulness, the model employs techniques like reinforcement learning from human feedback (RLHF) and features specialized Llama 2-Chat models optimized for conversational applications. This makes it a versatile tool for various AI-driven tasks 1 2 3.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

13 in view1 retired

Llama 2 13B ChatCurrent

Use when the workload needs 4k context, 13B parameters, and structured outputs.

2023-074k context13B parametersstructured outputs

Llama 2 7B ChatCurrent

Use when the workload needs 4k context, 7B parameters, and structured outputs.

2023-074k context7B parametersstructured outputs

Llama 2 70BCurrent

Use when the workload needs 4k context and 70B parameters.

2023-074k context70B parameters

Llama 2 13BCurrent

Use when the workload needs 4k context and 13B parameters.

2023-074k context13B parameters

Llama 2 7BCurrent

Use when the workload needs 4k context and 7B parameters.

2023-074k context7B parameters

Llama 2 34B (Unreleased)Current

Use when the workload needs 4k context and 34B parameters.

2023-074k context34B parameters

Together AI Llama-2-7B-chatCurrent

Use when the workload needs 4k context, 7B parameters, and structured outputs.

2023-074k context7B parametersstructured outputs

Together AI Llama-2-13B-chatCurrent

Use when the workload needs 4k context, 13B parameters, and structured outputs.

2023-074k context13B parametersstructured outputs

Together AI Llama-2-70B-chatCurrent

Use when the workload needs 4k context, 70B parameters, and structured outputs.

2023-074k context70B parametersstructured outputs

OctoML Llama-2-70b-chatCurrent

Use when the workload needs 4k context and 70B parameters.

2023-074k context70B parameters

Meta Llama 2 Chat 70BCurrent

Use when the workload needs 4k context, 70B parameters, and structured outputs.

2023-074k context70B parametersstructured outputs

Llama 2 70B Chat on IBM WatsonxCurrent

Use when the workload needs 4k context and 70B parameters.

2023-074k context70B parameters

Vultr Llama 2 70BCurrent

Use when the workload needs 4k context and 70B parameters.

2023-074k context70B parameters

Current Llama 2 variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
Llama 2 13B Chat	Use when the workload needs 4k context, 13B parameters, and structured outputs.	2023-07	4k context13B parametersstructured outputs	Current
Llama 2 7B Chat	Use when the workload needs 4k context, 7B parameters, and structured outputs.	2023-07	4k context7B parametersstructured outputs	Current
Llama 2 70B	Use when the workload needs 4k context and 70B parameters.	2023-07	4k context70B parameters	Current
Llama 2 13B	Use when the workload needs 4k context and 13B parameters.	2023-07	4k context13B parameters	Current
Llama 2 7B	Use when the workload needs 4k context and 7B parameters.	2023-07	4k context7B parameters	Current
Llama 2 34B (Unreleased)	Use when the workload needs 4k context and 34B parameters.	2023-07	4k context34B parameters	Current
Together AI Llama-2-7B-chat	Use when the workload needs 4k context, 7B parameters, and structured outputs.	2023-07	4k context7B parametersstructured outputs	Current
Together AI Llama-2-13B-chat	Use when the workload needs 4k context, 13B parameters, and structured outputs.	2023-07	4k context13B parametersstructured outputs	Current
Together AI Llama-2-70B-chat	Use when the workload needs 4k context, 70B parameters, and structured outputs.	2023-07	4k context70B parametersstructured outputs	Current
OctoML Llama-2-70b-chat	Use when the workload needs 4k context and 70B parameters.	2023-07	4k context70B parameters	Current
Meta Llama 2 Chat 70B	Use when the workload needs 4k context, 70B parameters, and structured outputs.	2023-07	4k context70B parametersstructured outputs	Current
Llama 2 70B Chat on IBM Watsonx	Use when the workload needs 4k context and 70B parameters.	2023-07	4k context70B parameters	Current
Vultr Llama 2 70B	Use when the workload needs 4k context and 70B parameters.	2023-07	4k context70B parameters	Current

Release Timeline

1 release group

2023-07

13 current · 1 retired

Llama 2 13B

4k context13B parameters

Current

Llama 2 13B Chat

4k context13B parametersstructured outputs

Current

Llama 2 34B (Unreleased)

4k context34B parameters

Current

Llama 2 70B

4k context70B parameters

Current

Llama 2 70B Chat

4k context70B parametersstructured outputs

Archived

Llama 2 70B Chat on IBM Watsonx

4k context70B parameters

Current

Llama 2 7B

4k context7B parameters

Current

Llama 2 7B Chat

4k context7B parametersstructured outputs

Current

Meta Llama 2 Chat 70B

4k context70B parametersstructured outputs

Current

OctoML Llama-2-70b-chat

4k context70B parameters

Current

Together AI Llama-2-13B-chat

4k context13B parametersstructured outputs

Current

Together AI Llama-2-70B-chat

4k context70B parametersstructured outputs

Current

Together AI Llama-2-7B-chat

4k context7B parametersstructured outputs

Current

Vultr Llama 2 70B

4k context70B parameters

Current

Specifications(14 models)

Llama 2 model specifications comparison
Model	Released	Context	Parameters	Structured Outputs
Llama 2 13B Chat	2023-07	4k	13B	Yes
Llama 2 7B Chat	2023-07	4k	7B	Yes
Llama 2 70B	2023-07	4k	70B	No
Llama 2 13B	2023-07	4k	13B	No
Llama 2 7B	2023-07	4k	7B	No
Llama 2 34B (Unreleased)	2023-07	4k	34B	No
Together AI Llama-2-7B-chat	2023-07	4k	7B	Yes
Together AI Llama-2-13B-chat	2023-07	4k	13B	Yes
Together AI Llama-2-70B-chat	2023-07	4k	70B	Yes
OctoML Llama-2-70b-chat	2023-07	4k	70B	No
Meta Llama 2 Chat 70B	2023-07	4k	70B	Yes
Llama 2 70B Chat on IBM Watsonx	2023-07	4k	70B	No
Vultr Llama 2 70B	2023-07	4k	70B	No

Available From(17 providers)

Alibaba Cloud PAI-EAS

AWS Bedrock

Baseten API

Cloudflare Workers AI

Databricks Foundation Model Serving

DeepInfra

Fireworks AI

GCP Vertex AI +9 more

Pricing

Llama 2 model pricing by provider
Model	Provider	Input / 1M	Output / 1M	Type
Llama 2 7B Chat	Replicate API	$0.05	$0.25	Serverless
Llama 2 7B Chat	DeepInfra	$0.07	$0.07	Serverless
Llama 2 7B Chat	Lepton AI API	$0.07	$0.07	Serverless
Llama 2 7B Chat	GCP Vertex AI	$0.08	$0.24	Serverless
Together AI Llama-2-7B-chat	Together AI	$0.1	$0.1	Serverless
Llama 2 13B Chat	Replicate API	$0.1	$0.5	Serverless
Llama 2 13B Chat	DeepInfra	$0.13	$0.13	Serverless
Llama 2 13B Chat	Lepton AI API	$0.13	$0.13	Serverless
Together AI Llama-2-13B-chat	Together AI	$0.15	$0.15	Serverless
Llama 2 13B Chat	GCP Vertex AI	$0.16	$0.48	Serverless
Llama 2 70B Chat on IBM Watsonx	IBM watsonx	$0.185	$0.185	Serverless
Llama 2 7B Chat	Fireworks AI	$0.2	$0.2	Provisioned
Llama 2 7B Chat	Together AI	$0.2	$0.2	Serverless
Llama 2 13B	Fireworks AI	$0.2	$0.2	Serverless
Llama 2 13B Chat	Fireworks AI	$0.2	$0.2	Serverless
Llama 2 7B	Fireworks AI	$0.2	$0.2	Serverless
Llama 2 13B Chat	Together AI	$0.3	$0.3	Serverless
OctoML Llama-2-70b-chat	OctoML (Deprecated)	$0.4	$0.6	Serverless
Together AI Llama-2-70B-chat	Together AI	$0.5	$0.6	Serverless
Llama 2 7B Chat	Microsoft Foundry	$0.52	$0.67	Serverless
Llama 2 13B Chat	IBM watsonx	$0.6	$0.6	Serverless
Llama 2 13B Chat	AWS Bedrock	$0.75	$1	Serverless
Llama 2 13B Chat	Microsoft Foundry	$0.81	$0.94	Serverless
Llama 2 70B	Fireworks AI	$0.9	$0.9	Serverless
Llama 2 13B Chat	Databricks Foundation Model Serving	$0.95	$0.95	Serverless
Meta Llama 2 Chat 70B	AWS Bedrock	$2.1	$2.8	Serverless

Popular comparisons in this family

Frequently Asked Questions

What is Llama 2 used for?: Llama 2 is used for structured outputs and chatbot and role-playing use cases. The family description and listed model capabilities point to those workloads as the best fit.
How does Llama 2 compare to MOSS-Audio?: Llama 2 by AI at Meta is strongest where you need structured outputs, while MOSS-Audio by MOSI AI is the closest related family to check for multimodal. Llama 2 has 14 listed variants and reaches up to 4k context, so compare the specs and pricing tables before choosing a production model.
Which Llama 2 model should I use?: For the lowest listed input price, start with Llama 2 7B Chat through Replicate API at $0.05/1M input tokens. For the most capable/latest local choice, evaluate Together AI Llama-2-7B-chat with 4k context and structured outputs.

Models(14)

Llama 2 13B Chat

2023-074k13B11 providers

Open Weights

Llama 2 7B Chat

2023-074k7B10 providers

Open Weights

Llama 2 70B

2023-074k70B1 provider

Open Weights

Llama 2 13B

2023-074k13B1 provider

Open Weights

Llama 2 7B

2023-074k7B1 provider

Open Weights

Llama 2 34B (Unreleased)

2023-074k34B

Open Weights

Together AI Llama-2-7B-chat

2023-074k7B1 provider

Open Weights

Together AI Llama-2-13B-chat

2023-074k13B1 provider

Open Weights

Together AI Llama-2-70B-chat

2023-074k70B1 provider

Open Weights

OctoML Llama-2-70b-chat

2023-074k70B1 provider

Open Weights

Meta Llama 2 Chat 70B

2023-074k70B1 provider

Open Weights

Llama 2 70B Chat on IBM Watsonx

2023-074k70B1 provider

Open Weights

Vultr Llama 2 70B

2023-074k70B

Open Weights

Llama 2 Models by AI at Meta

Details

Capabilities

Links

About

Current Variants

Release Timeline

Specifications(14 models)

Available From(17 providers)

Pricing

Popular comparisons in this family

Frequently Asked Questions

Related Model Families

Models(14)