LLM Reference

Llama 2 Models by AI at Meta

AI at MetaLlama 2 CommunityOpen SourceHighlight
14 models2023Up to 4k ctxFrom $0.05/1M input

About

Llama 2, developed by Meta AI and released in July 2023, is a prominent family of large language models designed as an open-source alternative to proprietary chatbots. Its models, available in multiple sizes from 7 billion to 70 billion parameters, provide varying levels of accuracy balanced with computational efficiency. Llama 2 supports both research and commercial use, fostering greater accessibility and innovation in the AI community. Emphasizing safety and usefulness, the model employs techniques like reinforcement learning from human feedback (RLHF) and features specialized Llama 2-Chat models optimized for conversational applications. This makes it a versatile tool for various AI-driven tasks 1 2 3.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

13 in view1 retired

Use when the workload needs 4k context, 13B parameters, and structured outputs.

2023-074k context13B parametersstructured outputs

Use when the workload needs 4k context, 7B parameters, and structured outputs.

2023-074k context7B parametersstructured outputs

Use when the workload needs 4k context and 70B parameters.

2023-074k context70B parameters

Use when the workload needs 4k context and 13B parameters.

2023-074k context13B parameters
Llama 2 7BCurrent

Use when the workload needs 4k context and 7B parameters.

2023-074k context7B parameters

Use when the workload needs 4k context and 34B parameters.

2023-074k context34B parameters

Use when the workload needs 4k context, 7B parameters, and structured outputs.

2023-074k context7B parametersstructured outputs

Use when the workload needs 4k context, 13B parameters, and structured outputs.

2023-074k context13B parametersstructured outputs

Use when the workload needs 4k context, 70B parameters, and structured outputs.

2023-074k context70B parametersstructured outputs

Use when the workload needs 4k context and 70B parameters.

2023-074k context70B parameters

Use when the workload needs 4k context, 70B parameters, and structured outputs.

2023-074k context70B parametersstructured outputs

Use when the workload needs 4k context and 70B parameters.

2023-074k context70B parameters

Use when the workload needs 4k context and 70B parameters.

2023-074k context70B parameters

Release Timeline

1 release group
2023-07
13 current · 1 retired
Llama 2 13B
4k context13B parameters
Current
Llama 2 13B Chat
4k context13B parametersstructured outputs
Current
Llama 2 34B (Unreleased)
4k context34B parameters
Current
Llama 2 70B
4k context70B parameters
Current
Llama 2 70B Chat
4k context70B parametersstructured outputs
Archived
Llama 2 70B Chat on IBM Watsonx
4k context70B parameters
Current
Llama 2 7B
4k context7B parameters
Current
Llama 2 7B Chat
4k context7B parametersstructured outputs
Current
Meta Llama 2 Chat 70B
4k context70B parametersstructured outputs
Current
OctoML Llama-2-70b-chat
4k context70B parameters
Current
Together AI Llama-2-13B-chat
4k context13B parametersstructured outputs
Current
Together AI Llama-2-70B-chat
4k context70B parametersstructured outputs
Current
Together AI Llama-2-7B-chat
4k context7B parametersstructured outputs
Current
Vultr Llama 2 70B
4k context70B parameters
Current

Specifications(14 models)

Llama 2 model specifications comparison
ModelReleasedContextParametersStructured Outputs
Llama 2 13B Chat2023-074k13BYes
Llama 2 7B Chat2023-074k7BYes
Llama 2 70B2023-074k70BNo
Llama 2 13B2023-074k13BNo
Llama 2 7B2023-074k7BNo
Llama 2 34B (Unreleased)2023-074k34BNo
Together AI Llama-2-7B-chat2023-074k7BYes
Together AI Llama-2-13B-chat2023-074k13BYes
Together AI Llama-2-70B-chat2023-074k70BYes
OctoML Llama-2-70b-chat2023-074k70BNo
Meta Llama 2 Chat 70B2023-074k70BYes
Llama 2 70B Chat on IBM Watsonx2023-074k70BNo
Vultr Llama 2 70B2023-074k70BNo

Available From(17 providers)

Pricing

Llama 2 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Llama 2 7B ChatReplicate API$0.05$0.25Serverless
Llama 2 7B ChatDeepInfra$0.07$0.07Serverless
Llama 2 7B ChatLepton AI API$0.07$0.07Serverless
Llama 2 7B ChatGCP Vertex AI$0.08$0.24Serverless
Together AI Llama-2-7B-chatTogether AI$0.1$0.1Serverless
Llama 2 13B ChatReplicate API$0.1$0.5Serverless
Llama 2 13B ChatDeepInfra$0.13$0.13Serverless
Llama 2 13B ChatLepton AI API$0.13$0.13Serverless
Together AI Llama-2-13B-chatTogether AI$0.15$0.15Serverless
Llama 2 13B ChatGCP Vertex AI$0.16$0.48Serverless
Llama 2 70B Chat on IBM WatsonxIBM watsonx$0.185$0.185Serverless
Llama 2 7B ChatFireworks AI$0.2$0.2Provisioned
Llama 2 7B ChatTogether AI$0.2$0.2Serverless
Llama 2 13BFireworks AI$0.2$0.2Serverless
Llama 2 13B ChatFireworks AI$0.2$0.2Serverless
Llama 2 7BFireworks AI$0.2$0.2Serverless
Llama 2 13B ChatTogether AI$0.3$0.3Serverless
OctoML Llama-2-70b-chatOctoML (Deprecated)$0.4$0.6Serverless
Together AI Llama-2-70B-chatTogether AI$0.5$0.6Serverless
Llama 2 7B ChatMicrosoft Foundry$0.52$0.67Serverless
Llama 2 13B ChatIBM watsonx$0.6$0.6Serverless
Llama 2 13B ChatAWS Bedrock$0.75$1Serverless
Llama 2 13B ChatMicrosoft Foundry$0.81$0.94Serverless
Llama 2 70BFireworks AI$0.9$0.9Serverless
Llama 2 13B ChatDatabricks Foundation Model Serving$0.95$0.95Serverless
Meta Llama 2 Chat 70BAWS Bedrock$2.1$2.8Serverless

Frequently Asked Questions

What is Llama 2 used for?
Llama 2 is used for structured outputs and chatbot and role-playing use cases. The family description and listed model capabilities point to those workloads as the best fit.
How does Llama 2 compare to MOSS-Audio?
Llama 2 by AI at Meta is strongest where you need structured outputs, while MOSS-Audio by MOSI Intelligence is the closest related family to check for multimodal. Llama 2 has 14 listed variants and reaches up to 4k context, so compare the specs and pricing tables before choosing a production model.
Which Llama 2 model should I use?
For the lowest listed input price, start with Llama 2 7B Chat through Replicate API at $0.05/1M input tokens. For the most capable/latest local choice, evaluate Together AI Llama-2-7B-chat with 4k context and structured outputs.

Models(14)