Code Llama Models by AI at Meta
Details
Capabilities
About
Code Llama is a family of large language models developed by Meta AI, designed for code generation and understanding tasks. Building upon the Llama 2 architecture, Code Llama offers variations to accommodate diverse needs, featuring model sizes with 7B, 13B, 34B, and 70B parameters that adjust complexity and capability. The family comprises the foundational model for general code tasks, Code Llama - Python for tasks specific to the Python language, and Code Llama - Instruct, which is fine-tuned for interpreting natural language instructions. Trained on extensive code and related data, these models excel in code completion, debugging, and generating code from natural language prompts. Larger models tend to provide enhanced performance, albeit at the cost of increased computational demands 12.
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
Use when the workload needs 16k context, 70B parameters, and structured outputs.
Use when the workload needs 16k context and 70B parameters.
Use when the workload needs 100k context, 34B parameters, and structured outputs.
Use when the workload needs 100k context, 13B parameters, and structured outputs.
Use when the workload needs 100k context, 7B parameters, and structured outputs.
Use when the workload needs 100k context, 34B parameters, and structured outputs.
Use when the workload needs 100k context, 13B parameters, and structured outputs.
Use when the workload needs 100k context, 7B parameters, and structured outputs.
Use when the workload needs 16k context and 34B parameters.
Use when the workload needs 16k context and 13B parameters.
Use when the workload needs 16k context and 7B parameters.
Use when the workload needs 100k context, 34B parameters, and structured outputs.
Use when the workload needs 100k context and 70B parameters.
Use when the workload needs 100k context, 70B parameters, and structured outputs.
Use when the workload needs 16k context and 34B parameters.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| CodeLlama 70B Python | Use when the workload needs 16k context, 70B parameters, and structured outputs. | 2024-01 | 16k context70B parametersstructured outputs | Current |
| CodeLlama 70B Instruct | Use when the workload needs 16k context and 70B parameters. | 2024-01 | 16k context70B parameters | Current |
| CodeLlama 34B | Use when the workload needs 100k context, 34B parameters, and structured outputs. | 2023-08 | 100k context34B parametersstructured outputs | Current |
| CodeLlama 13B | Use when the workload needs 100k context, 13B parameters, and structured outputs. | 2023-08 | 100k context13B parametersstructured outputs | Current |
| CodeLlama 7B | Use when the workload needs 100k context, 7B parameters, and structured outputs. | 2023-08 | 100k context7B parametersstructured outputs | Current |
| CodeLlama 34B Python | Use when the workload needs 100k context, 34B parameters, and structured outputs. | 2023-08 | 100k context34B parametersstructured outputs | Current |
| CodeLlama 13B Python | Use when the workload needs 100k context, 13B parameters, and structured outputs. | 2023-08 | 100k context13B parametersstructured outputs | Current |
| CodeLlama 7B Python | Use when the workload needs 100k context, 7B parameters, and structured outputs. | 2023-08 | 100k context7B parametersstructured outputs | Current |
| CodeLlama 34B Instruct | Use when the workload needs 16k context and 34B parameters. | 2023-08 | 16k context34B parameters | Current |
| CodeLlama 13B Instruct | Use when the workload needs 16k context and 13B parameters. | 2023-08 | 16k context13B parameters | Current |
| CodeLlama 7B Instruct | Use when the workload needs 16k context and 7B parameters. | 2023-08 | 16k context7B parameters | Current |
| Together AI CodeLlama-34B-Instruct | Use when the workload needs 100k context, 34B parameters, and structured outputs. | 2023-07 | 100k context34B parametersstructured outputs | Current |
| OctoML CodeLlama-70b-Instruct | Use when the workload needs 100k context and 70B parameters. | 2023-07 | 100k context70B parameters | Current |
| DeepInfra CodeLlama 70B Instruct | Use when the workload needs 100k context, 70B parameters, and structured outputs. | 2023-07 | 100k context70B parametersstructured outputs | Current |
| Fireworks CodeLlama-34b-Instruct | Use when the workload needs 16k context and 34B parameters. | 2023-07 | 16k context34B parameters | Current |
Release Timeline
3 release groupsSpecifications(16 models)
| Model | Released | Context | Parameters | Structured Outputs |
|---|---|---|---|---|
| CodeLlama 70B Python | 2024-01 | 16k | 70B | Yes |
| CodeLlama 70B Instruct | 2024-01 | 16k | 70B | No |
| CodeLlama 34B | 2023-08 | 100k | 34B | Yes |
| CodeLlama 13B | 2023-08 | 100k | 13B | Yes |
| CodeLlama 7B | 2023-08 | 100k | 7B | Yes |
| CodeLlama 34B Python | 2023-08 | 100k | 34B | Yes |
| CodeLlama 13B Python | 2023-08 | 100k | 13B | Yes |
| CodeLlama 7B Python | 2023-08 | 100k | 7B | Yes |
| CodeLlama 34B Instruct | 2023-08 | 16k | 34B | No |
| CodeLlama 13B Instruct | 2023-08 | 16k | 13B | No |
| CodeLlama 7B Instruct | 2023-08 | 16k | 7B | No |
| Together AI CodeLlama-34B-Instruct | 2023-07 | 100k | 34B | Yes |
| OctoML CodeLlama-70b-Instruct | 2023-07 | 100k | 70B | No |
| DeepInfra CodeLlama 70B Instruct | 2023-07 | 100k | 70B | Yes |
| Fireworks CodeLlama-34b-Instruct | 2023-07 | 16k | 34B | No |
Available From(9 providers)
Pricing
Frequently Asked Questions
- What is Code Llama used for?
- Code Llama is used for coding and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
- How does Code Llama compare to Claude Fable?
- Code Llama by AI at Meta is strongest where you need coding, while Claude Fable by Anthropic is the closest related family to check for vision and multimodal work. Code Llama has 16 listed variants and reaches up to 100k context, while Claude Fable reaches up to 1m context, so compare the specs and pricing tables before choosing a production model.
- Which Code Llama model should I use?
- For the lowest listed input price, start with CodeLlama 7B through Replicate API at $0.05/1M input tokens. For the most capable/latest local choice, evaluate CodeLlama 34B with 100k context and structured outputs.
Models(16)
CodeLlama 70B Python
CodeLlama 70B Instruct
CodeLlama 34B
CodeLlama 13B
CodeLlama 7B
CodeLlama 34B Python
CodeLlama 13B Python
CodeLlama 7B Python
CodeLlama 34B Instruct
CodeLlama 13B Instruct
CodeLlama 7B Instruct
Together AI CodeLlama-34B-Instruct
OctoML CodeLlama-70b-Instruct
DeepInfra CodeLlama 70B Instruct
Fireworks CodeLlama-34b-Instruct




