LLM Reference

CodeLlama 70B

Released: 2024-01-29
Last refreshed: 2026-05-16
Status: Researched 55d ago
Deprecated · Open Weights · Commercial use with conditions · Classification · JSON / Tool use

CodeLlama 70B is a legacy integration reference; keep it only while you identify a current replacement.

Use it for

  • Teams maintaining an existing integration
  • Workloads that can use a 16k context window
  • Buyers comparing 4 tracked provider routes

Do not use it for

  • New production launches
  • Vision or document-understanding workloads

Specifications

Released: 2024-01-29
Context: 16k
Parameters: 70B
Architecture: Decoder Only
Specialization: general
Openness: Open weights
License: Llama 2 Community (commercial use with conditions)
Training: finetuned

Created by

Meta
Large-scale open-source AI for social technologies.
Menlo Park, California, United States
Founded 2013

Pricing

Output / 1M: $0.650
Input / 1M: $0.450

Cheapest of 6 routes · DeepInfra

About

CodeLlama 70B is a generative text model by Meta, designed for code synthesis and understanding. It uses an auto-regressive transformer architecture and was fine-tuned on sequences of up to 16,000 tokens; the Code Llama family is described as supporting inference with up to 100,000 tokens. The model handles code completion, infilling, and instruction following across a range of programming languages. At 70 billion parameters, the base model targets general code generation, and the family also includes specialized Python and instruction-following variants. It is intended for both commercial and research use, helping developers generate code, understand programming concepts, and work more productively.

CodeLlama 70B is an open-weight model in the Code Llama family. The structured metadata tracks a 16k-token context window and structured outputs. This page tracks provider routes through Together AI, NVIDIA NIM, DeepInfra, and 3 more, with the cheapest tracked route listed at $0.45 input and $0.65 output per 1M tokens. No headline benchmark score is tracked for CodeLlama 70B yet.

Top use-case fit

Classification

Included by capability and metadata signals in the decision map.

JSON / Tool use

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

Provider        Input / 1M   Output / 1M   Route
DeepInfra       $0.450       $0.650        Serverless
Fireworks AI    $0.900       $0.900        Provisioned
Together AI     $0.900       $0.900        Serverless
Replicate API   $0.650       $2.75         Serverless
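
To compare these routes for a specific workload, the per-1M prices can be turned into a cost estimate. The following is a minimal sketch, assuming the four listed prices are current and that billing is strictly linear in tokens; the names `Route`, `estimate_cost`, and `rank_routes` are illustrative and not part of any provider API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    provider: str
    input_per_1m: float   # USD per 1M input tokens (from the price ladder)
    output_per_1m: float  # USD per 1M output tokens (from the price ladder)
    kind: str             # "Serverless" or "Provisioned"


# Prices copied from the provider price ladder above.
ROUTES = [
    Route("DeepInfra", 0.450, 0.650, "Serverless"),
    Route("Fireworks AI", 0.900, 0.900, "Provisioned"),
    Route("Together AI", 0.900, 0.900, "Serverless"),
    Route("Replicate API", 0.650, 2.75, "Serverless"),
]


def estimate_cost(route: Route, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost, assuming purely linear per-token billing."""
    return (
        input_tokens * route.input_per_1m
        + output_tokens * route.output_per_1m
    ) / 1_000_000


def rank_routes(input_tokens: int, output_tokens: int) -> list[tuple[str, float]]:
    """Return (provider, estimated cost) pairs, cheapest first."""
    costs = [(r.provider, estimate_cost(r, input_tokens, output_tokens)) for r in ROUTES]
    return sorted(costs, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500k output tokens.
    for provider, cost in rank_routes(2_000_000, 500_000):
        print(f"{provider}: ${cost:.3f}")
```

Because output tokens are priced differently from input tokens on some routes, the ranking can shift for output-heavy workloads, so it is worth rerunning the estimate with a realistic token mix. Provisioned routes may also carry costs that a per-token estimate does not capture.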

Available via routers & gateways (7)

Capabilities

Structured Outputs
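
When structured outputs are used for classification, it is usually worth validating the reply before trusting it. The following is a minimal sketch, assuming the model was asked to return a JSON object with a single `label` field; the label set and the `parse_classification` helper are hypothetical and not tied to any provider SDK.

```python
import json

# Hypothetical label set for illustration only.
ALLOWED_LABELS = {"bug", "feature", "question"}


def parse_classification(reply: str) -> str:
    """Parse a model reply and return its label, or raise ValueError."""
    try:
        data = json.loads(reply)
    except json.JSONDecodeError as exc:
        raise ValueError(f"reply is not valid JSON: {exc}") from exc

    if not isinstance(data, dict):
        raise ValueError("reply must be a JSON object")

    label = data.get("label")
    # Check the type first so unhashable values (lists, dicts) fail cleanly.
    if not isinstance(label, str) or label not in ALLOWED_LABELS:
        raise ValueError(f"unexpected label: {label!r}")
    return label
```

A caller might retry the request or fall back to a default label when `ValueError` is raised, rather than passing an unvalidated reply downstream.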

Benchmark peer bars for Classification

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.