llmreference
Together AI

Using CodeLlama 70B on Together AI

Implementation guide · Code Llama · AI at Meta

ServerlessOpen Source

Quick Start

  1. 1
    Create an account at Together AI and generate an API key.
  2. 2
    Use the Together AI SDK or REST API to call codellama-70b — see the documentation for request format.
  3. 3
    You'll be billed $0.90/1M input, $0.90/1M output tokens. See full pricing.

Code Examples

Install
pip install together
API key
TOGETHER_API_KEY
Model ID
codellama-70b

Together uses "organization/model-name" format, e.g. "meta-llama/Llama-4-Scout-17B-16E-Instruct" or "Qwen/QwQ-32B". See the Together model catalog for the exact ID.

from together import Together

client = Together()  # reads TOGETHER_API_KEY from env
response = client.chat.completions.create(
    model="codellama-70b",
    messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)

About Together AI

Platform for running open-source and proprietary LLMs

Together AI is a platform for running open-source and proprietary LLMs with fast serverless and dedicated endpoints at competitive inference pricing.

Pricing on Together AI

TypePrice (per 1M)
Input tokens$0.90
Output tokens$0.90

Capabilities

Structured Outputs

About CodeLlama 70B

CodeLlama 70B is a state-of-the-art generative text model by Meta, specifically designed for code synthesis and understanding. It utilizes an auto-regressive transformer architecture and has been fine-tuned with up to 16,000 tokens, supporting inference with up to 100,000 tokens. The model excels in code completion, infilling, and instruction following, making it versatile for various programming languages and applications. With 70 billion parameters, it offers advanced capabilities for general code generation tasks, while also providing specialized variants for Python and instruction-following. Intended for both commercial and research use, CodeLlama 70B aims to assist developers in generating code, understanding programming concepts, and enhancing productivity in software development .

Model Specs

Released2024-01-29
Parameters70B
Context16K
ArchitectureDecoder Only

Provider

Together AI
Together AI

San Francisco, California, United States