Using CodeLlama 70B Python on Together AI
Implementation guide · Code Llama · AI at Meta
Quick Start
- 1
- 2Use the Together AI SDK or REST API to call
codellama-70b-python— see the documentation for request format. - 3
Code Examples
pip install togetherTOGETHER_API_KEYcodellama-70b-pythonTogether uses "organization/model-name" format, e.g. "meta-llama/Llama-4-Scout-17B-16E-Instruct" or "Qwen/QwQ-32B". See the Together model catalog for the exact ID.
from together import Together
client = Together() # reads TOGETHER_API_KEY from env
response = client.chat.completions.create(
model="codellama-70b-python",
messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)About Together AI
Platform for running open-source and proprietary LLMs
Together AI is a platform for running open-source and proprietary LLMs with fast serverless and dedicated endpoints at competitive inference pricing.
Pricing on Together AI
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.90 |
| Output tokens | $0.90 |
Capabilities
About CodeLlama 70B Python
CodeLlama 70B Python is a specialized AI model by Meta, designed for Python code synthesis and understanding. With 70 billion parameters, it excels in code completion, infilling, and instruction following tasks. The model leverages an optimized transformer architecture and has been fine-tuned with up to 16,000 tokens, making it particularly effective for Python-centric development workflows. While it doesn't support long contexts of 100,000 tokens, it offers powerful capabilities for both commercial and research applications in Python programming environments. More details can be found in the research paper "Code Llama: Open Foundation Models for Code" .