Together Llama 2 Models by Together.ai
About
The Together Llama 2 family of large language models extends Meta's Llama 2 while keeping its open-source nature and its availability for research and commercial use [1][4]. The models raise the context length to 32K tokens, up from Llama 2's original 4K, which enables more complex tasks such as multi-document question answering and long-form text summarization [4][5]. The family includes Llama-2-7B-32K-Instruct, which is tuned on high-quality instruction and conversational data for better performance in dialogue-based interactions [1][5]. Optimizations such as FlashAttention-2 make training and inference faster and more resource-efficient [4]. Together AI also publishes training recipes and datasets, which supports community work on open-source LLMs [5].
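To make the 32K versus 4K difference concrete, the following minimal sketch checks whether a multi-document question-answering prompt fits in each context window. It assumes that "32K" and "4K" mean 32 × 1024 and 4 × 1024 tokens; the function name and the per-document token counts are hypothetical, and real counts would come from the model's tokenizer.

```python
# Context windows in tokens. Assumption: "32K" = 32 * 1024 and "4K" = 4 * 1024.
CONTEXT_32K = 32 * 1024
CONTEXT_4K = 4 * 1024


def fits_in_context(doc_token_counts, question_tokens, reserved_output_tokens, context_window):
    """Return True if the documents, the question, and the reserved
    output budget together fit inside the context window."""
    total = sum(doc_token_counts) + question_tokens + reserved_output_tokens
    return total <= context_window


# Hypothetical token counts for three source documents.
docs = [6000, 7500, 5200]

fits_32k = fits_in_context(docs, question_tokens=100, reserved_output_tokens=512,
                           context_window=CONTEXT_32K)
fits_4k = fits_in_context(docs, question_tokens=100, reserved_output_tokens=512,
                          context_window=CONTEXT_4K)
print(fits_32k, fits_4k)
```

Reserving part of the window for the generated answer matters because the prompt and the output share the same token budget.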
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Llama 2 7B 32K | Use when the workload needs 32k context, 7B parameters, and structured outputs. | 2023-07 | 32k context, 7B parameters, structured outputs | Current |
Release Timeline
1 release group
Specifications (1 model)
| Model | Released | Context | Parameters | Structured Outputs |
|---|---|---|---|---|
| Llama 2 7B 32K | 2023-07 | 32k | 7B | Yes |
Available From (1 provider)
Pricing
| Model | Provider | Input / 1M tokens | Output / 1M tokens | Type |
|---|---|---|---|---|
| Llama 2 7B 32K | Together AI | $0.2 | $0.2 | Serverless |
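As a rough illustration of how the per-million-token prices above translate into a per-request cost, here is a minimal sketch. It assumes the listed prices are in US dollars and that billing is linear in token count; the function name and the example token counts are hypothetical.

```python
from decimal import Decimal

# Prices from the table above, assumed to be USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.2")
OUTPUT_PRICE_PER_M = Decimal("0.2")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate the cost of one request, assuming linear per-token billing."""
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    return (input_cost + output_cost) / TOKENS_PER_M


# Hypothetical request: a 30,000-token prompt and a 1,000-token answer.
cost = estimate_cost_usd(30_000, 1_000)
print(f"Estimated cost: ${cost}")
```

`Decimal` is used instead of floats so that small per-request amounts add up without rounding drift when summed over many requests.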
Frequently Asked Questions
- What is Together Llama 2 used for?
  - The listed model capabilities point to workloads that need structured outputs. The family description adds long-context tasks such as multi-document question answering and long-form text summarization.
- How does Together Llama 2 compare to Together General?
  - Together Llama 2 has one listed variant, reaches up to 32k context, and supports structured outputs. Together General, the closest related family by Together.ai, reaches up to 4k context. Compare the specifications and pricing tables of both families before choosing a production model.
- Which Together Llama 2 model should I use?
  - Llama 2 7B 32K is the only listed variant. It has the lowest listed input price, $0.2 per 1M input tokens through Together AI, and it is also the most capable and latest option, with 32k context and structured outputs.




