LLM Reference

Llama 2 7B 32K

Released
2023-07-18
Last refreshed
2026-05-11
Status
Researched 46d ago
Open SourceClassificationJSON / Tool use

Llama 2 7B 32K is worth evaluating for classification and json / tool use when its provider route and context window match the workload.

Use it for

  • Teams evaluating classification and json / tool use
  • Workloads that can use a 32k context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Vision or document-understanding workloads
Specifications
Released
2023-07-18
Context
32k
Parameters
7B
Architecture
Decoder Only
Knowledge cutoff
2022-09
Specialization
general
Training
finetuned
Created by

Blazing-fast, cost-effective AI inference solutions

San Francisco, California, United States
Founded 2022
Website
Pricing
Output / 1M
$0.200
Input / 1M
$0.200

Cheapest of 1 route · Together AI

About

LLaMA-2-7B-32K is an open-source language model engineered by Together, derived from Meta's LLaMA-2 7B. It boasts a unique extended context length of up to 32,000 tokens, which enhances its ability to tackle tasks involving long-range context, such as multi-document question answering and lengthy text summarization. The model integrates optimizations, including FlashAttention-2, to boost inference and training efficiency. It combines pre-training with instruction tuning data for improved task performance and offers fine-tuning examples for specialized applications, like book summarization or multi-document Q&A. This model marks a substantial progress in the domain of large language models, serving as a potent tool for natural language processing tasks 1311.

Llama 2 7B 32K is an open-source model in the Together Llama 2 family. The structured metadata tracks a 32k-token context window and structured outputs. This page tracks provider routes through Together AI, with the cheapest tracked route listed at $0.2 input and $0.2 output per 1M tokens. No headline benchmark score is tracked for Llama 2 7B 32K yet.

Top use-case fit

Classification

Included by capability and metadata signals in the decision map.

JSON / Tool use

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
Together AI$0.200$0.200
Serverless

Capabilities

Structured Outputs

Benchmark peer barsfor Classification

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(8)