Llama 2 7B 32K
About
LLaMA-2-7B-32K is an open-source language model from Together, built on Meta's LLaMA-2 7B. It extends the context length to 32,000 tokens, improving its ability to handle tasks that depend on long-range context, such as multi-document question answering and long-text summarization. The model incorporates optimizations, including FlashAttention-2, to speed up inference and training. It combines pre-training with instruction-tuning data for better task performance and provides fine-tuning examples for specialized applications, such as book summarization or multi-document Q&A. The model represents a notable step forward for long-context open language models.
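As a rough illustration of how the extended context window might be used, the sketch below assembles a request body for Together's OpenAI-compatible completions endpoint. The endpoint URL, field names, and model identifier are assumptions based on common usage of the Together API, not details stated on this page.

```python
import json

# Assumed endpoint for Together's OpenAI-compatible completions API.
API_URL = "https://api.together.xyz/v1/completions"

def build_request(prompt: str, max_tokens: int = 256) -> dict:
    """Assemble a completion request body for the 32K-context model.

    The model id below is the commonly used Hugging Face repo name
    and is an assumption, not confirmed by this page.
    """
    return {
        "model": "togethercomputer/LLaMA-2-7B-32K",
        "prompt": prompt,
        "max_tokens": max_tokens,
        # With a 32,000-token context window, long documents
        # (e.g. several reports) can be passed in one prompt.
    }

body = build_request("Summarize the following report: ...")
payload = json.dumps(body)  # send with any HTTP client plus an API key
```

The request itself would then be POSTed to `API_URL` with an `Authorization` header carrying a Together API key.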
Capabilities
Providers(1)
| Provider | Input (per 1M) | Output (per 1M) | Type |
|---|---|---|---|
| Together AI API | $0.20 | $0.20 | Serverless |
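The per-token pricing above can be turned into a quick cost estimate. The helper below is a minimal sketch assuming the table's flat rate of $0.20 per 1M tokens for both input and output on the serverless tier.

```python
# Flat serverless rate from the provider table: $0.20 per 1M tokens,
# applied to input and output tokens alike.
PRICE_PER_MILLION = 0.20

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the table's rate."""
    return (input_tokens + output_tokens) / 1_000_000 * PRICE_PER_MILLION

# A full 32K-token prompt plus a 1K-token completion:
cost = estimate_cost(32_000, 1_000)  # → 0.0066 (about two thirds of a cent)
```

Even a maximal 32K-token prompt stays well under a cent per call at this rate.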