LLM Reference

OctoML Llama-2-70b-chat

Released
2023-07-18
Last refreshed
2026-05-19
Status
Researched 30d ago
Open weightsCommercial use: conditional

OctoML Llama-2-70b-chat is worth evaluating for general LLM work when its provider route and context window match the workload.

Use it for

  • Teams evaluating general LLM work
  • Workloads that can use a 4k context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows
Specifications
Family
Llama 2
Released
2023-07-18
Context
4k
Parameters
70B
Architecture
Decoder Only
Knowledge cutoff
2022-09
Specialization
general
Openness
Open weights
License
Llama 2 CommunityCommercial use: conditional
Created by

Large-scale open-source AI for social technologies.

Menlo Park, California, United States
Founded 2013
Website
Pricing
Output / 1M
$0.600
Input / 1M
$0.400

Cheapest of 1 route · OctoML (Deprecated)

About

OctoML Llama-2-70b-chat is Meta's Llama 2 model. Weights are openly available for self-hosting.

OctoML Llama-2-70b-chat is an open-weight model in the Llama 2 family. The structured metadata tracks a 4k-token context window. This page tracks provider routes through OctoML (Deprecated), with the cheapest tracked route listed at $0.4 input and $0.6 output per 1M tokens. No headline benchmark score is tracked for OctoML Llama-2-70b-chat yet.

Top use-case fit

No primary decision-task fit is mapped for this model yet.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
OctoML (Deprecated)$0.400$0.600
Serverless

Capabilities

No model capability flags are currently sourced.

Benchmark peer barsfor Coding

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.