OctoML Llama-2-70b-chat
OctoML Llama-2-70b-chat is worth evaluating for general LLM work when its provider route and context window match the workload.
Use it for
- Teams evaluating general LLM work
- Workloads that can use a 4k context window
- Buyers who need only 1 tracked provider route
Do not use it for
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
Specifications
- Family: Llama 2
- Released: 2023-07-18
- Context: 4k
- Parameters: 70B
- Architecture: Decoder only
- Knowledge cutoff: 2022-09
- Specialization: General
- Openness: Open weights
- License: Llama 2 Community
- Commercial use: Conditional
Large-scale open-source AI for social technologies.
Cheapest of 1 route · OctoML (Deprecated)
About
OctoML Llama-2-70b-chat is Meta's 70B-parameter Llama 2 chat model. It is an open-weight model, so the weights are available for self-hosting.
The structured metadata tracks a 4k-token context window. This page tracks one provider route, OctoML (Deprecated), listed at $0.40 input and $0.60 output per 1M tokens. No headline benchmark score is tracked for OctoML Llama-2-70b-chat yet.
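To make the context limit concrete, here is a minimal sketch of a pre-flight check that a request fits the window. It assumes "4k" means 4,096 tokens (the context length Llama 2 was released with) and that the prompt and the generated tokens share that budget. The function name `fits_context` and its parameters are hypothetical, and the token counts must come from whatever tokenizer your stack uses.

```python
# Assumption: the "4k" context listed on this page means 4,096 tokens.
CONTEXT_WINDOW_TOKENS = 4096


def fits_context(
    prompt_tokens: int,
    max_new_tokens: int,
    window: int = CONTEXT_WINDOW_TOKENS,
) -> bool:
    """Return True if the prompt plus the requested output fits in the window."""
    if prompt_tokens < 0 or max_new_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_new_tokens <= window


if __name__ == "__main__":
    # Hypothetical request sizes, for illustration only.
    print(fits_context(prompt_tokens=3500, max_new_tokens=500))
    print(fits_context(prompt_tokens=3800, max_new_tokens=512))
```

A check like this is most useful before sending long documents or multi-turn chat histories, which are the workloads most likely to exceed a 4k window.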
Top use-case fit
No primary decision-task fit is mapped for this model yet.
Provider price ladder
API pricing for the 1 tracked provider, covering input and output tokens, plus batch and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| OctoML (Deprecated) | $0.400 | $0.600 | Serverless |
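The per-1M-token prices in the table translate into a per-request cost as sketched below. This is an illustrative estimate only: the function name `estimate_cost` is hypothetical, the rates are copied from the table, and batch or cached-read discounts are not modeled because none are listed for this route.

```python
from decimal import Decimal

# Rates from the table, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.400")
OUTPUT_PRICE_PER_M = Decimal("0.600")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request at the listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Hypothetical request: 3,000 input tokens and 500 output tokens.
    print(estimate_cost(3000, 500))
```

`Decimal` is used instead of floats so that small per-request amounts add up without rounding drift when summed over many calls.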
Capabilities
No model capability flags are currently sourced.
Benchmark peer bars for Coding
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.