Llama 3.1 Swallow 70B Instruct

Name: Llama 3.1 Swallow 70B Instruct
Author: Tokyo Institute of Technology

Released

2025-01-01

Last refreshed

2026-06-30

Status

Researched 182d ago

Open weightsCommercial use: conditional

Llama 3.1 Swallow 70B Instruct is worth evaluating for general LLM work when its provider route and context window match the workload.

Use it for

Teams evaluating general LLM work
Workloads that can use a 4k context window
Buyers comparing 1 tracked provider route

Do not use it for

Vision or document-understanding workloads
Strict JSON or tool-calling flows

Specifications

Family: Swallow
Released: 2025-01-01
Context: 4k
Parameters: 70B
Architecture: Decoder Only
Knowledge cutoff: 2023
Specialization: general
Openness: Open weights
License: Llama 2 CommunityCommercial use: conditional
Training: Pretrained

Created by

Tokyo Institute of Technology

Integrating AI with advanced robotics

Tokyo, Japan

Founded 1881

Website

Pricing

Output / 1M

Input / 1M

Cheapest of 1 route · NVIDIA NIM

Providers(1)

NVIDIA NIM

View 1 provider route

About

Japanese bilingual model from Institute of Science Tokyo, fine-tuned from Llama 3.1.

Llama 3.1 Swallow 70B Instruct is an open-weight model in the Swallow family. The structured metadata tracks a 4k-token context window. This page tracks provider routes through NVIDIA NIM. No headline benchmark score is tracked for Llama 3.1 Swallow 70B Instruct yet.

Top use-case fit

No primary decision-task fit is mapped for this model yet.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
NVIDIA NIM	-	-	ServerlessPartial

Available via routers & gateways(1)

NVIDIA LLM Router Blueprint

Router

NVIDIA's open-source AI blueprint for LLM routing that selects the optimal model per prompt via intent classification or neural auto-routing; being deprecated 2026-06-20.

Free OSSNVIDIA NIM