Last refreshed 2026-05-01. Next refresh: weekly.
Why use Llama 3.1 Swallow 8B Instruct on NVIDIA NIM?
NVIDIA NIM offers Llama 3.1 Swallow 8B Instruct with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.
Input / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced
Setup recipe
Docs fallbackInstall
Use the provider REST API or SDKAuth
Create a provider API keyCall
model: institute-of-science-tokyo/llama-3.1-swallow-8b-instruct-v0.1Model ID
institute-of-science-tokyo/llama-3.1-swallow-8b-instruct-v0.1Request example
Curated snippets for this provider are not sourced yet. Use NVIDIA NIM documentation with model ID
institute-of-science-tokyo/llama-3.1-swallow-8b-instruct-v0.1.Gotchas
- Use provider model ID "institute-of-science-tokyo/llama-3.1-swallow-8b-instruct-v0.1", not the LLMReference slug "llama-3.1-swallow-8b-instruct".
Pricing
| Type | Rate |
|---|---|
| GPU Hour Rate | $1.00/GPU·hr |
| GPU Config | 1xH100 |
Capabilities
No model capability flags are currently sourced.
About Llama 3.1 Swallow 8B Instruct
Japanese bilingual model from Institute of Science Tokyo, fine-tuned from Llama 3.1 8B.
FAQ
What is the context window for Llama 3.1 Swallow 8B Instruct on NVIDIA NIM?
Llama 3.1 Swallow 8B Instruct supports a 4,000 token context window on NVIDIA NIM.
What API model ID do I use for Llama 3.1 Swallow 8B Instruct on NVIDIA NIM?
Use the model ID institute-of-science-tokyo/llama-3.1-swallow-8b-instruct-v0.1 when calling NVIDIA NIM's API.
Who created Llama 3.1 Swallow 8B Instruct?
Llama 3.1 Swallow 8B Instruct was created by Tokyo Institute of Technology as part of the Swallow model family.
Is Llama 3.1 Swallow 8B Instruct open source?
Llama 3.1 Swallow 8B Instruct is open source according to the seed data.
Model Specs
Released2025-01-01
Parameters8B
Context4K
ArchitectureDecoder Only