Llama 3.1 Swallow 8B Instruct on NVIDIA NIM
Swallow · Tokyo Institute of Technology
Last refreshed 2026-06-30. Next refresh: weekly.
Why use Llama 3.1 Swallow 8B Instruct on NVIDIA NIM?
NVIDIA NIM offers Llama 3.1 Swallow 8B Instruct with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.
Setup recipe
Docs fallbackUse the provider REST API or SDKCreate a provider API keymodel: institute-of-science-tokyo/llama-3.1-swallow-8b-instruct-v0.1institute-of-science-tokyo/llama-3.1-swallow-8b-instruct-v0.1Request example
institute-of-science-tokyo/llama-3.1-swallow-8b-instruct-v0.1.Gotchas
- Use provider model ID "institute-of-science-tokyo/llama-3.1-swallow-8b-instruct-v0.1", not the LLMReference slug "llama-3.1-swallow-8b-instruct".
Pricing
| Type | Rate |
|---|---|
| GPU Hour Rate | $1.00/GPU·hr |
| GPU Config | 1xH100 |
Capabilities
No model capability flags are currently sourced.
About Llama 3.1 Swallow 8B Instruct
Japanese bilingual model from Institute of Science Tokyo, fine-tuned from Llama 3.1 8B.
FAQ
What is the context window for Llama 3.1 Swallow 8B Instruct on NVIDIA NIM?
Llama 3.1 Swallow 8B Instruct supports a 4k token context window on NVIDIA NIM.
What API model ID do I use for Llama 3.1 Swallow 8B Instruct on NVIDIA NIM?
Use the model ID institute-of-science-tokyo/llama-3.1-swallow-8b-instruct-v0.1 when calling NVIDIA NIM's API.
Who created Llama 3.1 Swallow 8B Instruct?
Llama 3.1 Swallow 8B Instruct was created by Tokyo Institute of Technology as part of the Swallow model family.
Is Llama 3.1 Swallow 8B Instruct open source?
Llama 3.1 Swallow 8B Instruct has open weights under Llama 2 Community according to the seed data, but that does not necessarily mean an OSI-approved open-source license.