Last refreshed 2026-06-29. Next refresh: weekly.
Why use Colosseum 355B Instruct on NVIDIA NIM?
NVIDIA NIM offers Colosseum 355B Instruct with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.
Setup recipe
Docs fallbackUse the provider REST API or SDKCreate a provider API keymodel: igenius/colosseum_355b_instruct_16kigenius/colosseum_355b_instruct_16kRequest example
igenius/colosseum_355b_instruct_16k.Gotchas
- Use provider model ID "igenius/colosseum_355b_instruct_16k", not the LLMReference slug "colosseum-355b-instruct".
Pricing
| Type | Rate |
|---|---|
| GPU Hour Rate | $1.00/GPU·hr |
Capabilities
No model capability flags are currently sourced.
About Colosseum 355B Instruct
Flagship instruction-tuned model optimized for Italian reasoning, coding, multilingual tasks, and long-context RAG. Supports complex document analysis and regulatory compliance reasoning.
FAQ
What is the context window for Colosseum 355B Instruct on NVIDIA NIM?
Colosseum 355B Instruct supports a 16k token context window on NVIDIA NIM.
What API model ID do I use for Colosseum 355B Instruct on NVIDIA NIM?
Use the model ID igenius/colosseum_355b_instruct_16k when calling NVIDIA NIM's API.
Who created Colosseum 355B Instruct?
Colosseum 355B Instruct was created by iGenius as part of the Colosseum model family.
Is Colosseum 355B Instruct open source?
Colosseum 355B Instruct has open weights under Llama 3 Community according to the seed data, but that does not necessarily mean an OSI-approved open-source license.