Last refreshed 2026-05-01. Next refresh: weekly.
Why use Colosseum 355B Instruct on NVIDIA NIM?
NVIDIA NIM offers Colosseum 355B Instruct with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.
Input / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced
Setup recipe
Docs fallbackInstall
Use the provider REST API or SDKAuth
Create a provider API keyCall
model: igenius/colosseum_355b_instruct_16kModel ID
igenius/colosseum_355b_instruct_16kRequest example
Curated snippets for this provider are not sourced yet. Use NVIDIA NIM documentation with model ID
igenius/colosseum_355b_instruct_16k.Gotchas
- Use provider model ID "igenius/colosseum_355b_instruct_16k", not the LLMReference slug "colosseum-355b-instruct".
Pricing
| Type | Rate |
|---|---|
| GPU Hour Rate | $1.00/GPU·hr |
Capabilities
No model capability flags are currently sourced.
About Colosseum 355B Instruct
Flagship instruction-tuned model optimized for Italian reasoning, coding, multilingual tasks, and long-context RAG. Supports complex document analysis and regulatory compliance reasoning.
FAQ
What is the context window for Colosseum 355B Instruct on NVIDIA NIM?
Colosseum 355B Instruct supports a 16,000 token context window on NVIDIA NIM.
What API model ID do I use for Colosseum 355B Instruct on NVIDIA NIM?
Use the model ID igenius/colosseum_355b_instruct_16k when calling NVIDIA NIM's API.
Who created Colosseum 355B Instruct?
Colosseum 355B Instruct was created by iGenius as part of the Colosseum model family.
Is Colosseum 355B Instruct open source?
Colosseum 355B Instruct is open source according to the seed data.
Model Specs
Released2025-01-01
Parameters355B
Context16K
ArchitectureDecoder Only