LLM ReferenceLLM Reference
NVIDIA NIM

Colosseum 355B Instruct on NVIDIA NIM

Colosseum · iGenius

Serverless

Last refreshed 2026-05-01. Next refresh: weekly.

Why use Colosseum 355B Instruct on NVIDIA NIM?

NVIDIA NIM offers Colosseum 355B Instruct with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.

Input / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: igenius/colosseum_355b_instruct_16k
Model ID
igenius/colosseum_355b_instruct_16k

Request example

Curated snippets for this provider are not sourced yet. Use NVIDIA NIM documentation with model ID igenius/colosseum_355b_instruct_16k.

Gotchas

  • Use provider model ID "igenius/colosseum_355b_instruct_16k", not the LLMReference slug "colosseum-355b-instruct".

Pricing

TypeRate
GPU Hour Rate$1.00/GPU·hr

Capabilities

No model capability flags are currently sourced.

About Colosseum 355B Instruct

Flagship instruction-tuned model optimized for Italian reasoning, coding, multilingual tasks, and long-context RAG. Supports complex document analysis and regulatory compliance reasoning.

FAQ

What is the context window for Colosseum 355B Instruct on NVIDIA NIM?

Colosseum 355B Instruct supports a 16,000 token context window on NVIDIA NIM.

What API model ID do I use for Colosseum 355B Instruct on NVIDIA NIM?

Use the model ID igenius/colosseum_355b_instruct_16k when calling NVIDIA NIM's API.

Who created Colosseum 355B Instruct?

Colosseum 355B Instruct was created by iGenius as part of the Colosseum model family.

Is Colosseum 355B Instruct open source?

Colosseum 355B Instruct is open source according to the seed data.

Get Started

Model Specs

Released2025-01-01
Parameters355B
Context16K
ArchitectureDecoder Only