LLM Reference

Holotron Models by H Company

H Company | NVIDIA Open Model | Open weights | Agents | Multimodal
2 models | 2026 | Up to 262k context

Details

Researcher: H Company
Commercial use: Allowed
Models: 2
Released: 2026
Max context: 262k

Capabilities

Vision: All models
Multimodal: All models

About

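Holotron is H Company's model series for high-throughput computer-use agents, built on the NVIDIA Nemotron architecture with Mamba-2 SSM layers. It offers native long-context support of up to 256K tokens (262,144 if K is read as 1,024, which matches the 262k listed for the largest variant) and is released under the NVIDIA Open Model License.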

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

Holotron-3-Nano: Use when the workload needs computer use, a 262k context window, and a 30B-parameter model.

2026-04 | computer use | 262k context | 30B parameters

Holotron-12B: Use when the workload needs computer use, a 131k context window, and a 12B-parameter model.

2026-03 | computer use | 131k context | 12B parameters

Release Timeline

2 release groups

2026-04 (1 current)
Holotron-3-Nano (Current): computer use | 262k context | 30B parameters

2026-03 (1 current)
Holotron-12B (Current): computer use | 131k context | 12B parameters

Specifications (2 models)

Holotron model specifications comparison

Model            Released  Context  Parameters  Vision  Multimodal
Holotron-3-Nano  2026-04   262k     30B         Yes     Yes
Holotron-12B     2026-03   131k     12B         Yes     Yes
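
The comparison above can be turned into a small selection helper. The following is a minimal sketch, not an official tool: the `ModelSpec` class and `pick_model` function are hypothetical names introduced here, the context values are kept in thousands of tokens exactly as listed, and the parameter counts are assumed to be in billions.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class ModelSpec:
    name: str
    released: str     # release month (YYYY-MM), as listed in the table
    context_k: int    # context window in thousands of tokens, as listed
    params_b: int     # parameter count; assumed to be in billions
    vision: bool
    multimodal: bool


# Values copied from the specifications table; nothing else is known here.
SPECS = (
    ModelSpec("Holotron-3-Nano", "2026-04", 262, 30, True, True),
    ModelSpec("Holotron-12B", "2026-03", 131, 12, True, True),
)


def pick_model(min_context_k: int,
               specs: Sequence[ModelSpec] = SPECS) -> Optional[ModelSpec]:
    """Return the smallest model (by parameters) whose context window is
    at least min_context_k thousand tokens, or None if no model fits."""
    candidates = [s for s in specs if s.context_k >= min_context_k]
    return min(candidates, key=lambda s: s.params_b, default=None)


if __name__ == "__main__":
    for need in (100, 200, 300):
        chosen = pick_model(need)
        print(need, chosen.name if chosen else "no listed model fits")
```

Preferring the smallest model that fits is only one possible policy; a workload that values capability over cost could sort by release date or parameter count instead.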

Frequently Asked Questions

What is Holotron used for?
Holotron is built for agent workloads, particularly computer use, with multimodal (vision) input. Both the family description and the listed model capabilities point to those workloads as the best fit.
How does Holotron compare to MOSS-Audio?
Holotron by H Company is the stronger fit for agent workloads, while MOSS-Audio by MOSI Intelligence is the closest related family to consider for multimodal work. Holotron lists 2 variants with up to 262k context, so compare the specification and pricing tables before choosing a production model.
Which Holotron model should I use?
If price is the main constraint, start with the pricing table, but note that the local data does not include complete provider pricing for Holotron. For the most capable and most recent option in the local data, evaluate Holotron-3-Nano, which offers 262k context and multimodal input.