LLM Reference
Vultr

Vultr

Researched 3d ago

Vultr Holdings Corporation

CodingClassificationInference

Vultr offers 2 tracked models (1 with output token pricing). This catalog covers coding and classification; open any model detail page for benchmarks, batch tiers, and migration prompts.

Covers 2 workload areas across 2 tracked models; last verified 2026-06-29.

Use it for

  • Teams comparing token and batch pricing across this provider's models
  • Operators routing coding and classification workloads through this API

Do not use it for

  • Final benchmark picks without opening the relevant model detail page

Tracked models

2

Models available through this provider

Priced output routes

1

Models with output token pricing tracked

Cheapest output

$2.75

Mixtral 8x7B on this route

Batch-ready models

0

No batch pricing tracked

Latest model release

2023-12-11

934d since newest release

Freshness

2026-06-29

Researched 3d ago

fresh

Information

Models2
CompanyVultr Holdings Corporation
Founded2014
West Palm Beach, Florida, USA

Vultr is a cloud infrastructure company headquartered in West Palm Beach, Florida. The company provides infrastructure-as-a-service (IaaS) including bare metal servers, cloud servers, and GPU instances across a global network of 30+ data centers.

Catalog freshness

The newest model tracked on this provider was released 2023-12-11 (934d ago).

Where this host wins

  • Coding: 1 tracked model with SWE-bench / HumanEval-style scores.
  • Classification: 1 tracked model with MMLU-class moderation/safety coverage.

Getting started

Official product, docs, and pricing links — confirm quotas and regions in the vendor docs.

Compliance notes

No verified compliance claims (SOC 2, ISO, HIPAA) tracked for this provider yet — check the vendor's trust center for current certifications.

Platform Overview

Vultr offers cloud GPU infrastructure with NVIDIA H100, A100, and AMD MI355X instances for AI workloads. The platform uses hourly billing and supports users deploying their own LLM inference workloads. Does not host pre-trained LLM models or provide managed LLM APIs.

Compare per-model pricing, input and output token costs, batch availability, and benchmark coverage.

Available Models(2)

View all →

All models available as Serverless

ModelInput (per 1M)Output (per 1M)
Mixtral 8x7B$0.55$2.75
Mistral 7B v0.1

Where else to run this