LLM Reference

vLLM

Researched todayInference PlatformTier 3

vllm-project

InferenceOpen SourceSelf-Hosted

vLLM does not have tracked models in LLMReference yet — open the provider docs link above or browse the models index for adjacent hosts.

Portfolio context: 0 decision-task tags, 0 catalog rows, latest research stamp 2026-06-03.

Use this portfolio page for

  • Catalog orientation before locking a model SKU

Do not stop here for

  • Final benchmark picks without opening the relevant model detail page

Catalog rows

0

Models linked to this provider in seed data

Priced output routes

0

Add output pricing to unlock comparisons

Cheapest output

Unknown

Need positive token_out rows

Batch-ready SKUs

0

No batch pricing tracked

Latest catalog ship

Unknown

From model.release ISO prefixes

Freshness

2026-06-03

Researched today

fresh

Catalog release signal

No ISO-prefixed release dates on linked models — lag metric withheld.

Where this host wins

Task positioning unavailable until catalog models pick up capability tags or benchmarks.

Getting started

Official entry points from seed metadata — confirm quotas and regions in vendor docs.

SDKs & libraries

Compliance notes (verbatim seed excerpts)

Not yet verified from seed copy — no SOC/ISO/HIPAA-class sentences detected to quote verbatim.

Platform Overview

vLLM is an Apache-2.0 open-source inference runtime for serving large language models on user-managed hardware. It supports offline batched inference and an OpenAI-compatible server, with model availability determined by the Hugging Face model repository loaded into the runtime. LLMReference tracks it as a self-hosted runtime with no published token prices.

Platform Details

TypeInference Platform
TierTier 3
Models0

Organization

vllm-project

vLLM is a high-throughput, memory-efficient open-source inference engine and serving library for large language models. It is self-hosted rather than a managed token-priced API, and it ships an OpenAI-compatible HTTP server for local or private deployments.