LLM Reference

DeepInfra

Inference Platform, Tier 2

Platform Overview

DeepInfra offers serverless AI inference through a simple API, supporting hundreds of models across text generation, embeddings, and more. Pricing is pay-per-token, with no upfront commitments.
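As a rough illustration of how pay-per-token billing adds up, the sketch below estimates the cost of a single request. It assumes separate input and output rates quoted in USD per million tokens, which is a common convention but is not stated on this page. The function name and the prices used are hypothetical placeholders, not DeepInfra's actual rates.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price: Decimal,
    output_price: Decimal,
) -> Decimal:
    """Estimate the cost of one request under pay-per-token pricing.

    Prices are assumed to be quoted in USD per million tokens.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = Decimal(input_tokens) * input_price + Decimal(output_tokens) * output_price
    return total / TOKENS_PER_MILLION


# Hypothetical placeholder prices (USD per million tokens), for illustration only.
cost = estimate_cost_usd(12_000, 3_000, Decimal("0.10"), Decimal("0.40"))
print(f"Estimated cost: ${cost}")
```

Using `Decimal` rather than floats keeps small per-token amounts exact when many requests are summed.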

Available Models (58)

Platform Details

Type: Inference Platform
Tier: Tier 2
Models: 58

Organization

DeepInfra
Founded: 2023
San Francisco, California, United States

DeepInfra is a cloud inference platform offering cost-effective access to open-source AI models. It provides serverless inference for leading models from Meta, Mistral, Alibaba, and others with competitive token-based pricing.