How many models does Lepton AI API offer?

Lepton AI API currently offers 14 models through its API.

What are Lepton AI API's most popular models?

Lepton AI API's top models include WizardLM-2 7B, WizardLM-2 8x22B, OpenChat 3.5 (0106), Mixtral 8x7B, Mistral 7B v0.1.

What is Lepton AI API's pricing?

Lepton AI API pricing ranges from $0.07/1M to $0.80/1M input tokens depending on the model.

Lepton AI API

Researched 3d agoInference PlatformTier 2

Lepton AI

CodingClassificationJSON / Tool useAI

Lepton AI API exposes 14 tracked models (14 with output token pricing in seed data). Task coverage across this catalog includes coding, classification, and json / tool use; open any model detail page for benchmarks, batch tiers, and migration prompts.
Portfolio context: 3 decision-task tags, 14 catalog rows, latest research stamp 2026-06-01.

Use this portfolio page for

Teams comparing token and batch economics on this surface
Operators routing coding, classification, and json / tool use workloads through this API

Do not stop here for

Final benchmark picks without opening the relevant model detail page

Catalog rows

Models linked to this provider in seed data

Priced output routes

Rows with token_out in seed data

Cheapest output

$0.070

WizardLM-2 7B on this route

Batch-ready SKUs

No batch pricing tracked

Latest catalog ship

2024-04-18

777d since dated release field

Freshness

2026-06-01

Researched 3d ago

fresh

Catalog release signal

Latest ISO-dated model.release in this catalog is 2024-04-18 (777d ago).

Where this host wins

Coding: 7 tracked models with SWE-bench / HumanEval-style scores.
Classification: 10 tracked models with MMLU-class moderation/safety coverage.
JSON/tool-use: 9 tracked models with BFCL / Nexus strict-JSON routing coverage.

Getting started

Official entry points from seed metadata — confirm quotas and regions in vendor docs.

Product Docs Portal Pricing

Compliance notes (verbatim seed excerpts)

Not yet verified from seed copy — no SOC/ISO/HIPAA-class sentences detected to quote verbatim.

Platform Overview

Lepton AI is a comprehensive cloud-native platform designed to simplify the development and deployment of AI applications. It offers a user-friendly interface that allows developers to build models natively in Python, eliminating the need for complex containerization or Kubernetes expertise. The platform supports local debugging, enabling users to test their models before deployment with a simple command. With a flexible API for easy integration into various applications and support for heterogeneous hardware, Lepton AI optimizes performance based on specific application needs. This flexibility allows for efficient scaling, accommodating workloads that can expand up to 1TB of memory. The platform provides a robust set of tools and infrastructure to enhance AI workflows. Its cloud-native architecture supports high-performance computing, featuring smart scheduling and dynamic batching to minimize downtime. Lepton AI enables continuous deployment through GitHub integration, facilitating rapid iteration and scaling of AI applications. The platform also includes built-in monitoring, logging, and autoscaling capabilities, ensuring that applications remain responsive and efficient in production environments. With these features, Lepton AI streamlines the entire AI development process, from model creation to deployment and maintenance, making it accessible for organizations of various sizes looking to innovate with AI technologies.

Compare per-model pricing, input and output token costs, batch availability, and benchmark coverage.

Available Models(14)

View all →

All models available as Serverless

Model	Input (per 1M)	Output (per 1M)
Llama 3 70B Instruct	$0.80	$0.80
Llama 3 8B Instruct	$0.07	$0.07
Gemma 7B Instruct	$0.07	$0.07
WizardLM-2 7B	$0.07	$0.07
WizardLM-2 8x22B	$0.50	$0.50
OpenChat 3.5 (0106)	$0.07	$0.07
Dolphin 2.6 Mixtral 8x7B	$0.30	$0.30
Nous Hermes 13B	$0.13	$0.13
Mixtral 8x7B	$0.30	$0.30
MythoMax L2 13B	$0.13	$0.13

View full catalog →

Platform Details

TypeInference Platform

TierTier 2

Models14

Organization

Lepton AI

Founded2023

Sacramento, California, United States

Lepton AI is building a scalable and efficient AI Application platform. Their platform aims to simplify the development and deployment of AI applications, making it easier for businesses to leverage artificial intelligence technologies. The company focuses on providing tools and infrastructure to streamline AI workflows, enabling faster development cycles and more efficient resource utilization. While specific details about their platform's features are not provided in the context, Lepton AI's mission is to make AI application development more accessible and efficient for developers and businesses alike.

Links

Website X / Twitter LinkedIn Crunchbase