LLM Reference
Venice AI

Venice AI

Researched 17d agoInference PlatformTier 3

Venice AI

CodingRAGAgentsLong contextVisionClassificationJSON / Tool use

Venice AI offers 3 tracked models (0 with output token pricing). This catalog covers coding, rag, and agents; open any model detail page for benchmarks, batch tiers, and migration prompts.

Covers 7 workload areas across 3 tracked models; last verified 2026-06-15.

Use it for

  • Operators routing coding, rag, and agents workloads through this API

Do not use it for

  • Final benchmark picks without opening the relevant model detail page
  • Strict price-per-token comparisons until output pricing is sourced

Tracked models

3

Models available through this provider

Priced output routes

0

Output pricing not yet tracked

Cheapest output

Unknown

Output pricing not yet tracked

Batch-ready models

0

No batch pricing tracked

Latest model release

2025-12-01

213d since newest release

Freshness

2026-06-15

Researched 17d ago

fresh

Information

TypeInference Platform
TierTier 3
Models3
CompanyVenice AI

Venice AI is a private, uncensored AI platform offering access to advanced open-source models for generative text, code, image generation, and conversations via decentralized infrastructure.

Catalog freshness

The newest model tracked on this provider was released 2025-12-01 (213d ago).

Where this host wins

  • Coding: 2 tracked models with SWE-bench / HumanEval-style scores.
  • RAG: 2 tracked models with ruler / needle retrieval benchmarks.
  • Agentic: 1 tracked model with BFCL, tau-bench, and SWE-bench tool-use coverage.
  • Long-context: 2 tracked models with context-token or InfiniteBench-class signal.

Getting started

Official product, docs, and pricing links — confirm quotas and regions in the vendor docs.

Compliance notes

No verified compliance claims (SOC 2, ISO, HIPAA) tracked for this provider yet — check the vendor's trust center for current certifications.

Platform Overview

Private, uncensored AI platform offering access to advanced open-source models for generative text, code, image generation, and conversations via decentralized compute resources.

Compare per-model pricing, input and output token costs, batch availability, and benchmark coverage.

Available Models(3)

View all →

All models available as Serverless

Contact provider for pricing

Where else to run this