How many Arcee AI models does LLMReference track?

LLMReference currently tracks 9 models available through Arcee AI's API. Arcee AI's full catalog may be larger.

What are Arcee AI's most popular models?

Arcee AI's top models include Llama 3.3 70B Instruct (free), Qwen2.5-Coder-32B-Instruct, DeepSeek R1 Distill Llama 70B, Gemma 2 27B Instruct, Mistral NeMo Instruct (2407).

What is Arcee AI's pricing?

Arcee AI pricing ranges from $0.15/1M to $0.6/1M input tokens depending on the model.

Arcee AI

Researched 3d agoAI LabTier 3

Arcee AI

CodingRAGAgentsLong contextClassificationJSON / Tool useapi

Arcee AI offers 9 tracked models (5 with output token pricing). This catalog covers coding, rag, and agents; open any model detail page for benchmarks, batch tiers, and migration prompts.
Covers 6 workload areas across 9 tracked models; last verified 2026-06-29.

Use it for

Teams comparing token and batch pricing across this provider's models
Operators routing coding, rag, and agents workloads through this API

Do not use it for

Final benchmark picks without opening the relevant model detail page

Tracked models

Models available through this provider

Priced output routes

Models with output token pricing tracked

Cheapest output

$0.450

Mistral NeMo Instruct (2407) on this route

Batch-ready models

No batch pricing tracked

Latest model release

2026-04-01

92d since newest release

Freshness

2026-06-29

Researched 3d ago

fresh

Information

TypeAI Lab

TierTier 3

Models9

CompanyArcee AI

Arcee AI is a custom model fine-tuning and inference API platform specializing in efficient, domain-adapted language models.

Links

Website X / Twitter

Catalog freshness

The newest model tracked on this provider was released 2026-04-01 (92d ago).

Where this host wins

Coding: 2 tracked models with SWE-bench / HumanEval-style scores.
RAG: 5 tracked models with ruler / needle retrieval benchmarks.
Agentic: 4 tracked models with BFCL, tau-bench, and SWE-bench tool-use coverage.
Long-context: 7 tracked models with context-token or InfiniteBench-class signal.

Compliance notes

No verified compliance claims (SOC 2, ISO, HIPAA) tracked for this provider yet — check the vendor's trust center for current certifications.

Platform Overview

Arcee AI is a custom model fine-tuning and inference API platform specializing in efficient, domain-adapted language models.

Compare per-model pricing, input and output token costs, batch availability, and benchmark coverage.

Available Models(9)

View all →

Model	Input (per 1M)	Output (per 1M)	Type
Trinity-Large-Thinking
Trinity-Large-Preview			Serverless
Trinity Mini			Serverless
Trinity Nano			Serverless
DeepSeek R1 Distill Llama 70B	$0.35	$1.05	Serverless
Llama 3.3 70B Instruct (free)	$0.6	$1.8	Serverless
Qwen2.5-Coder-32B-Instruct	$0.4	$1.2	Serverless
Mistral NeMo Instruct (2407)	$0.15	$0.45	Serverless
Gemma 2 27B Instruct	$0.25	$0.75	Serverless

Where else to run this

DeepSeek R1 Distill Llama 70B on Arcee AI

Provider setup and pricing

Qwen2.5-Coder-32B-Instruct on Arcee AI

Provider setup and pricing

Mistral NeMo Instruct (2407) on Arcee AI

Provider setup and pricing

DeepSeek R1 Distill Llama 70B on DeepInfra

Alternative host

Qwen2.5-Coder-32B-Instruct on Cloudflare Workers AI

Alternative host

Mistral NeMo Instruct (2407) on NVIDIA NIM

Alternative host