LLM Reference
Arcee AI

Arcee AI

Researched 3d agoAI LabTier 3
CodingRAGAgentsLong contextClassificationJSON / Tool useapi

Arcee AI offers 9 tracked models (5 with output token pricing). This catalog covers coding, rag, and agents; open any model detail page for benchmarks, batch tiers, and migration prompts.

Covers 6 workload areas across 9 tracked models; last verified 2026-06-29.

Use it for

  • Teams comparing token and batch pricing across this provider's models
  • Operators routing coding, rag, and agents workloads through this API

Do not use it for

  • Final benchmark picks without opening the relevant model detail page

Tracked models

9

Models available through this provider

Priced output routes

5

Models with output token pricing tracked

Cheapest output

$0.450

Mistral NeMo Instruct (2407) on this route

Batch-ready models

0

No batch pricing tracked

Latest model release

2026-04-01

92d since newest release

Freshness

2026-06-29

Researched 3d ago

fresh

Information

TypeAI Lab
TierTier 3
Models9
CompanyArcee AI

Arcee AI is a custom model fine-tuning and inference API platform specializing in efficient, domain-adapted language models.

Read more ->

Catalog freshness

The newest model tracked on this provider was released 2026-04-01 (92d ago).

Where this host wins

  • Coding: 2 tracked models with SWE-bench / HumanEval-style scores.
  • RAG: 5 tracked models with ruler / needle retrieval benchmarks.
  • Agentic: 4 tracked models with BFCL, tau-bench, and SWE-bench tool-use coverage.
  • Long-context: 7 tracked models with context-token or InfiniteBench-class signal.

Compliance notes

No verified compliance claims (SOC 2, ISO, HIPAA) tracked for this provider yet — check the vendor's trust center for current certifications.

Platform Overview

Arcee AI is a custom model fine-tuning and inference API platform specializing in efficient, domain-adapted language models.

Compare per-model pricing, input and output token costs, batch availability, and benchmark coverage.

Available Models(9)

View all →
ModelInput (per 1M)Output (per 1M)Type
Trinity-Large-Thinking
Trinity-Large-Preview
Serverless
Trinity Mini
Serverless
Trinity Nano
Serverless
DeepSeek R1 Distill Llama 70B$0.35$1.05
Serverless
Llama 3.3 70B Instruct (free)$0.6$1.8
Serverless
Qwen2.5-Coder-32B-Instruct$0.4$1.2
Serverless
Mistral NeMo Instruct (2407)$0.15$0.45
Serverless
Gemma 2 27B Instruct$0.25$0.75
Serverless

Where else to run this