Arcee AI
Researched 3d agoAI LabTier 3Arcee AI offers 9 tracked models (5 with output token pricing). This catalog covers coding, rag, and agents; open any model detail page for benchmarks, batch tiers, and migration prompts.
Covers 6 workload areas across 9 tracked models; last verified 2026-06-29.
Use it for
- Teams comparing token and batch pricing across this provider's models
- Operators routing coding, rag, and agents workloads through this API
Do not use it for
- Final benchmark picks without opening the relevant model detail page
Tracked models
9
Models available through this provider
Priced output routes
5
Models with output token pricing tracked
Cheapest output
$0.450
Mistral NeMo Instruct (2407) on this route
Batch-ready models
0
No batch pricing tracked
Latest model release
2026-04-01
92d since newest release
Freshness
2026-06-29
Researched 3d ago
Information
Arcee AI is a custom model fine-tuning and inference API platform specializing in efficient, domain-adapted language models.
Read more ->Catalog freshness
The newest model tracked on this provider was released 2026-04-01 (92d ago).
Where this host wins
- Coding: 2 tracked models with SWE-bench / HumanEval-style scores.
- RAG: 5 tracked models with ruler / needle retrieval benchmarks.
- Agentic: 4 tracked models with BFCL, tau-bench, and SWE-bench tool-use coverage.
- Long-context: 7 tracked models with context-token or InfiniteBench-class signal.
Compliance notes
No verified compliance claims (SOC 2, ISO, HIPAA) tracked for this provider yet — check the vendor's trust center for current certifications.
Platform Overview
Arcee AI is a custom model fine-tuning and inference API platform specializing in efficient, domain-adapted language models.
Compare per-model pricing, input and output token costs, batch availability, and benchmark coverage.
Available Models(9)
View all →| Model | Input (per 1M) | Output (per 1M) | Type |
|---|---|---|---|
| Trinity-Large-Thinking | |||
| Trinity-Large-Preview | Serverless | ||
| Trinity Mini | Serverless | ||
| Trinity Nano | Serverless | ||
| DeepSeek R1 Distill Llama 70B | $0.35 | $1.05 | Serverless |
| Llama 3.3 70B Instruct (free) | $0.6 | $1.8 | Serverless |
| Qwen2.5-Coder-32B-Instruct | $0.4 | $1.2 | Serverless |
| Mistral NeMo Instruct (2407) | $0.15 | $0.45 | Serverless |
| Gemma 2 27B Instruct | $0.25 | $0.75 | Serverless |