Bitdeer AI
Researched 3d agoBitdeer Technologies Group
Bitdeer AI offers 18 tracked models (18 with output token pricing). This catalog covers coding, rag, and agents; open any model detail page for benchmarks, batch tiers, and migration prompts.
Covers 7 workload areas across 18 tracked models; last verified 2026-06-29.
Use it for
- Teams comparing token and batch pricing across this provider's models
- Operators routing coding, rag, and agents workloads through this API
Do not use it for
- Final benchmark picks without opening the relevant model detail page
Tracked models
18
Models available through this provider
Priced output routes
18
Models with output token pricing tracked
Cheapest output
$0.240
Gemma 2 9B on this route
Batch-ready models
0
No batch pricing tracked
Latest model release
2025-01-20
528d since newest release
Freshness
2026-06-29
Researched 3d ago
Information
Bitdeer Technologies Group is a Singapore-based AI infrastructure and cloud computing company. The company provides AI Cloud, a serverless and dedicated inference platform for deploying and scaling LLM and generative AI models.
Catalog freshness
The newest model tracked on this provider was released 2025-01-20 (528d ago).
Where this host wins
- Coding: 10 tracked models with SWE-bench / HumanEval-style scores.
- RAG: 3 tracked models with ruler / needle retrieval benchmarks.
- Agentic: 2 tracked models with BFCL, tau-bench, and SWE-bench tool-use coverage.
- Long-context: 12 tracked models with context-token or InfiniteBench-class signal.
Getting started
Official product, docs, and pricing links — confirm quotas and regions in the vendor docs.
Compliance notes
No verified compliance claims (SOC 2, ISO, HIPAA) tracked for this provider yet — check the vendor's trust center for current certifications.
Platform Overview
Bitdeer AI Cloud offers both serverless (pay-per-use) and dedicated (reserved GPU) deployment options for LLM inference. The platform provides OpenAI-compatible API endpoints and supports fine-tuning on dedicated instances. Models are hosted on NVIDIA H100/H200/A100 GPUs with automatic scaling.
Compare per-model pricing, input and output token costs, batch availability, and benchmark coverage.
Available Models(18)
View all →All models available as Serverless
| Model | Input (per 1M) | Output (per 1M) |
|---|---|---|
| DeepSeek R1 | $0.1 | $0.3 |
| DeepSeek V3 | $0.1 | $0.3 |
| Llama 3.2 11B Vision Instruct | $0.15 | $0.45 |
| Llama 3.2 1B Instruct | $0.15 | $0.45 |
| Llama 3.2 90B Vision Instruct | $0.15 | $0.45 |
| Mistral NeMo (2407) | $0.18 | $0.54 |
| Gemma 2 27B | $0.08 | $0.24 |
| Gemma 2 9B | $0.08 | $0.24 |
| Qwen2.5-0.5B | $0.12 | $0.36 |
| Qwen2.5-1.5B | $0.12 | $0.36 |