LLM Reference
Bitdeer AI

Bitdeer AI

Researched 3d ago

Bitdeer Technologies Group

CodingRAGAgentsLong contextVisionClassificationJSON / Tool useInference

Bitdeer AI offers 18 tracked models (18 with output token pricing). This catalog covers coding, rag, and agents; open any model detail page for benchmarks, batch tiers, and migration prompts.

Covers 7 workload areas across 18 tracked models; last verified 2026-06-29.

Use it for

  • Teams comparing token and batch pricing across this provider's models
  • Operators routing coding, rag, and agents workloads through this API

Do not use it for

  • Final benchmark picks without opening the relevant model detail page

Tracked models

18

Models available through this provider

Priced output routes

18

Models with output token pricing tracked

Cheapest output

$0.240

Gemma 2 9B on this route

Batch-ready models

0

No batch pricing tracked

Latest model release

2025-01-20

528d since newest release

Freshness

2026-06-29

Researched 3d ago

fresh

Information

Models18
CompanyBitdeer Technologies Group
Founded2018
Singapore

Bitdeer Technologies Group is a Singapore-based AI infrastructure and cloud computing company. The company provides AI Cloud, a serverless and dedicated inference platform for deploying and scaling LLM and generative AI models.

Catalog freshness

The newest model tracked on this provider was released 2025-01-20 (528d ago).

Where this host wins

  • Coding: 10 tracked models with SWE-bench / HumanEval-style scores.
  • RAG: 3 tracked models with ruler / needle retrieval benchmarks.
  • Agentic: 2 tracked models with BFCL, tau-bench, and SWE-bench tool-use coverage.
  • Long-context: 12 tracked models with context-token or InfiniteBench-class signal.

Getting started

Official product, docs, and pricing links — confirm quotas and regions in the vendor docs.

Compliance notes

No verified compliance claims (SOC 2, ISO, HIPAA) tracked for this provider yet — check the vendor's trust center for current certifications.

Platform Overview

Bitdeer AI Cloud offers both serverless (pay-per-use) and dedicated (reserved GPU) deployment options for LLM inference. The platform provides OpenAI-compatible API endpoints and supports fine-tuning on dedicated instances. Models are hosted on NVIDIA H100/H200/A100 GPUs with automatic scaling.

Compare per-model pricing, input and output token costs, batch availability, and benchmark coverage.

Available Models(18)

View all →

All models available as Serverless

ModelInput (per 1M)Output (per 1M)
DeepSeek R1$0.1$0.3
DeepSeek V3$0.1$0.3
Llama 3.2 11B Vision Instruct$0.15$0.45
Llama 3.2 1B Instruct$0.15$0.45
Llama 3.2 90B Vision Instruct$0.15$0.45
Mistral NeMo (2407)$0.18$0.54
Gemma 2 27B$0.08$0.24
Gemma 2 9B$0.08$0.24
Qwen2.5-0.5B$0.12$0.36
Qwen2.5-1.5B$0.12$0.36
View full catalog →

Where else to run this