LLM Reference
AWS Bedrock

AWS Bedrock Models — Pricing & Benchmarks

130 models available · Amazon Web Services

AWS Bedrock hosts 130 AI models in this catalog. The lowest listed input price is Amazon Titan Text Large at $0.02/1M input tokens. LLM Reference lets you compare these models across all 80 providers without switching tabs.

ModelInput (per 1M)Output (per 1M)Context
Amazon Titan Text Large$0.02$0.08
Mistral Voxtral Mini 3B 2507$0.04$0.04
Nova Micro$0.04$0.14128k
Nemotron 3 Nano 30B-A3B$0.06$0.24
Nova Lite$0.06$0.24300k
NVIDIA Nemotron Nano 9B v2$0.06$0.23
GPT OSS Safeguard 20B$0.07$0.2131k
Amazon Titan Embeddings G1 - Text$0.1512
GLM-4 9B$0.1$0.1131k
Llama 3.2 1B Instruct$0.1$0.1128k
Ministral 8B$0.1$0.132k
Mistral Ministral 3B$0.1$0.1
Mistral Voxtral Small 24B 2507$0.1$0.3
Qwen3-30B-A3B$0.1$0.3128k
Llama 3.2 3B Instruct$0.15$0.15128k
Mistral 7B Instruct$0.15$0.2
Mistral 7B Instruct v0.2$0.15$0.232k
Mistral 7B v0.1$0.15$0.28k
Mistral Ministral 3 8B$0.15$0.15
OpenAI GPT OSS Safeguard 120B$0.15$0.6
Qwen3-32B$0.15$0.6240k
Qwen3-Coder-30B-A3B-Instruct$0.15$0.62262k
Qwen3-Next-80B-A3B$0.15$1.20
Titan Text Lite$0.15$0.24k
Llama 4 Scout 17B-16E Instruct$0.17$0.2210m
Gemma 3 4B IT$0.2$0.2128k
Jamba 1.5 Mini$0.2$0.4256k
Llama 3.2 11B Instruct$0.2$0.27128k
Llama 3.2 11B Vision$0.2$0.27128k
Llama 3.2 11B Vision Instruct$0.2$0.27128k
Ministral 3 14B$0.2$0.2
NVIDIA Nemotron Nano 12B v2 VL BF16$0.2$0.6
Titan Text Express$0.2$0.68k
Llama 3.1 8B Instruct$0.22$0.22128k
Gemma 3 27B PT$0.23$0.38128k
Llama 4 Maverick 17B Instruct FP8$0.24$0.971m
Ministral 14B$0.24$0.2432k
Claude 3 Haiku$0.25$1.25200k
Command Light$0.3$0.64k
Gemma 3 12B$0.3$0.333k
Llama 3 8B Instruct$0.3$0.68k
MiniMax M2.1$0.3$1.20200k
Amazon Titan Text G1 - Lite$0.4$0.84k
MiniMax M2$0.4$1.60197k
Mistral Devstral 2 123B$0.4$2.00
Qwen3-235B-A22B$0.4$1.20128k
Mixtral 8x7B$0.45$0.732k
Mixtral 8x7B Instruct v0.1$0.45$0.4533k
Mistral Large 2$0.48$2.40128k
Cohere Command R$0.5$1.50128k
Command R$0.5$1.50128k
Gemma 3 27B$0.5$0.5131k
Kimi K2$0.5$2.00262k
Mistral Large 3 675B Instruct$0.5$1.50128k
Mistral Magistral Small 2509$0.5$1.50
Qwen3-Coder-Next$0.5$1.20256k
Titan Text Premier$0.5$1.5032k
Qwen3-VL-235B-A22B$0.53$2.66128k
DeepSeek V3.1$0.6$1.7364k
Kimi K2 Thinking$0.6$2.50256k
Kimi K2.5$0.6$3.00256k
Palmyra X5$0.6$6.001m
DeepSeek V3.2$0.62$1.85160k
Llama 3.1 70B Instruct$0.72$0.72128k
Llama 3.3 70B Instruct (free)$0.72$0.7266k
DeepSeek V3$0.75$3.0064k
Llama 2 13B Chat$0.75$1.004k
Amazon Titan Text G1 - Express$0.8$1.608k
Claude 3.5 Haiku$0.8$4.00200k
Claude Haiku 4.5$0.8$4.00200k
Claude Instant 1.2$0.8$2.40100k
Nova Pro$0.8$3.20300k
Llama 3.3 70B Instruct$0.96$1.28128k
Llama 3 70B Instruct$0.99$0.998k
Mistral Small$1.00$3.0032k
Grok 4.3$1.25$2.501m
DeepSeek R1$1.35$5.40128k
Llama 3.2 90B Instruct$1.35$1.80128k
Llama 3.2 90B Vision$1.35$1.80128k
Llama 3.2 90B Vision Instruct$1.35$1.80128k
Command$1.50$2.004k
Llama 2 70B Chat$1.95$2.564k
Jamba 1.5 Large$2.00$8.00256k
Mistral Large$2.00$6.0032k
Mistral Pixtral Large$2.00$6.00
Meta Llama 2 Chat 70B$2.10$2.804k
Llama 3.1 405B Instruct$2.40$2.40128k
Amazon Nova Premier$2.50$12.501m
GPT-5.4$2.75$16.501.05m
Claude 3 Sonnet$3.00$15.00200k
Claude 3.5 Sonnet$3.00$15.00200k
Claude 3.7 Sonnet$3.00$15.00200k
Claude Sonnet 4.6$3.00$15.001m
Cohere Command R Plus$3.00$15.00128k
Command R+$3.00$15.00128k
Claude Opus 4.7$5.00$25.001m
GPT-5.5$5.50$33.001.05m
AI21 Labs J2-Mid$6.00$6.008k
Claude 3.5 Sonnet v2$6.00$30.00200k
Claude 2$8.00$24.00100k
Claude 2.1$8.00$24.00200k
Claude Sonnet 4$8.00$40.00200k
AI21 Labs J2-Ultra$9.00$9.008k
Claude Sonnet 4.5$9.00$45.00200k
Claude Fable 5$10.00$50.001m
Jurassic-2 Mid$12.50$12.508k
Claude 3 Opus$15.00$75.00200k
Claude Opus 4.1$15.00$75.00200k
Claude Opus 4.6$15.00$75.001m
Claude Opus 4.5$18.00$90.00200k
Jurassic-2 Ultra$18.80$18.808k
Amazon Nova 2 Pro1m
Amazon Nova 2 Sonic
Amazon Nova Canvas
Amazon Nova Multimodal Embeddings
Amazon Nova Reel
Amazon Titan Image Generator G1 v2
Amazon Titan Multimodal Embeddings G1
Amazon Titan Text Embeddings V2
Claude Mythos 51m
Claude Opus 4.81m
Claude Sonnet 51m
Cohere Embed English
Cohere Embed Multilingual
Cohere Embed v4
Cohere Rerank 3.5
Gemma 4 26B A4B256k
Gemma 4 31B256k
Gemma 4 E2B128k
Qwen3-Coder-480B-A35B-Instruct262k

Where else to run this

Pricing Overview

Cheapest$0.02/1M
Most expensive$18.80/1M

About AWS Bedrock

Amazon Bedrock is a comprehensive, fully managed service for building and scaling generative AI applications. The platform provides access to a diverse array of high-performing foundation models (FMs) from leading AI companies through a unified API, enabling users to select the most suitable models for their specific use cases. Key features include model customization using proprietary data through techniques like fine-tuning and Retrieval Augmented Generation (RAG), which significantly enhances the relevance and accuracy of AI outputs. Additionally, the platform supports the automation of complex tasks with agents capable of executing multi-step operations, making it versatile for applications ranging from text generation and image creation to conversational AI. Beyond its robust technical capabilities, Amazon Bedrock offers a serverless experience that streamlines infrastructure management, allowing developers to focus on application development without the burden of managing underlying resources. The platform prioritizes security and compliance, ensuring that data remains within the AWS ecosystem and adheres to industry standards. Bedrock's flexible pricing models, including pay-as-you-go options, enable organizations to effectively manage costs while scaling their AI initiatives. This combination of advanced features, ease of use, and cost-effectiveness positions Amazon Bedrock as a powerful tool for businesses looking to innovate rapidly in the generative AI space, ultimately enhancing productivity and operational efficiency.

Full provider profile →