Falcon 7B
About
Falcon-7B, developed by the Technology Innovation Institute (TII), is a decoder-only large language model with 7 billion parameters. It was trained on 1,500 billion tokens from RefinedWeb, a curated web-scale dataset, which improves its performance on language tasks. The architecture uses FlashAttention and multi-query attention to optimize inference speed and memory usage, and its 32 layers with rotary positional embeddings handle sequence lengths of up to 2,048 tokens efficiently. Commonly used for text generation, summarization, translation, and conversational AI, Falcon-7B is released open-source under the Apache 2.0 license and can run on consumer hardware, needing at least 16 GB of memory for inference.
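The 16 GB figure above lines up with the model's weight footprint at half precision. A minimal back-of-the-envelope sketch (the helper function and exact byte counts are illustrative assumptions; real usage adds KV-cache, activation, and framework overhead on top of the weights):

```python
# Rough memory estimate for Falcon-7B inference weights (a sketch; actual
# usage varies with framework overhead, KV cache, and sequence length).
def inference_memory_gib(n_params: float, bytes_per_param: int) -> float:
    """Approximate weight memory in GiB for a given numeric precision."""
    return n_params * bytes_per_param / 1024**3

fp16 = inference_memory_gib(7e9, 2)  # half precision: ~13 GiB for weights alone
int8 = inference_memory_gib(7e9, 1)  # 8-bit quantized: ~6.5 GiB
print(f"fp16 weights: {fp16:.1f} GiB, int8 weights: {int8:.1f} GiB")
```

At fp16 the weights alone occupy roughly 13 GiB, so a 16 GB budget leaves only a few gigabytes of headroom; 8-bit quantization roughly halves the weight footprint, which is why quantized variants are popular on consumer GPUs.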
Capabilities
Providers (4)
| Provider | Input (per 1M) | Output (per 1M) | Type |
|---|---|---|---|
| Azure OpenAI | — | — | Provisioned |
| GCP Vertex AI | — | — | Serverless |
| Cloudflare Workers AI | — | — | Serverless |
| Alibaba Cloud PAI-EAS | — | — | Serverless |