What is Falcon used for?

Falcon is used for structured outputs, coding, and long-context generation. The family description and listed model capabilities point to those workloads as the best fit.

How does Falcon compare to MOSS-Audio?

Falcon by Technology Innovation Institute (TII) is strongest where you need structured outputs, while MOSS-Audio by MOSI AI is the closest related family to check for multimodal. Falcon has 4 listed variants and reaches up to 128k context, so compare the specs and pricing tables before choosing a production model.

Which Falcon model should I use?

For the lowest listed input price, start with Falcon 7B through Microsoft Foundry at $0.52/1M input tokens. For the most capable/latest local choice, evaluate Falcon 40B with structured outputs.

Falcon Models by Technology Innovation Institute (TII)

Technology Innovation Institute (TII)Apache 2.0Open sourceOpen Source

4 models2023–2024Up to 128k ctxFrom $0.52/1M input

Details

ResearcherTechnology Innovation Institute (TII)

LicenseApache 2.0OSI-approved

Commercial useCommercial use: permitted

Models4

Released2023–2024

Max context128k

Capabilities

Structured Outputs2 of 4 models

Links

Website HuggingFace

About

The Falcon family of large language models (LLMs), developed by the Technology Innovation Institute (TII) in Abu Dhabi, offers a diverse range of models that are open-source and freely available for research and commercial applications. Notably, this includes the Falcon-40B, which excelled on the Hugging Face Open LLM Leaderboard, and the even more advanced Falcon-180B, which matches the performance of many proprietary models. These models benefit from training on extensive datasets like the RefinedWeb dataset, known for its high-quality, filtered, and deduplicated web content. Additionally, the Falcon family includes instruction-tuned versions, such as Falcon-7B-Instruct and Falcon-40B-Instruct, optimized for conversational interactions. Recently, the Falcon Mamba 7B was introduced, offering improved memory efficiency and enhanced long-text generation through its novel state-space language model (SSLM) architecture. This family of models underscores a strong commitment to open-source AI, making advanced language capabilities accessible to a broad audience of researchers and users 1481011.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

4 in view

Falcon 3 7B InstructCurrent

Use when the workload needs 128k context and 7B parameters.

2024-12128k context7B parameters

Falcon 180BCurrent

Use when the workload needs 180B parameters.

2023-11180B parameters

Falcon 40BCurrent

Use when the workload needs 40B parameters and structured outputs.

2023-1140B parametersstructured outputs

Falcon 7BCurrent

Use when the workload needs 7B parameters and structured outputs.

2023-117B parametersstructured outputs

Current Falcon variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
Falcon 3 7B Instruct	Use when the workload needs 128k context and 7B parameters.	2024-12	128k context7B parameters	Current
Falcon 180B	Use when the workload needs 180B parameters.	2023-11	180B parameters	Current
Falcon 40B	Use when the workload needs 40B parameters and structured outputs.	2023-11	40B parametersstructured outputs	Current
Falcon 7B	Use when the workload needs 7B parameters and structured outputs.	2023-11	7B parametersstructured outputs	Current

Release Timeline

2 release groups

2024-12

1 current

Falcon 3 7B Instruct

128k context7B parameters

Current

2023-11

3 current

Falcon 180B

180B parameters

Current

Falcon 40B

40B parametersstructured outputs

Current

Falcon 7B

7B parametersstructured outputs

Current

Specifications(4 models)

Falcon model specifications comparison
Model	Released	Context	Parameters	Structured Outputs
Falcon 3 7B Instruct	2024-12	128k	7B	No
Falcon 180B	2023-11	—	180B	No
Falcon 40B	2023-11	—	40B	Yes
Falcon 7B	2023-11	—	7B	Yes

Available From(6 providers)

Alibaba Cloud PAI-EAS

Scale AI GenAI Platform

Pricing

Falcon model pricing by provider
Model	Provider	Input / 1M	Output / 1M	Type
Falcon 7B	Microsoft Foundry	$0.52	$0.67	Provisioned
Falcon 40B	Replicate API	$0.65	$2.75	Serverless
Falcon 40B	Microsoft Foundry	$1.54	$1.77	Provisioned

Popular comparisons in this family

Frequently Asked Questions

What is Falcon used for?: Falcon is used for structured outputs, coding, and long-context generation. The family description and listed model capabilities point to those workloads as the best fit.
How does Falcon compare to MOSS-Audio?: Falcon by Technology Innovation Institute (TII) is strongest where you need structured outputs, while MOSS-Audio by MOSI AI is the closest related family to check for multimodal. Falcon has 4 listed variants and reaches up to 128k context, so compare the specs and pricing tables before choosing a production model.
Which Falcon model should I use?: For the lowest listed input price, start with Falcon 7B through Microsoft Foundry at $0.52/1M input tokens. For the most capable/latest local choice, evaluate Falcon 40B with structured outputs.

Models(4)

Falcon 3 7B Instruct

2024-12128k7B1 provider

Open Source

Falcon 180B

2023-11180B2 providers

Open Source

Falcon 40B

2023-1140B4 providers

Open Source

Falcon 7B

2023-117B3 providers

Open Source

Falcon Models by Technology Innovation Institute (TII)

Details

Capabilities

Links

About

Current Variants

Release Timeline

Specifications(4 models)

Available From(6 providers)

Pricing

Popular comparisons in this family

Frequently Asked Questions

Related Model Families

Models(4)