What is Firefunction used for?

Firefunction is used for structured outputs. The family description and listed model capabilities point to those workloads as the best fit.

How does Firefunction compare to Fireworks Functions?

Firefunction by Fireworks AI is strongest where you need structured outputs, while Fireworks Functions by Fireworks AI is the closest related family to check for adjacent model selection. Firefunction has 2 listed variants and reaches up to 32k context, so compare the specs and pricing tables before choosing a production model.

Which Firefunction model should I use?

For the lowest listed input price, start with Firefunction V1 through Fireworks AI at $0.5/1M input tokens. For the most capable/latest local choice, evaluate Firefunction V2 with 32k context.

Firefunction Models by Fireworks AI

Fireworks AILlama 3 CommunityOpen weights

2 models2024Up to 32k ctxFrom $0.5/1M input

Details

ResearcherFireworks AI

LicenseLlama 3 Community

Commercial useCommercial use: conditional

Models2

Released2024

Max context32k

Links

Website HuggingFace

About

The Firefunction family of large language models is specifically designed for efficient function calling, enabling seamless interaction with external APIs and access to real-time data. The initial version, Firefunction v1, was built on the Mixtral 8x7B model and achieved GPT-4-level accuracy in practical scenarios, with the advantages of faster speeds and open-source accessibility. Its successor, Firefunction v2, utilizes the Llama 3 model to enhance its function-calling capabilities while maintaining robust conversational and instruction-following features. This version excels in multi-turn dialogues, parallel function calling, and consistently surpasses Llama 3 in function-calling tasks, delivering results akin to GPT-4o on public benchmarks. These models are tailored for real-world use, combining accuracy, speed, and cost effectiveness.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

2 in view

Firefunction V1Current

Use when the workload needs 8k context and 46B parameters.

2024-018k context46B parameters

Firefunction V2Current

Use when the workload needs 32k context and 70B parameters.

2024-0132k context70B parameters

Current Firefunction variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
Firefunction V1	Use when the workload needs 8k context and 46B parameters.	2024-01	8k context46B parameters	Current
Firefunction V2	Use when the workload needs 32k context and 70B parameters.	2024-01	32k context70B parameters	Current

Release Timeline

1 release group

2024-01

2 current

Firefunction V1

8k context46B parameters

Current

Firefunction V2

32k context70B parameters

Current

Specifications(2 models)

Firefunction model specifications comparison
Model	Released	Context	Parameters
Firefunction V1	2024-01	8k	46B
Firefunction V2	2024-01	32k	70B

Available From(1 provider)

Fireworks AI

Pricing

Firefunction model pricing by provider
Model	Provider	Input / 1M	Output / 1M	Type
Firefunction V1	Fireworks AI	$0.5	$0.5	Provisioned
Firefunction V2	Fireworks AI	$0.9	$0.9	Serverless

Popular comparisons in this family

Frequently Asked Questions

What is Firefunction used for?: Firefunction is used for structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
How does Firefunction compare to Fireworks Functions?: Firefunction by Fireworks AI is strongest where you need structured outputs, while Fireworks Functions by Fireworks AI is the closest related family to check for adjacent model selection. Firefunction has 2 listed variants and reaches up to 32k context, so compare the specs and pricing tables before choosing a production model.
Which Firefunction model should I use?: For the lowest listed input price, start with Firefunction V1 through Fireworks AI at $0.5/1M input tokens. For the most capable/latest local choice, evaluate Firefunction V2 with 32k context.