Firefunction Models by Fireworks AI
About
The Firefunction family of large language models is specifically designed for efficient function calling, enabling seamless interaction with external APIs and access to real-time data. The initial version, Firefunction v1, was built on the Mixtral 8x7B model and achieved GPT-4-level accuracy in practical scenarios, with the advantages of faster speeds and open-source accessibility. Its successor, Firefunction v2, utilizes the Llama 3 model to enhance its function-calling capabilities while maintaining robust conversational and instruction-following features. This version excels in multi-turn dialogues, parallel function calling, and consistently surpasses Llama 3 in function-calling tasks, delivering results akin to GPT-4o on public benchmarks. These models are tailored for real-world use, combining accuracy, speed, and cost effectiveness.
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
Use when the workload needs 8k context and 46B parameters.
Use when the workload needs 32k context and 70B parameters.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Firefunction V1 | Use when the workload needs 8k context and 46B parameters. | 2024-01 | 8k context46B parameters | Current |
| Firefunction V2 | Use when the workload needs 32k context and 70B parameters. | 2024-01 | 32k context70B parameters | Current |
Release Timeline
1 release groupSpecifications(2 models)
| Model | Released | Context | Parameters |
|---|---|---|---|
| Firefunction V1 | 2024-01 | 8k | 46B |
| Firefunction V2 | 2024-01 | 32k | 70B |
Available From(1 provider)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| Firefunction V1 | Fireworks AI | $0.5 | $0.5 | Provisioned |
| Firefunction V2 | Fireworks AI | $0.9 | $0.9 | Serverless |
Frequently Asked Questions
- What is Firefunction used for?
- Firefunction is used for structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
- How does Firefunction compare to Fireworks Functions?
- Firefunction by Fireworks AI is strongest where you need structured outputs, while Fireworks Functions by Fireworks AI is the closest related family to check for adjacent model selection. Firefunction has 2 listed variants and reaches up to 32k context, so compare the specs and pricing tables before choosing a production model.
- Which Firefunction model should I use?
- For the lowest listed input price, start with Firefunction V1 through Fireworks AI at $0.5/1M input tokens. For the most capable/latest local choice, evaluate Firefunction V2 with 32k context.






