Cogito Models by Deep Cogito
About
The Cogito family is a series of hybrid open-weight reasoning models from Deep Cogito, trained with Iterated Distillation and Amplification (IDA). Models span 3B to 671B parameters, support both direct and extended-thinking (reasoning) modes, and are fine-tuned from Llama and Qwen base checkpoints (v1 Preview) and DeepSeek V3 Base (v2.1). The models are available via Fireworks AI, Together AI, and other inference providers.
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Cogito v2.1 671B | Use when the workload needs 128K context, 671B parameters, and reasoning. | 2025-11 | 128K context, 671B parameters, reasoning | Current |
| Cogito v2 Preview Llama 70B | Use when the workload needs 70B parameters, reasoning, and tool use. | 2025-07 | 70B parameters, reasoning, tool use | Current |
| Cogito v2 Preview Llama 109B MoE | Use when the workload needs 109B parameters, reasoning, and tool use. | 2025-07 | 109B parameters, reasoning, tool use | Current |
| Cogito v2 Preview Llama 405B | Use when the workload needs 405B parameters, reasoning, and tool use. | 2025-07 | 405B parameters, reasoning, tool use | Current |
| Cogito v2 Preview DeepSeek 671B MoE | Use when the workload needs 671B parameters, reasoning, and tool use. | 2025-07 | 671B parameters, reasoning, tool use | Current |
| Cogito v1 Preview Llama 3B | Use when the workload needs 128K context, 3B parameters, and reasoning. | 2025-04 | 128K context, 3B parameters, reasoning | Current |
| Cogito v1 Preview Llama 70B | Use when the workload needs 128K context, 70B parameters, and reasoning. | 2025-04 | 128K context, 70B parameters, reasoning | Current |
| Cogito v1 Preview Llama 8B | Use when the workload needs 128K context, 8B parameters, and reasoning. | 2025-04 | 128K context, 8B parameters, reasoning | Current |
| Cogito v1 Preview Qwen-14B | Use when the workload needs 128K context, 14B parameters, and reasoning. | 2025-04 | 128K context, 14B parameters, reasoning | Current |
| Cogito v1 Preview Qwen-32B | Use when the workload needs 128K context, 32B parameters, and reasoning. | 2025-04 | 128K context, 32B parameters, reasoning | Current |
Release Timeline
3 release groups
Specifications (10 models)
| Model | Released | Context | Parameters | Reasoning | Fn Calling | Tool Use | Structured Outputs | Code Exec |
|---|---|---|---|---|---|---|---|---|
| Cogito v2.1 671B | 2025-11 | 128K | 671B | Yes | Yes | Yes | Yes | Yes |
| Cogito v2 Preview Llama 70B | 2025-07 | — | 70B | Yes | Yes | Yes | No | No |
| Cogito v2 Preview Llama 109B MoE | 2025-07 | — | 109B | Yes | Yes | Yes | No | No |
| Cogito v2 Preview Llama 405B | 2025-07 | — | 405B | Yes | Yes | Yes | No | No |
| Cogito v2 Preview DeepSeek 671B MoE | 2025-07 | — | 671B | Yes | Yes | Yes | No | No |
| Cogito v1 Preview Llama 3B | 2025-04 | 128K | 3B | Yes | Yes | Yes | Yes | No |
| Cogito v1 Preview Llama 70B | 2025-04 | 128K | 70B | Yes | Yes | Yes | Yes | No |
| Cogito v1 Preview Llama 8B | 2025-04 | 128K | 8B | Yes | Yes | Yes | Yes | No |
| Cogito v1 Preview Qwen-14B | 2025-04 | 128K | 14B | Yes | Yes | Yes | Yes | No |
| Cogito v1 Preview Qwen-32B | 2025-04 | 128K | 32B | Yes | Yes | Yes | Yes | No |
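The specification rows above can be treated as structured data when shortlisting a model. The following is a minimal sketch, assuming the rows are transcribed by hand into a hypothetical `ModelSpec` record. The class, its field names, and the `shortlist` helper are illustrative only and are not part of any Deep Cogito or provider API. A context of `None` stands for rows where the table lists no context length.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ModelSpec:
    """Hypothetical record mirroring one row of the specifications table."""
    name: str
    params_b: int               # parameter count in billions
    context_k: Optional[int]    # context length in K tokens; None if not listed
    structured_outputs: bool
    code_exec: bool


# Values transcribed from the table. Every listed model also reports
# reasoning, function calling, and tool use, so those columns are omitted.
SPECS = [
    ModelSpec("Cogito v2.1 671B", 671, 128, True, True),
    ModelSpec("Cogito v2 Preview Llama 70B", 70, None, False, False),
    ModelSpec("Cogito v2 Preview Llama 109B MoE", 109, None, False, False),
    ModelSpec("Cogito v2 Preview Llama 405B", 405, None, False, False),
    ModelSpec("Cogito v2 Preview DeepSeek 671B MoE", 671, None, False, False),
    ModelSpec("Cogito v1 Preview Llama 3B", 3, 128, True, False),
    ModelSpec("Cogito v1 Preview Llama 70B", 70, 128, True, False),
    ModelSpec("Cogito v1 Preview Llama 8B", 8, 128, True, False),
    ModelSpec("Cogito v1 Preview Qwen-14B", 14, 128, True, False),
    ModelSpec("Cogito v1 Preview Qwen-32B", 32, 128, True, False),
]


def shortlist(specs, *, min_context_k=0, structured_outputs=False, code_exec=False):
    """Return the specs that meet every requirement, smallest model first."""
    matches = [
        s for s in specs
        # A missing context length is treated as 0, so it fails any minimum.
        if (s.context_k or 0) >= min_context_k
        and (s.structured_outputs or not structured_outputs)
        and (s.code_exec or not code_exec)
    ]
    return sorted(matches, key=lambda s: s.params_b)


if __name__ == "__main__":
    for spec in shortlist(SPECS, min_context_k=128, structured_outputs=True):
        print(spec.name)
```

Sorting by parameter count is a design choice for this sketch: it puts the smallest model that satisfies the requirements first, which is usually the cheapest place to start an evaluation.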
Available From (1 provider)
Pricing
| Model | Provider | Input (USD / 1M tokens) | Output (USD / 1M tokens) | Type |
|---|---|---|---|---|
| Cogito v1 Preview Llama 3B | Fireworks AI | $0.10 | $0.10 | Serverless |
| Cogito v1 Preview Llama 8B | Fireworks AI | $0.20 | $0.20 | Serverless |
| Cogito v1 Preview Qwen-14B | Fireworks AI | $0.20 | $0.20 | Serverless |
| Cogito v1 Preview Llama 70B | Fireworks AI | $0.90 | $0.90 | Serverless |
| Cogito v1 Preview Qwen-32B | Fireworks AI | $0.90 | $0.90 | Serverless |
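Because the prices are quoted per one million tokens, the cost of a workload is the token count divided by 1,000,000 and multiplied by the listed rate, summed over input and output. The sketch below shows that arithmetic, assuming the prices in the table above are current. The `PRICES` dictionary and the `estimate_cost` helper are hypothetical names introduced for illustration, and the token counts in the example are made up.

```python
# Per-1M-token prices in USD as (input, output), copied from the pricing table.
PRICES = {
    "Cogito v1 Preview Llama 3B": (0.1, 0.1),
    "Cogito v1 Preview Llama 8B": (0.2, 0.2),
    "Cogito v1 Preview Qwen-14B": (0.2, 0.2),
    "Cogito v1 Preview Llama 70B": (0.9, 0.9),
    "Cogito v1 Preview Qwen-32B": (0.9, 0.9),
}

TOKENS_PER_PRICE_UNIT = 1_000_000


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a workload at the listed serverless rates."""
    input_price, output_price = PRICES[model]
    total = input_tokens * input_price + output_tokens * output_price
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 2_000_000, 500_000):.2f}")
```

For the hypothetical workload above, Cogito v1 Preview Llama 70B comes to $2.25 and Cogito v1 Preview Llama 3B to $0.25. Extended-thinking mode typically produces more output tokens than direct mode, so output volume is the figure to measure before budgeting.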
Frequently Asked Questions
- What is Cogito used for?
- Cogito is used for reasoning, agent workflows and tool use, and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
- How does Cogito compare to Claude 3?
- Cogito by Deep Cogito is strongest where you need reasoning, while Claude 3 by Anthropic is the closest related family to check for vision and multimodal work. Cogito has 10 listed variants and reaches up to 128K context, while Claude 3 reaches up to 200K context, so compare the specs and pricing tables before choosing a production model.
- Which Cogito model should I use?
- For the lowest listed input price, start with Cogito v1 Preview Llama 3B through Fireworks AI at $0.10 per 1M input tokens. For the most capable and most recent open-weight option, evaluate Cogito v2.1 671B, which lists 128K context along with reasoning, tool use, function calling, and structured outputs.
Models (10)
Cogito v2.1 671B
Cogito v2 Preview Llama 70B
Cogito v2 Preview Llama 109B MoE
Cogito v2 Preview Llama 405B
Cogito v2 Preview DeepSeek 671B MoE
Cogito v1 Preview Llama 3B
Cogito v1 Preview Llama 70B
Cogito v1 Preview Llama 8B
Cogito v1 Preview Qwen-14B
Cogito v1 Preview Qwen-32B
