LLM Reference

Cogito Models by Deep Cogito

10 models · 2025 · Up to 128K context · From $0.1/1M input tokens

About

The Cogito family is a series of hybrid open-weight reasoning models from Deep Cogito, trained with Iterated Distillation and Amplification (IDA). The models span 3B to 671B parameters and support both direct and extended-thinking (reasoning) modes. The v1 Preview models are fine-tuned from Llama and Qwen base checkpoints, and v2.1 is fine-tuned from DeepSeek V3 Base. The models are available via Fireworks AI, Together AI, and other inference providers.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

Cogito v2.1 671B
Use when the workload needs 128K context, 671B parameters, and reasoning.
2025-11 · 128K context · 671B parameters · reasoning

Cogito v2 Preview Llama 70B
Use when the workload needs 70B parameters, reasoning, and tool use.
2025-07 · 70B parameters · reasoning · tool use

Cogito v2 Preview Llama 109B MoE
Use when the workload needs 109B parameters, reasoning, and tool use.
2025-07 · 109B parameters · reasoning · tool use

Cogito v2 Preview Llama 405B
Use when the workload needs 405B parameters, reasoning, and tool use.
2025-07 · 405B parameters · reasoning · tool use

Cogito v2 Preview DeepSeek 671B MoE
Use when the workload needs 671B parameters, reasoning, and tool use.
2025-07 · 671B parameters · reasoning · tool use

Cogito v1 Preview Llama 3B
Use when the workload needs 128K context, 3B parameters, and reasoning.
2025-04 · 128K context · 3B parameters · reasoning

Cogito v1 Preview Llama 70B
Use when the workload needs 128K context, 70B parameters, and reasoning.
2025-04 · 128K context · 70B parameters · reasoning

Cogito v1 Preview Llama 8B
Use when the workload needs 128K context, 8B parameters, and reasoning.
2025-04 · 128K context · 8B parameters · reasoning

Cogito v1 Preview Qwen-14B
Use when the workload needs 128K context, 14B parameters, and reasoning.
2025-04 · 128K context · 14B parameters · reasoning

Cogito v1 Preview Qwen-32B
Use when the workload needs 128K context, 32B parameters, and reasoning.
2025-04 · 128K context · 32B parameters · reasoning

Release Timeline

3 release groups

2025-11 (1 current)
Cogito v2.1 671B: 128K context · 671B parameters · reasoning (Current)

2025-07 (4 current)
Cogito v2 Preview DeepSeek 671B MoE: 671B parameters · reasoning · tool use (Current)
Cogito v2 Preview Llama 109B MoE: 109B parameters · reasoning · tool use (Current)
Cogito v2 Preview Llama 405B: 405B parameters · reasoning · tool use (Current)
Cogito v2 Preview Llama 70B: 70B parameters · reasoning · tool use (Current)

2025-04 (5 current)
Cogito v1 Preview Llama 3B: 128K context · 3B parameters · reasoning (Current)
Cogito v1 Preview Llama 70B: 128K context · 70B parameters · reasoning (Current)
Cogito v1 Preview Llama 8B: 128K context · 8B parameters · reasoning (Current)
Cogito v1 Preview Qwen-14B: 128K context · 14B parameters · reasoning (Current)
Cogito v1 Preview Qwen-32B: 128K context · 32B parameters · reasoning (Current)

Specifications (10 models)

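Cogito model specifications comparison

| Model | Released | Context | Parameters | Reasoning | Fn Calling | Tool Use | Structured Outputs | Code Exec |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Cogito v2.1 671B | 2025-11 | 128K | 671B | Yes | Yes | Yes | Yes | Yes |
| Cogito v2 Preview Llama 70B | 2025-07 | Not listed | 70B | Yes | Yes | Yes | No | No |
| Cogito v2 Preview Llama 109B MoE | 2025-07 | Not listed | 109B | Yes | Yes | Yes | No | No |
| Cogito v2 Preview Llama 405B | 2025-07 | Not listed | 405B | Yes | Yes | Yes | No | No |
| Cogito v2 Preview DeepSeek 671B MoE | 2025-07 | Not listed | 671B | Yes | Yes | Yes | No | No |
| Cogito v1 Preview Llama 3B | 2025-04 | 128K | 3B | Yes | Yes | Yes | Yes | No |
| Cogito v1 Preview Llama 70B | 2025-04 | 128K | 70B | Yes | Yes | Yes | Yes | No |
| Cogito v1 Preview Llama 8B | 2025-04 | 128K | 8B | Yes | Yes | Yes | Yes | No |
| Cogito v1 Preview Qwen-14B | 2025-04 | 128K | 14B | Yes | Yes | Yes | Yes | No |
| Cogito v1 Preview Qwen-32B | 2025-04 | 128K | 32B | Yes | Yes | Yes | Yes | No |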
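
The specifications above lend themselves to a small lookup when shortlisting a variant. The following is a minimal sketch, not an official tool: the `ModelSpec` class, the `SPECS` list, and the `find_models` helper are hypothetical names introduced here, and the values are transcribed from the table as listed. A context of `None` marks rows where the table shows no context figure.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ModelSpec:
    """One row of the specifications table (hypothetical container)."""
    name: str
    released: str              # YYYY-MM, as listed
    context_k: Optional[int]   # context in thousands of tokens; None if not listed
    params_b: int              # parameter count in billions
    reasoning: bool
    fn_calling: bool
    tool_use: bool
    structured_outputs: bool
    code_exec: bool


# Values transcribed from the specifications table.
SPECS = [
    ModelSpec("Cogito v2.1 671B", "2025-11", 128, 671, True, True, True, True, True),
    ModelSpec("Cogito v2 Preview Llama 70B", "2025-07", None, 70, True, True, True, False, False),
    ModelSpec("Cogito v2 Preview Llama 109B MoE", "2025-07", None, 109, True, True, True, False, False),
    ModelSpec("Cogito v2 Preview Llama 405B", "2025-07", None, 405, True, True, True, False, False),
    ModelSpec("Cogito v2 Preview DeepSeek 671B MoE", "2025-07", None, 671, True, True, True, False, False),
    ModelSpec("Cogito v1 Preview Llama 3B", "2025-04", 128, 3, True, True, True, True, False),
    ModelSpec("Cogito v1 Preview Llama 70B", "2025-04", 128, 70, True, True, True, True, False),
    ModelSpec("Cogito v1 Preview Llama 8B", "2025-04", 128, 8, True, True, True, True, False),
    ModelSpec("Cogito v1 Preview Qwen-14B", "2025-04", 128, 14, True, True, True, True, False),
    ModelSpec("Cogito v1 Preview Qwen-32B", "2025-04", 128, 32, True, True, True, True, False),
]


def find_models(specs, required=(), min_context_k=None, max_params_b=None):
    """Return names of models that have every required capability flag
    and satisfy the optional context and parameter bounds."""
    matches = []
    for spec in specs:
        # A model with no listed context cannot be confirmed to meet a minimum.
        if min_context_k is not None and (
            spec.context_k is None or spec.context_k < min_context_k
        ):
            continue
        if max_params_b is not None and spec.params_b > max_params_b:
            continue
        if all(getattr(spec, flag) for flag in required):
            matches.append(spec.name)
    return matches
```

For example, `find_models(SPECS, required=("code_exec",))` returns `["Cogito v2.1 671B"]`, the only row listing code execution. Treating a missing context value as a failed minimum is a conservative design choice; relax it if the provider documents the context length elsewhere.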

Available From (1 provider)

Pricing

Cogito model pricing by provider

| Model | Provider | Input / 1M tokens | Output / 1M tokens | Type |
| --- | --- | --- | --- | --- |
| Cogito v1 Preview Llama 3B | Fireworks AI | $0.1 | $0.1 | Serverless |
| Cogito v1 Preview Llama 8B | Fireworks AI | $0.2 | $0.2 | Serverless |
| Cogito v1 Preview Qwen-14B | Fireworks AI | $0.2 | $0.2 | Serverless |
| Cogito v1 Preview Llama 70B | Fireworks AI | $0.9 | $0.9 | Serverless |
| Cogito v1 Preview Qwen-32B | Fireworks AI | $0.9 | $0.9 | Serverless |
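
Because prices are quoted per 1M tokens, the cost of a workload is a straightforward proportion of its token counts. The sketch below illustrates the arithmetic using the listed Fireworks AI serverless prices; the `PRICES` mapping and the `estimate_cost` and `cheapest_by_input` helpers are hypothetical names introduced for illustration, and actual billing may differ from this simple model.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), transcribed from the pricing table.
PRICES = {
    "Cogito v1 Preview Llama 3B": (Decimal("0.1"), Decimal("0.1")),
    "Cogito v1 Preview Llama 8B": (Decimal("0.2"), Decimal("0.2")),
    "Cogito v1 Preview Qwen-14B": (Decimal("0.2"), Decimal("0.2")),
    "Cogito v1 Preview Llama 70B": (Decimal("0.9"), Decimal("0.9")),
    "Cogito v1 Preview Qwen-32B": (Decimal("0.9"), Decimal("0.9")),
}

TOKENS_PER_UNIT = Decimal(1_000_000)  # prices are quoted per 1M tokens


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimated USD cost: each token count times its per-1M price."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_UNIT


def cheapest_by_input(prices=PRICES) -> str:
    """Name of the model with the lowest listed input price."""
    return min(prices, key=lambda name: prices[name][0])
```

As a worked case, 2,000,000 input tokens and 500,000 output tokens on Cogito v1 Preview Llama 70B come to (0.9 × 2 + 0.9 × 0.5) = $2.25. `Decimal` is used instead of floats so that small per-token prices do not accumulate rounding error.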

Frequently Asked Questions

What is Cogito used for?
Cogito is used for reasoning, agent workflows and tool use, and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
How does Cogito compare to Claude 3?
Cogito by Deep Cogito is strongest where you need reasoning, while Claude 3 by Anthropic is the closest related family to check for vision and multimodal work. Cogito has 10 listed variants and reaches up to 128K context, while Claude 3 reaches up to 200K context, so compare the specs and pricing tables before choosing a production model.
Which Cogito model should I use?
For the lowest listed input price, start with Cogito v1 Preview Llama 3B through Fireworks AI at $0.1/1M input tokens. For the most capable and most recent open-weight option, evaluate Cogito v2.1 671B, which is listed with 128K context, reasoning, tool use, function calling, and structured outputs.

Models (10)