LLM ReferenceLLM Reference

Cogito v1 Preview Llama 3B

cogito-v1-preview-llama-3b

Researched 11d ago

Last refreshed 2026-05-07. Next refresh: weekly.

Open SourceRAGAgentsLong contextClassificationJSON / Tool use

Cogito v1 Preview Llama 3B is worth evaluating for rag, agents, and long context when its provider route and context window match the workload.

Decision context: RAG task fit, 1 tracked provider route, and research from 2026-05-07.

Use it for

  • Teams evaluating rag, agents, and long context
  • Workloads that can use a 128K context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Vision or document-understanding workloads

Cheapest output

$0.100

Fireworks AI per 1M tokens

Provider routes

1

Tracked API hosts

Quality / dollar

Unknown

No task benchmark coverage yet

Freshness

2026-05-07

Researched 11d ago

fresh

Top use-case fit

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

ProviderInput / 1MOutput / 1MRoute
Fireworks AI$0.100$0.100
Serverless

Benchmark peer barsfor RAG

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

About

Cogito v1 Preview Llama 3B is Deep Cogito's smallest hybrid reasoning model. Fine-tuned from Llama 3.2 3B using Iterated Distillation and Amplification (IDA). Supports both direct and extended-thinking (reasoning) modes, tool calling, and 30+ languages.

Cogito v1 Preview Llama 3B has a 128K-token context window.

Cogito v1 Preview Llama 3B input tokens at $0.1/1M, output at $0.1/1M.

Capabilities

ReasoningFunction CallingTool UseStructured Outputs

Rankings

Specifications

FamilyCogito
Released2025-04-08
Parameters3B
Context128K
ArchitectureDecoder Only
Specializationgeneral
LicenseLlama 3 Community
Trainingfinetuned

Created by

Building general superintelligence through advanced reasoning and iterative self-improvement.

San Francisco, California, United States
Founded 2024
Website

Providers(1)