LLM Reference

Cogito v1 Preview Llama 8B

Released
2025-04-08
Last refreshed
2026-06-29
Status
Researched 56d ago
Open weightsCommercial use: conditionalRAGAgentsLong contextClassificationJSON / Tool use

Cogito v1 Preview Llama 8B is worth evaluating for rag, agents, and long context when its provider route and context window match the workload.

Use it for

  • Teams evaluating rag, agents, and long context
  • Workloads that can use a 128k context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Vision or document-understanding workloads
Specifications
Family
Cogito
Released
2025-04-08
Context
128k
Parameters
8B
Architecture
Decoder Only
Knowledge cutoff
2023-12
Specialization
general
Openness
Open weights
License
Llama 3 CommunityCommercial use: conditional
Training
Fine-tuned
Created by

Building general superintelligence through advanced reasoning and iterative self-improvement.

San Francisco, California, United States
Founded 2024
Website
Pricing
Output / 1M
$0.200
Input / 1M
$0.200

Cheapest of 1 route · Fireworks AI

About

Cogito v1 Preview Llama 8B is a hybrid reasoning model fine-tuned from Llama 3.1 8B using Iterated Distillation and Amplification (IDA). Supports direct and extended-thinking modes, tool calling, and 30+ languages.

Cogito v1 Preview Llama 8B is an open-weight model in the Cogito family. The structured metadata tracks a 128k-token context window, reasoning, function calling, tool use, and structured outputs. This page tracks provider routes through Fireworks AI, with the cheapest tracked route listed at $0.2 input and $0.2 output per 1M tokens. No headline benchmark score is tracked for Cogito v1 Preview Llama 8B yet.

Top use-case fit: coding, agents, and build tasks

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
Fireworks AI$0.200$0.200
Serverless

Available via routers & gateways(1)

Capabilities

ReasoningFunction CallingTool UseStructured Outputs

Benchmark peer barsfor RAG

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Frequently asked questions

What is the context window of Cogito v1 Preview Llama 8B?

Cogito v1 Preview Llama 8B has a context window of 128k tokens.

How much does Cogito v1 Preview Llama 8B cost?

Cogito v1 Preview Llama 8B is available at $0.2/1M input tokens through Fireworks AI.

When was Cogito v1 Preview Llama 8B released?

Cogito v1 Preview Llama 8B was released on 2025-04-08.

Which providers offer Cogito v1 Preview Llama 8B?

Cogito v1 Preview Llama 8B is available from 1 provider: Fireworks AI.