LLM Reference

Sonar Models by Perplexity Labs

Perplexity LabsProprietaryProprietaryHighlight
6 models2025Up to 200k ctxFrom $1/1M input

Details

ResearcherPerplexity Labs
LicenseProprietary
Commercial useCommercial use with conditions
Models6
Released2025
Max context200k

Capabilities

Structured OutputsAll models

Links

Website

About

The Sonar family of Large Language Models (LLMs), developed by Perplexity AI, marks a notable progression in terms of cost-efficiency, speed, and performance compared to predecessors like PPLX and Mixtral 5. These models are distinct for their ability to provide real-time internet access and deliver up-to-date information, which is a significant improvement over traditional LLMs 5. The Sonar family comprises different model configurations, such as "small" and "large" models, each tailored for specific tasks and featuring varying context window lengths 67. Built upon the foundation of the Llama 3.1 model, Sonar models are further refined with Perplexity's proprietary search capabilities to enhance accuracy and relevance 10. They offer both online and chat versions, catering to a wide range of applications that require either rapid responses or more extended conversational interactions 57. Additionally, these models are available through APIs, facilitating their integration into various applications seamlessly 67.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

5 in view1 retired

Use when the workload needs 200k context and structured outputs.

2025-11200k contextstructured outputs

Use when the workload needs 128k context and structured outputs.

2025-03128k contextstructured outputs

Use when the workload needs 128k context and structured outputs.

2025-03128k contextstructured outputs
SonarCurrent

Use when the workload needs 127k context and structured outputs.

2025-01127k contextstructured outputs
Sonar ProCurrent

Use when the workload needs 200k context and structured outputs.

2025-01200k contextstructured outputs

Release Timeline

3 release groups
2025-11
1 current
Sonar Pro Search
200k contextstructured outputs
Current
2025-03
2 current
Sonar Deep Research
128k contextstructured outputs
Current
Sonar Reasoning Pro
128k contextstructured outputs
Current
2025-01
2 current · 1 retired
Sonar
127k contextstructured outputs
Current
Sonar Pro
200k contextstructured outputs
Current
Sonar Reasoning
127k contextstructured outputs
Replaced

Replaced By

Keep for legacy integrations; evaluate Sonar Reasoning Pro before new work.

Specifications(6 models)

Sonar model specifications comparison
ModelReleasedContextStructured Outputs
Sonar Pro Search2025-11200kYes
Sonar Deep Research2025-03128kYes
Sonar Reasoning Pro2025-03128kYes
Sonar2025-01127kYes
Sonar Pro2025-01200kYes

Pricing

Sonar model pricing by provider
ModelProviderInput / 1MOutput / 1MType
SonarOpenRouter$1$1Serverless
SonarPerplexity Labs$1$1Serverless
Sonar Reasoning ProOpenRouter$2$8Serverless
Sonar Deep ResearchOpenRouter$2$8Serverless
Sonar Reasoning ProPerplexity Labs$2$8Serverless
Sonar Deep ResearchPerplexity Labs$2$8Serverless
Sonar ProOpenRouter$3$15Serverless
Sonar Pro SearchOpenRouter$3$15Serverless
Sonar ProPerplexity Labs$3$15Serverless
Sonar Pro SearchPerplexity Labs$3$15Serverless

Frequently Asked Questions

What is Sonar used for?
Sonar is used for structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
How does Sonar compare to Claude Fable?
Sonar by Perplexity Labs is strongest where you need structured outputs, while Claude Fable by Anthropic is the closest related family to check for vision and multimodal work. Sonar has 6 listed variants and reaches up to 200k context, while Claude Fable reaches up to 1m context, so compare the specs and pricing tables before choosing a production model.
Which Sonar model should I use?
For the lowest listed input price, start with Sonar through Perplexity Labs at $1/1M input tokens. For the most capable/latest local choice, evaluate Sonar Pro Search with 200k context and structured outputs.