LLM Reference

Llama 3 Sonar Small 32K Chat

llama-3-sonar-small-32k-chat

Deprecated
Open Source

About

The Llama 3 Sonar Small 32K Chat model by Perplexity AI is a large language model optimized for chat applications. It stands out for its cost-effectiveness, speed, and enhanced performance compared to earlier Sonar models. This model supports a context window of 32,000 tokens, allowing it to sustain lengthy conversation histories, although some users note it might not fully utilize this memory capacity. It targets use in conversational AI environments such as chatbots and virtual assistants, providing coherent and contextually aware responses. Despite its relatively smaller size within the Llama 3 lineup, the model ensures a balance between performance and resource efficiency. However, like other LLMs, it can sometimes deliver inaccurate or outdated information, making independent verification essential.

Llama 3 Sonar Small 32K Chat has a 32K-token context window. Input and output tokens are both priced at $0.20 per 1M.
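At a flat $0.20 per 1M tokens in each direction, estimating the cost of a request is simple arithmetic. A minimal sketch (the function name and defaults are illustrative, not part of any official SDK):

```python
def cost_usd(input_tokens: int, output_tokens: int,
             input_rate: float = 0.20, output_rate: float = 0.20) -> float:
    """Estimated request cost in USD, with rates given per 1M tokens."""
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000

# e.g. a prompt filling the full 32K-token context, plus a 1K-token reply:
print(round(cost_usd(32_000, 1_000), 6))  # → 0.0066
```

So even a maximally long conversation history costs well under a cent per request at these rates.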

Capabilities

Vision · Multimodal · Reasoning · Function Calling · Tool Use · Structured Outputs · Code Execution

Providers (1)

Provider | Input (per 1M) | Output (per 1M) | Type
Perplexity Labs | $0.20 | $0.20 | Serverless
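Perplexity served this model through an OpenAI-compatible chat completions API. A minimal sketch of the request body (the endpoint URL, field names, and system prompt here are assumptions based on that convention, not taken from this page):

```python
import json

def build_chat_request(user_message: str,
                       model: str = "llama-3-sonar-small-32k-chat",
                       max_tokens: int = 512) -> dict:
    # Request body in the OpenAI-compatible chat-completions shape
    # used by Perplexity's serverless API for the Sonar chat models.
    return {
        "model": model,
        "max_tokens": max_tokens,
        "messages": [
            {"role": "system", "content": "Be concise and accurate."},
            {"role": "user", "content": user_message},
        ],
    }

# POST this as JSON to https://api.perplexity.ai/chat/completions
# with an "Authorization: Bearer <API key>" header.
payload = build_chat_request("Summarize the Llama 3 Sonar model family.")
print(json.dumps(payload, indent=2))
```

Because the shape matches the OpenAI chat format, existing OpenAI client libraries could typically be pointed at Perplexity's base URL without code changes.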

Specifications

Family: Sonar
Released: 2024-05-05
Parameters: 8B
Context: 32K
Architecture: Decoder Only
Knowledge cutoff: 2024-03
Specialization: general
Training: finetuned

Created by

Perplexity AI: developing AI for complex problem-solving.

San Francisco, California, United States
Founded 2022
