LLM Reference
Replicate API

Llama 3 70B on Replicate API

Llama 3 · AI at Meta

Serverless

Pricing

TypePrice (per 1M)
Input tokens$0.65
Output tokens$2.75

Capabilities

VisionMultimodalReasoningFunction CallingTool UseJSON ModeCode Execution

About Llama 3 70B

The Llama 3 70B model is a state-of-the-art large language model with 70 billion parameters, released by Meta on April 18, 2024. It's based on an auto-regressive transformer architecture and has been optimized for dialogue applications using supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF). The model supports an 8,000-token context length and has been trained on over 15 trillion tokens from public online sources. It excels in tasks such as conversational AI, text generation, and natural language understanding, outperforming many existing open-source chat models on industry benchmarks. The model is designed with a focus on safety and helpfulness, making it suitable for both commercial and research applications, particularly in English. For more details, visit the Hugging Face link .

Get Started

Model Specs

Released2024-04-18
Parameters70B
Context8K
ArchitectureDecoder Only