LLM Reference

Llama 2 70B Chat

About

Llama 2 70B Chat is a large-scale language model with 70 billion parameters, designed for conversational AI applications. Released on July 18, 2023, it's part of Meta's Llama 2 family, featuring advanced transformer architecture optimized through supervised fine-tuning and reinforcement learning with human feedback. The model excels in generating human-like responses, outperforming many open-source alternatives and rivaling closed-source models like ChatGPT. Trained on 2 trillion tokens from diverse public sources, it's suitable for commercial and research applications in English, particularly for assistant-like functionalities. The model is available on Hugging Face for further exploration and implementation .

Capabilities

MultimodalFunction CallingTool UseJSON Mode

Providers(14)

ProviderInput (per 1M)Output (per 1M)Type
Databricks Foundation Model Serving$0.5$1.5
Serverless
Azure OpenAI$1.54$1.77
Serverless
Provisioned
GCP Vertex AI
Serverless
Alibaba Cloud PAI-EAS
Serverless
Replicate API
Serverless
AWS Bedrock$1.95$2.56
Serverless
Snowflake Cortex$0.9$0.9
Serverless
OCI Generative AI
Serverless
NVIDIA NIM
Provisioned
deepinfra API
Serverless
Lepton AI API
Serverless
Together AI API$0.9$0.9
Serverless
IBM watsonx$1.8$1.8
Serverless
Scale AI GenAI Platform
Serverless

Specifications

FamilyLlama 2
Released2023-07-18
Parameters70B
Context4K
ArchitectureDecoder Only
Specializationgeneral