LLM ReferenceLLM Reference

Llama 3 8B Instruct

llama3-8b-instruct

Open Source

About

The Llama 3 8B Instruct model, released on April 18, 2024, is Meta's latest instruction-following language model with 8 billion parameters. It utilizes an auto-regressive transformer architecture with Grouped-Query Attention for improved scalability. Trained on over 15 trillion tokens and fine-tuned with 10 million human-annotated examples, it excels in dialogue and conversational tasks. The model outperforms its predecessors on industry benchmarks, scoring 68.4 on MMLU (5-shot). Designed for commercial and research applications, it prioritizes safety and helpfulness, making it suitable for chatbots, virtual assistants, and other interactive AI applications. For more details, visit the Hugging Face page [1].

Llama 3 8B Instruct has a 8K-token context window.

Llama 3 8B Instruct input tokens at $0.03/1M, output at $0.04/1M.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

Providers(17)

Compare all →
ProviderInput (per 1M)Output (per 1M)Type
AWS Bedrock$0.3$0.6Serverless
DeepInfra$0.05$0.15Serverless
OctoAI API$0.15$0.15Serverless
Fireworks AI$0.2$0.2Serverless
Alibaba Cloud PAI-EASServerless
Baseten APIServerless
Lepton AI API$0.07$0.07Serverless
GCP Vertex AI$0.12$0.36Serverless
Cloudflare Workers AIServerless
NVIDIA NIMProvisioned
Together AI$0.18$0.18Serverless
Perplexity Labs$0.20$0.20Serverless
Databricks Foundation Model ServingProvisioned
IBM watsonx$0.6$0.6Serverless
Microsoft Foundry$0.37$1.1ServerlessProvisioned
OpenRouter$0.03$0.04Serverless
Replicate API$0.05$0.25Serverless

Benchmark Scores(6)

BenchmarkScoreVersionSource
Google-Proof Q&A44.8diamondresearch
HellaSwag91.110-shotresearch
HumanEval68.2pass@1research
Massive Multitask Language Understanding76.95-shotresearch
Instruction-Following Evaluation59.5v2https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
MMLU PRO40.5https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro

Rankings

Specifications

FamilyLlama 3
Released2024-04-18
Parameters8B
Context8K
ArchitectureDecoder Only
Specializationgeneral
Trainingfinetuned

Created by

Large-scale open-source AI for social technologies.

Menlo Park, California, United States
Founded 2013
Website