LLM ReferenceLLM Reference

Llama 3.1 8B Instruct

Open Source

About

The Llama 3.1 8B Instruct model, released on July 23, 2024, is a multilingual large language model with 8 billion parameters, optimized for instruction-following tasks. It features an enhanced transformer architecture, supporting languages like English, German, French, and others. The model excels in dialogue applications, having been fine-tuned using supervised fine-tuning and reinforcement learning with human feedback. Trained on approximately 15 trillion tokens with a December 2023 data cutoff, it outperforms many existing open-source and closed chat models in various benchmarks. Ideal for commercial and research applications such as conversational agents and content generation, the model can be accessed on Hugging Face .

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

Providers(12)

Compare all →
ProviderInput (per 1M)Output (per 1M)Type
OctoAI API$0.15$0.15Serverless
Together AI$0.18$0.18Serverless
Fireworks AI$0.2$0.2Serverless
NVIDIA NIMProvisioned
GroqCloud$0.05$0.08Serverless
Microsoft Foundry$0.3$0.61Provisioned
Databricks Foundation Model ServingProvisioned
Hyperbolic AI Inference$0.10$0.10Serverless
OpenRouter$0.02$0.05Serverless
IBM watsonx$0.15$0.5Serverless
AWS BedrockServerless
Replicate API$0.25$0.25Serverless

Benchmark Scores(2)

BenchmarkScoreVersionSource
BFCL25.8v4https://gorilla.cs.berkeley.edu/leaderboard.html
MMLU PRO44.3https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro

Rankings

Specifications

FamilyLlama 3.1
Released2024-07-23
Parameters8B
Context128K
ArchitectureDecoder Only
Specializationgeneral
Trainingfinetuning

Created by

Large-scale open-source AI for social technologies.

Menlo Park, California, United States
Founded 2013
Website