llmreference
Databricks Foundation Model Serving

Using Llama 3.1 8B Instruct on Databricks Foundation Model Serving

Implementation guide · Llama 3.1 · AI at Meta

ProvisionedOpen Source

Quick Start

  1. 1
    Create an account at Databricks Foundation Model Serving and generate an API key.
  2. 2
    Use the Databricks Foundation Model Serving SDK or REST API to call llama3.1-8b-instruct — see the documentation for request format.

Code Examples

About Databricks Foundation Model Serving

Databricks offers a comprehensive AI platform that integrates a lakehouse model, combining the flexibility of data lakes with the management capabilities of data warehouses. The platform features a natural language interface for conversational data querying, automated infrastructure management for optimized performance, and robust governance tools ensuring data privacy and compliance. It supports a wide range of functionalities including data engineering, real-time streaming, and a marketplace for data sharing, while enabling seamless collaboration among data scientists, engineers, and DevOps teams . The platform's capabilities extend to advanced machine learning operations (MLOps), facilitating the entire lifecycle of AI model development. It includes built-in support for popular libraries like TensorFlow and PyTorch, tools for monitoring data quality and model performance, and automated workflows for building production-ready ETL pipelines. The platform also integrates with large language models (LLMs) for generative AI applications, emphasizing cost efficiency and ease of use. This comprehensive suite of tools empowers organizations to effectively leverage AI while maintaining control over their data and models .

Databricks offers a comprehensive Data Intelligence Platform that unifies data, analytics, and AI capabilities. Their platform, known as the Databricks Lakehouse, combines the best features of data lakes and data warehouses, enabling organizations to handle large-scale data processing, analytics, and machine learning workloads in a single, unified environment. Key features of Databricks' AI platform include: 1. Apache Spark integration: As the creators of Apache Spark, Databricks provides optimized performance for big data processing and analytics. 2. Delta Lake: An open-source storage layer that brings reliability to data lakes, ensuring data quality and consistency. 3. MLflow: An open-source platform for managing the machine learning lifecycle, including experimentation, reproducibility, and deployment. 4. Collaborative notebooks: Interactive environments for data scientists and analysts to work together on data exploration, model development, and visualization. 5. AutoML: Automated machine learning capabilities to streamline the model development process. 6. Generative AI support: Tools and frameworks for developing and deploying generative AI models. 7. Data governance: Unity Catalog provides centralized governance and security controls across the entire data estate. 8. Scalable infrastructure: Cloud-native architecture that allows for elastic scaling of compute resources. Databricks' platform is designed to democratize data and AI, making it accessible to organizations of all sizes. It's used by over 10,000 organizations worldwide, including more than 50% of the Fortune 500 companies, for various use cases such as data engineering, machine learning, and business analytics.

Pricing on Databricks Foundation Model Serving

Capabilities

Structured Outputs

About Llama 3.1 8B Instruct

The Llama 3.1 8B Instruct model, released on July 23, 2024, is a multilingual large language model with 8 billion parameters, optimized for instruction-following tasks. It features an enhanced transformer architecture, supporting languages like English, German, French, and others. The model excels in dialogue applications, having been fine-tuned using supervised fine-tuning and reinforcement learning with human feedback. Trained on approximately 15 trillion tokens with a December 2023 data cutoff, it outperforms many existing open-source and closed chat models in various benchmarks. Ideal for commercial and research applications such as conversational agents and content generation, the model can be accessed on Hugging Face .

Model Specs

Released2024-07-23
Parameters8B
Context128K
ArchitectureDecoder Only
Knowledge cutoff2023-12

Provider

Databricks Foundation Model Serving
Databricks Foundation Model Serving

Databricks

San Francisco, California, United States