llmreference
Databricks Foundation Model Serving

Using Llama 3 70B Instruct on Databricks Foundation Model Serving

Implementation guide · Llama 3 · AI at Meta

ServerlessOpen Source

Quick Start

  1. 1
    Create an account at Databricks Foundation Model Serving and generate an API key.
  2. 2
    Use the Databricks Foundation Model Serving SDK or REST API to call llama3-70b-instruct — see the documentation for request format.
  3. 3
    You'll be billed $1.00/1M input, $3.00/1M output tokens. See full pricing.

Code Examples

About Databricks Foundation Model Serving

Databricks offers a comprehensive AI platform that integrates a lakehouse model, combining the flexibility of data lakes with the management capabilities of data warehouses. The platform features a natural language interface for conversational data querying, automated infrastructure management for optimized performance, and robust governance tools ensuring data privacy and compliance. It supports a wide range of functionalities including data engineering, real-time streaming, and a marketplace for data sharing, while enabling seamless collaboration among data scientists, engineers, and DevOps teams . The platform's capabilities extend to advanced machine learning operations (MLOps), facilitating the entire lifecycle of AI model development. It includes built-in support for popular libraries like TensorFlow and PyTorch, tools for monitoring data quality and model performance, and automated workflows for building production-ready ETL pipelines. The platform also integrates with large language models (LLMs) for generative AI applications, emphasizing cost efficiency and ease of use. This comprehensive suite of tools empowers organizations to effectively leverage AI while maintaining control over their data and models .

Databricks offers a comprehensive Data Intelligence Platform that unifies data, analytics, and AI capabilities. Their platform, known as the Databricks Lakehouse, combines the best features of data lakes and data warehouses, enabling organizations to handle large-scale data processing, analytics, and machine learning workloads in a single, unified environment. Key features of Databricks' AI platform include: 1. Apache Spark integration: As the creators of Apache Spark, Databricks provides optimized performance for big data processing and analytics. 2. Delta Lake: An open-source storage layer that brings reliability to data lakes, ensuring data quality and consistency. 3. MLflow: An open-source platform for managing the machine learning lifecycle, including experimentation, reproducibility, and deployment. 4. Collaborative notebooks: Interactive environments for data scientists and analysts to work together on data exploration, model development, and visualization. 5. AutoML: Automated machine learning capabilities to streamline the model development process. 6. Generative AI support: Tools and frameworks for developing and deploying generative AI models. 7. Data governance: Unity Catalog provides centralized governance and security controls across the entire data estate. 8. Scalable infrastructure: Cloud-native architecture that allows for elastic scaling of compute resources. Databricks' platform is designed to democratize data and AI, making it accessible to organizations of all sizes. It's used by over 10,000 organizations worldwide, including more than 50% of the Fortune 500 companies, for various use cases such as data engineering, machine learning, and business analytics.

Pricing on Databricks Foundation Model Serving

TypePrice (per 1M)
Input tokens$1.00
Output tokens$3.00

Capabilities

Structured Outputs

About Llama 3 70B Instruct

The Llama 3 70B Instruct model is a large language model with 70 billion parameters, released by Meta on April 18, 2024. It's an instruction-tuned variant optimized for conversational applications, utilizing an advanced auto-regressive transformer architecture. The model excels in following instructions and engaging in dialogue, having been trained on over 15 trillion tokens with a December 2023 knowledge cutoff. It demonstrates superior performance on industry benchmarks, scoring 82.0 on the MMLU (5-shot) test. The model incorporates extensive safety measures and optimizations, including RLHF, to enhance helpfulness and reduce harmful content generation. For more details, visit the model's Hugging Face page [1].

Model Specs

Released2024-04-18
Parameters70B
Context8K
ArchitectureDecoder Only
Knowledge cutoff2023-12

More Models on Databricks Foundation Model Serving

Provider

Databricks Foundation Model Serving
Databricks Foundation Model Serving

Databricks

San Francisco, California, United States