LLM ReferenceLLM Reference
GCP Vertex AI

Using Llama 3.1 405B Instruct on GCP Vertex AI

Implementation guide · Llama 3.1 · AI at Meta

ServerlessOpen Source

Quick Start

  1. 1
    Create an account at GCP Vertex AI and generate an API key.
  2. 2
    Use the GCP Vertex AI SDK or REST API to call llama3.1-405b-instruct — see the documentation for request format.
  3. 3
    You'll be billed $5.00/1M input, $16.00/1M output tokens. See full pricing.

About GCP Vertex AI

Google Cloud Vertex AI is a comprehensive machine learning platform that provides end-to-end solutions for developing, deploying, and managing AI models. The platform offers a unified interface that integrates various tools and services, enabling users to efficiently handle the entire machine learning lifecycle. Key features include AutoML capabilities for building custom models with minimal coding, a managed notebook environment for prototyping, and robust MLOps tools for model monitoring and versioning. Vertex AI supports both pre-trained models and custom training, making it versatile for a wide range of applications such as natural language processing, image recognition, and predictive analytics. The platform's design focuses on increasing productivity and accelerating time-to-market for AI solutions. By consolidating multiple AI tools into a single ecosystem, Vertex AI reduces manual effort and enhances collaboration among data scientists and engineers. Its scalable architecture allows organizations to efficiently manage large datasets and complex models, while the pay-as-you-go pricing model makes it accessible for businesses of all sizes. Additionally, Vertex AI's integration with popular open-source frameworks like TensorFlow and PyTorch enables users to leverage existing models and tools, fostering innovation and facilitating the development of customized AI applications tailored to specific business needs.

Google for Startups is not primarily an AI platform itself, but rather a comprehensive support program that connects startups with Google's resources, including AI technologies. The program aims to help startups leverage AI and other innovative technologies to solve important challenges and grow their businesses. Key aspects of Google for Startups' AI-related offerings include: 1. Access to Google's AI and machine learning tools and technologies 2. Mentorship and guidance on implementing AI solutions 3. Specialized programs like the Growth Academy: AI for Health, which supports startups using AI in healthcare and wellbeing 4. Funding opportunities for AI-focused startups, particularly those led by underrepresented founders 5. Networking and knowledge-sharing events centered on AI applications in various industries While not exclusively focused on AI, Google for Startups recognizes the transformative potential of AI and actively supports startups in integrating these technologies into their products and services to drive innovation and growth.

Pricing on GCP Vertex AI

TypePrice (per 1M)
Input tokens$5.00
Output tokens$16.00

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

About Llama 3.1 405B Instruct

Llama 3.1 405B Instruct is Meta's advanced large language model released on July 23, 2024, featuring 405 billion parameters. It utilizes an optimized transformer architecture with supervised fine-tuning and reinforcement learning for enhanced instruction-following capabilities. The model supports multiple languages, was trained on 15 trillion tokens, and fine-tuned with 25 million synthetic examples. It excels in multilingual dialogue and text generation, making it ideal for assistant-like applications. Llama 3.1 incorporates robust safety measures and ethical considerations, outperforming many existing models on various industry benchmarks. AI engineers can access the model via its Hugging Face page for implementation in diverse NLP tasks.

Model Specs

Released2024-07-23
Parameters405B
Context128K
ArchitectureDecoder Only

Provider

GCP Vertex AI
GCP Vertex AI

Google Cloud Platform (GCP)

Mountain View, California, United States