LLM Reference

Using Llama 4 Scout 17B-16E Instruct on Novita AI

Implementation guide · Llama 4 · AI at Meta

ServerlessOpen Source

Quick Start

  1. 1
    Create an account at Novita AI and generate an API key.
  2. 2
    Use the Novita AI SDK or REST API to call llama-4-scout-17b-16e-instruct.
  3. 3
    You'll be billed $0.18/1M input, $0.59/1M output tokens. See full pricing.

Code Examples

Code examples for this provider have not been sourced yet.

About Novita AI

Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

Pricing on Novita AI

TypePrice (per 1M)
Input tokens$0.18
Output tokens$0.59

Capabilities

Structured Outputs

About Llama 4 Scout 17B-16E Instruct

Meta's Llama 4 Scout is a 17-billion parameter mixture-of-experts model with 16 expert routing. Optimized for efficient inference on edge and cloud environments with strong multi-turn conversation capabilities. Available on Cloudflare Workers AI.

Model Specs

Released2025-04-05
Parameters17B
Context328K
ArchitectureMixture of Experts
Knowledge cutoff2024-08

Provider

Novita AI