Using Llama 4 Maverick 17B Instruct FP8 on Novita AI
Implementation guide · Llama 4 · AI at Meta
ServerlessOpen Source
Quick Start
- 1
- 2Use the Novita AI SDK or REST API to call
llama-4-maverick-17b-128e-instruct-fp8. - 3
Code Examples
Code examples for this provider have not been sourced yet.
About Novita AI
Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.
Pricing on Novita AI
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.27 |
| Output tokens | $0.85 |
Capabilities
Structured Outputs
About Llama 4 Maverick 17B Instruct FP8
Meta's Llama 4 Maverick 17B with 128 experts, FP8-optimized for cost-efficient inference. Supports native Model Router integration on Microsoft Foundry.
Model Specs
Released2025-04-05
Parameters17B
Context1M
ArchitectureMixture of Experts
Knowledge cutoff2024-08