Last refreshed 2026-05-01. Next refresh: weekly.
Why use Fuyu-8B on NVIDIA NIM?
NVIDIA NIM offers Fuyu-8B with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.
Setup recipe
Docs fallbackUse the provider REST API or SDKCreate a provider API keymodel: fuyu-8bfuyu-8bRequest example
fuyu-8b.Gotchas
No curated gotchas have been sourced for this exact provider/model route yet.
Pricing
| Type | Rate |
|---|---|
| GPU Hour Rate | $1.00/GPU·hr |
| GPU Config | 1xH100 |
Capabilities
No model capability flags are currently sourced.
About Fuyu-8B
Fuyu-8B, developed by Adept AI, is a sophisticated multimodal large language model that excels in both text and image processing. It employs a streamlined decoder-only transformer architecture, allowing it to integrate image patches directly into its layers, effectively handling images of any resolution without complex training stages. Notably, Fuyu-8B can tackle a wide array of tasks, from visual question answering and image captioning to document understanding and optical character recognition. Despite its capabilities, it has certain limitations, such as challenges with generating faces and potential biases. The model's design prioritizes speed and real-time application suitability, with some versions available as open-source under specific licenses 12.
FAQ
Who created Fuyu-8B?
Fuyu-8B was created by Adept AI as part of the Fuyu model family.
Is Fuyu-8B open source?
Fuyu-8B's open source status is unknown in the seed data.