Japanese Stable VLM
About
Japanese Stable VLM by Stability AI is a vision-language model that generates Japanese descriptions of images and answers questions about them. It uses an auto-regressive architecture and was trained on a variety of public datasets for tasks such as image captioning and visual question answering. The model is tuned to follow instructions and respond in natural-sounding Japanese, and it is available for commercial use, unlike some of Stability AI's earlier models. Users should account for potential biases and limitations inherited from the training data.
Capabilities
Multimodal, Function Calling, Tool Use, JSON Mode
Providers (1)
| Provider | Input (per 1M tokens) | Output (per 1M tokens) | Type |
|---|---|---|---|
| Fireworks AI Platform | — | — | Provisioned |