MOSS-Audio 4B Instruct on Hugging Face Inference Endpoints
MOSS-Audio · MOSI Intelligence
Last refreshed 2026-06-04. Next refresh: weekly.
Why use MOSS-Audio 4B Instruct on Hugging Face Inference Endpoints?
Hugging Face Inference Endpoints offers MOSS-Audio 4B Instruct with competitive pricing. Hugging Face is a leading AI community and platform dedicated to democratizing artificial intelligence.
Setup recipe
Docs fallbackUse the provider REST API or SDKCreate a provider API keymodel: OpenMOSS-Team/MOSS-Audio-4B-InstructOpenMOSS-Team/MOSS-Audio-4B-InstructRequest example
OpenMOSS-Team/MOSS-Audio-4B-Instruct.Gotchas
- Use provider model ID "OpenMOSS-Team/MOSS-Audio-4B-Instruct", not the LLMReference slug "moss-audio-4b-instruct".
Capabilities
About MOSS-Audio 4B Instruct
MOSS-Audio 4B Instruct is the instruction-following 4.6B variant of MOSI Intelligence and OpenMOSS Team's open-weight audio understanding model. It combines a MOSS-Audio encoder with a Qwen3-4B language backbone for speech, environmental sound, music, captioning, time-aware question answering, timestamped ASR, and audio-grounded reasoning.
FAQ
What API model ID do I use for MOSS-Audio 4B Instruct on Hugging Face Inference Endpoints?
Use the model ID OpenMOSS-Team/MOSS-Audio-4B-Instruct when calling Hugging Face Inference Endpoints's API.
Who created MOSS-Audio 4B Instruct?
MOSS-Audio 4B Instruct was created by MOSI Intelligence as part of the MOSS-Audio model family.
Is MOSS-Audio 4B Instruct open source?
MOSS-Audio 4B Instruct is open source under Apache 2.0 according to the seed data.