PixArt-Σ
pixart-sigma
Open SourceMultimodal
About
PixArt-Σ (Sigma) by Huawei Noah's Ark Lab. A 600M-parameter Diffusion Transformer capable of generating images at up to 4K (3840×2560) resolution without an upscaler. Uses 'weak-to-strong training' on higher-quality data. Competitive with Adobe Firefly 2, Imagen 2, and DALL-E 3 at a fraction of the parameter count. Open source under Apache 2.0. Available via Fal API (fal-ai/pixart-sigma).
Capabilities
VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode ExecutionPrompt CachingBatch APIAudioFine-tuning