LLM Reference

PixArt Models by Huawei Noah's Ark Lab

Huawei Noah's Ark LabApache 2.0Open source
1 model2024

Details

LicenseApache 2.0OSI-approved
Commercial useCommercial use: permitted
Models1
Released2024

Capabilities

MultimodalAll models

About

Huawei Noah's Ark Lab's PixArt family of diffusion transformer text-to-image models. Known for efficient training and high-quality output — PixArt-α introduced fast DiT training, PixArt-Σ extended to 4K resolution generation with 'weak-to-strong training'.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

1 in view
PixArt-ΣCurrent

Use when the workload needs image generation, 600M parameters, and multimodal inputs.

2024-03image generation600M parametersmultimodal inputs

Release Timeline

1 release group
2024-03
1 current
PixArt-Σ
image generation600M parametersmultimodal inputs
Current

Specifications(1 models)

PixArt model specifications comparison
ModelReleasedParametersMultimodal
PixArt-Σ2024-03600MYes

Frequently Asked Questions

What is PixArt used for?
PixArt is used for image generation and vision and multimodal work. The family description and listed model capabilities point to those workloads as the best fit.
How does PixArt compare to Pangu?
PixArt by Huawei Noah's Ark Lab is strongest where you need image generation, while Pangu by Huawei Noah's Ark Lab is the closest related family to check for coding. PixArt has 1 listed variant, while Pangu reaches up to 128k context, so compare the specs and pricing tables before choosing a production model.
Which PixArt model should I use?
If price is the main constraint, use the pricing table first because PixArt does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate PixArt-Σ with multimodal inputs.