Toggle light / dark theme

Pixart-σ weak-to-strong training of diffusion transformer for 4K text-to-image generation.

PixArt-Σ

Weak-to-strong training of diffusion transformer for 4K text-to-image generation.

In this paper, we introduce PixArt-Sigma, a Diffusion Transformer model~(DiT) capable of directly generating images at 4K resolution.


Join the discussion on this paper page.