Planning Generated Images In Stages: Meta improves image models by plotting and revising generations step-by-step
Text-to-image generators that use diffusion or flow-matching typically compose a whole image at once (although they refine the whole image in steps).