Text-to-images
updated
Training-Free Consistent Text-to-Image Generation
Paper
• 2402.03286
• Published
• 67
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
Paper
• 2402.04324
• Published
• 26
λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion
Models by Leveraging CLIP Latent Space
Paper
• 2402.05195
• Published
• 19
FiT: Flexible Vision Transformer for Diffusion Model
Paper
• 2402.12376
• Published
• 48
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image
Generation
Paper
• 2402.11929
• Published
• 11
Paper
• 2402.13144
• Published
• 100
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable
Virtual Try-on
Paper
• 2403.01779
• Published
• 30
StableDrag: Stable Dragging for Point-based Image Editing
Paper
• 2403.04437
• Published
• 27
FlashFace: Human Image Personalization with High-fidelity Identity
Preservation
Paper
• 2403.17008
• Published
• 22
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image
Synthesis
Paper
• 2404.13686
• Published
• 29
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
Paper
• 2404.14507
• Published
• 23
Editable Image Elements for Controllable Synthesis
Paper
• 2404.16029
• Published
• 12
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video
Generation
Paper
• 2405.01434
• Published
• 56
Customizing Text-to-Image Models with a Single Image Pair
Paper
• 2405.01536
• Published
• 22
Stylus: Automatic Adapter Selection for Diffusion Models
Paper
• 2404.18928
• Published
• 15
DressCode: Autoregressively Sewing and Generating Garments from Text
Guidance
Paper
• 2401.16465
• Published
• 12
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation
Paper
• 2404.19427
• Published
• 74
MotionLCM: Real-time Controllable Motion Generation via Latent
Consistency Model
Paper
• 2404.19759
• Published
• 27
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Paper
• 2404.18212
• Published
• 30
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with
Fine-Grained Chinese Understanding
Paper
• 2405.08748
• Published
• 23
Compositional Text-to-Image Generation with Dense Blob Representations
Paper
• 2405.08246
• Published
• 17
CAT3D: Create Anything in 3D with Multi-View Diffusion Models
Paper
• 2405.10314
• Published
• 47
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Paper
• 2406.04333
• Published
• 38
Step-aware Preference Optimization: Aligning Preference with Denoising
Performance at Each Step
Paper
• 2406.04314
• Published
• 30
Autoregressive Model Beats Diffusion: Llama for Scalable Image
Generation
Paper
• 2406.06525
• Published
• 71