DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI Paper • 2512.16676 • Published 20 days ago • 202
End-to-End Training for Autoregressive Video Diffusion via Self-Resampling Paper • 2512.15702 • Published 21 days ago • 14
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published Nov 27, 2025 • 224
PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design Paper • 2512.04082 • Published Dec 3, 2025 • 13
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published Dec 4, 2025 • 167
UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning Paper • 2510.20286 • Published Oct 23, 2025 • 23
MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning Paper • 2510.14265 • Published Oct 16, 2025 • 19
Autoregressive Images Watermarking through Lexical Biasing: An Approach Resistant to Regeneration Attack Paper • 2506.01011 • Published Jun 1, 2025 • 9
EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering Paper • 2505.24417 • Published May 30, 2025 • 13
MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios Paper • 2505.21333 • Published May 27, 2025 • 38
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data Paper • 2505.18445 • Published May 24, 2025 • 63
RepText: Rendering Visual Text via Replicating Paper • 2504.19724 • Published Apr 28, 2025 • 31
EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer Paper • 2503.07027 • Published Mar 10, 2025 • 29
PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data Paper • 2502.14397 • Published Feb 20, 2025 • 41
MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation Paper • 2502.01572 • Published Feb 3, 2025 • 21
Any2AnyTryon: Leveraging Adaptive Position Embeddings for Versatile Virtual Clothing Tasks Paper • 2501.15891 • Published Jan 27, 2025 • 16
Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis Paper • 2410.07155 • Published Oct 9, 2024 • 11
IPDreamer: Appearance-Controllable 3D Object Generation with Image Prompts Paper • 2310.05375 • Published Oct 9, 2023 • 3