DyaDiT: A Multi-Modal Diffusion Transformer for Socially Favorable Dyadic Gesture Generation Paper • 2602.23165 • Published 10 days ago • 2
MIBURI: Towards Expressive Interactive Gesture Synthesis Paper • 2603.03282 • Published 5 days ago • 3
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published 6 days ago • 135
When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains Paper • 2603.01301 • Published 6 days ago • 8
LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation Paper • 2602.11451 • Published 24 days ago • 15
EasyV2V: A High-quality Instruction-based Video Editing Framework Paper • 2512.16920 • Published Dec 18, 2025 • 18
CRISP: Contact-Guided Real2Sim from Monocular Video with Planar Scene Primitives Paper • 2512.14696 • Published Dec 16, 2025 • 8
FastHMR: Accelerating Human Mesh Recovery via Token and Layer Merging with Diffusion Decoding Paper • 2510.10868 • Published Oct 13, 2025 • 12
PickStyle: Video-to-Video Style Transfer with Context-Style Adapters Paper • 2510.07546 • Published Oct 8, 2025 • 22