Yexin Liu

AIPeanutman

AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 10 days ago

Rethinking the Divergence Regularization in LLM RL

Paper • 2606.09821 • Published 12 days ago • 33

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Paper • 2606.11025 • Published 11 days ago • 41

published a dataset 22 days ago

AIPeanutman/OmitI2V

Updated 24 days ago • 19

updated a dataset 24 days ago

AIPeanutman/OmitI2V

Updated 24 days ago • 19

updated a dataset about 2 months ago

AIPeanutman/dREPA_collections

Updated Apr 26 • 2.94k • 1

published a dataset about 2 months ago

AIPeanutman/dREPA_collections

Updated Apr 26 • 2.94k • 1

upvoted 3 papers 3 months ago

Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

Paper • 2603.29620 • Published Mar 31 • 48

MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data

Paper • 2603.25319 • Published Mar 26 • 32

Manifold-Aware Exploration for Reinforcement Learning in Video Generation

Paper • 2603.21872 • Published Mar 23 • 34

authored a paper 3 months ago

Learning Latent Proxies for Controllable Single-Image Relighting

Paper • 2603.15555 • Published Mar 16 • 8

upvoted a paper 3 months ago

Learning Latent Proxies for Controllable Single-Image Relighting

Paper • 2603.15555 • Published Mar 16 • 8

upvoted a paper 4 months ago

LoopViT: Scaling Visual ARC with Looped Transformers

Paper • 2602.02156 • Published Feb 2 • 12

authored 8 papers 6 months ago

Efficient Multimodal Learning from Data-centric Perspective

Paper • 2402.11530 • Published Feb 18, 2024 • 1

Efficient Multimodal Large Language Models: A Survey

Paper • 2405.10739 • Published May 17, 2024

Seeing Clearly, Answering Incorrectly: A Multimodal Robustness Benchmark for Evaluating MLLMs on Leading Questions

Paper • 2406.10638 • Published Jun 15, 2024

MMRC: A Large-Scale Benchmark for Understanding Multimodal Large Language Model in Real-World Conversation

Paper • 2502.11903 • Published Feb 17, 2025

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published Jun 23, 2025 • 79

When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding

Paper • 2506.05551 • Published Jun 5, 2025 • 5

TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis

Paper • 2508.13618 • Published Aug 19, 2025 • 19

Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models

Paper • 2504.03140 • Published Apr 4, 2025
