26 8

sherry

rain305

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Kwai Keye-VL-2.0 Technical Report

upvoted a paper 8 days ago

Vision-OPD: Learning to See Fine Details for Multimodal LLMs via On-Policy Self-Distillation

upvoted a paper 8 days ago

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

View all activity

Organizations

None yet

upvoted a paper 5 days ago

Kwai Keye-VL-2.0 Technical Report

Paper • 2606.10651 • Published 10 days ago • 185

upvoted 4 papers 8 days ago

Vision-OPD: Learning to See Fine Details for Multimodal LLMs via On-Policy Self-Distillation

Paper • 2605.18740 • Published May 18 • 5

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

Paper • 2606.05922 • Published 14 days ago • 52

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Paper • 2606.11025 • Published 10 days ago • 41

SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning

Paper • 2606.10804 • Published 10 days ago • 43

upvoted a paper 2 months ago

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published Apr 13 • 102

liked a model 2 months ago

Skywork/Skywork-UniPic-1.5B

Any-to-Any • Updated Sep 8, 2025 • 28 • 116

upvoted a paper 2 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 110

liked a model 3 months ago

jdopensource/JoyAI-Image-Edit

Image-to-Image • Updated May 7 • 241 • 127

upvoted a paper 3 months ago

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published Mar 26 • 155

liked a model 3 months ago

CodeGoat24/UniGenBench-EvalModel-qwen-72b-v1

Image-Text-to-Text • 73B • Updated Oct 25, 2025 • 154 • 4

upvoted a paper 3 months ago

Evaluating and Steering Modality Preferences in Multimodal Large Language Model

Paper • 2505.20977 • Published May 27, 2025 • 10

upvoted an article 3 months ago

Article

NEO-unify: Building Native Multimodal Unified Models End to End

sensenova

•

Mar 5

• 165

upvoted 2 papers 3 months ago

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 198

CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation

Paper • 2603.08652 • Published Mar 9 • 41

liked a dataset 3 months ago

LanguageBind/UniWorld-V1

Viewer • Updated Jun 16, 2025 • 7.11k • 3.64k • 26

upvoted 4 papers 3 months ago

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published May 6, 2025 • 94

sherry

AI & ML interests

Recent Activity

Organizations

rain305's activity

NEO-unify: Building Native Multimodal Unified Models End to End