XYX's picture

XYX

xuyd16

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

TIP: Token Importance in On-Policy Distillation

upvoted a paper 2 days ago

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

upvoted a paper 2 days ago

SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments

View all activity

Organizations

None yet

upvoted 5 papers 2 days ago

TIP: Token Importance in On-Policy Distillation

Paper • 2604.14084 • Published 4 days ago • 11

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

Paper • 2604.10866 • Published 6 days ago • 58

SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments

Paper • 2604.14144 • Published 4 days ago • 61

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published 6 days ago • 99

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published 4 days ago • 136

submitted a paper to Daily Papers 3 days ago

TIP: Token Importance in On-Policy Distillation

Paper • 2604.14084 • Published 4 days ago • 11

submitted a paper to Daily Papers about 1 month ago

PACED: Distillation at the Frontier of Student Competence

Paper • 2603.11178 • Published Mar 11 • 4

authored 4 papers about 1 month ago

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

Paper • 2602.21420 • Published Feb 24 • 6

On-Policy Self-Distillation for Reasoning Compression

Paper • 2603.05433 • Published Mar 5 • 8

Not all tokens are needed(NAT): token efficient reinforcement learning

Paper • 2603.06619 • Published Feb 20 • 1

PACED: Distillation at the Frontier of Student Competence

Paper • 2603.11178 • Published Mar 11 • 4

upvoted 3 papers about 1 month ago

PACED: Distillation at the Frontier of Student Competence

Paper • 2603.11178 • Published Mar 11 • 4

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

Paper • 2602.21420 • Published Feb 24 • 6

Flash-KMeans: Fast and Memory-Efficient Exact K-Means

Paper • 2603.09229 • Published Mar 10 • 82