arxiv:2602.10693
floyed shen
floyed
AI & ML interests
None yet
Recent Activity
upvoted a paper about 24 hours ago
From Generic Correlation to Input-Specific Credit in On-Policy Self Distillation upvoted a paper about 24 hours ago
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information upvoted a paper 17 days ago
Safety Instincts: LLMs Learn to Trust Their Internal Compass for Self-Defense