AnIdealRing

SmartDazi

AI & ML interests

None yet

Recent Activity

upvoted a paper 19 days ago

Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving

upvoted a paper 22 days ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

upvoted a paper 3 months ago

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Organizations

models 0

None public yet

datasets 0

None public yet