Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning Paper • 2605.02913 • Published Apr 8 • 8
SplAttN: Bridging 2D and 3D with Gaussian Soft Splatting and Attention for Point Cloud Completion Paper • 2605.01466 • Published 12 days ago • 6
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 11 days ago • 153
On the Robustness of LLM-Based Dense Retrievers: A Systematic Analysis of Generalizability and Stability Paper • 2604.16576 • Published 26 days ago • 2
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published about 1 month ago • 101
AnomalyVFM -- Transforming Vision Foundation Models into Zero-Shot Anomaly Detectors Paper • 2601.20524 • Published Apr 9 • 6
Qualixar OS: A Universal Operating System for AI Agent Orchestration Paper • 2604.06392 • Published Apr 7 • 17
A Neural Score-Based Particle Method for the Vlasov-Maxwell-Landau System Paper • 2603.25832 • Published Mar 26 • 4
Diffutron: A Masked Diffusion Language Model for Turkish Language Paper • 2603.20466 • Published Mar 20 • 9