Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem • Paper • 2512.24873 • Published 19 days ago • 101 • 5
OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization • Paper • 2212.12017 • Published Dec 22, 2022 • 1 • 1
Direct Preference Optimization: Your Language Model is Secretly a Reward Model • Paper • 2305.18290 • Published May 29, 2023 • 64 • 4
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference • Paper • 2403.04132 • Published Mar 7, 2024 • 40 • 2
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning • Paper • 2601.09667 • Published 5 days ago • 76 • 6
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery • Paper • 2408.06292 • Published Aug 12, 2024 • 127 • 11
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs • Paper • 2601.08763 • Published 6 days ago • 131 • 5
VL-JEPA: Joint Embedding Predictive Architecture for Vision-language • Paper • 2512.10942 • Published Dec 11, 2025 • 45 • 6
Urban Socio-Semantic Segmentation with Vision-Language Reasoning • Paper • 2601.10477 • Published 4 days ago • 150 • 3
CloneMem: Benchmarking Long-Term Memory for AI Clones • Paper • 2601.07023 • Published 8 days ago • 2 • 1
An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models • Paper • 2408.00724 • Published Aug 1, 2024 • 2 • 1
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation • Paper • 2601.09688 • Published 5 days ago • 114 • 3