In-Context Reinforcement Learning for Tool Use in Large Language Models Paper • 2603.08068 • Published 5 days ago • 24
Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs Paper • 2603.09095 • Published 4 days ago • 23
AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios Paper • 2602.23166 • Published 15 days ago • 40
DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval Paper • 2603.04743 • Published 9 days ago • 47
Heterogeneous Agent Collaborative Reinforcement Learning Paper • 2603.02604 • Published 11 days ago • 173
Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling Paper • 2603.04791 • Published 9 days ago • 16
Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory Paper • 2603.04257 • Published 9 days ago • 19
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-GGUF Text Generation • 27B • Updated 6 days ago • 117k • 191
Jackrong/Qwen3.5-2B-Claude-4.6-Opus-Reasoning-Distilled-GGUF Text Generation • 2B • Updated 7 days ago • 34k • 101
embedl/Cosmos-Reason2-2B-W4A16-Edge2 Image-Text-to-Text • 2B • Updated about 8 hours ago • 12.9k • 11
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Text Generation • 28B • Updated 6 days ago • 53.2k • 586