C2: Scalable Rubric-Augmented Reward Modeling from Binary Preferences Paper • 2604.13618 • Published 4 days ago • 3
Model Capability Dominates: Inference-Time Optimization Lessons from AIMO 3 Paper • 2603.27844 • Published 3 days ago • 2
An Optimal Transport-driven Approach for Cultivating Latent Space in Online Incremental Learning Paper • 2211.16780 • Published 3 days ago • 1
Towards Autonomous Mechanistic Reasoning in Virtual Cells Paper • 2604.11661 • Published 5 days ago • 4
MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation Paper • 2604.15309 • Published 3 days ago • 5
SuperLocalMemory V3.3: The Living Brain -- Biologically-Inspired Forgetting, Cognitive Quantization, and Multi-Channel Retrieval for Zero-LLM Agent Memory Systems Paper • 2604.04514 • Published 13 days ago • 4
OneHOI: Unifying Human-Object Interaction Generation and Editing Paper • 2604.14062 • Published 4 days ago • 6
Cross-Tokenizer LLM Distillation through a Byte-Level Interface Paper • 2604.07466 • Published 6 days ago • 4
KV Packet: Recomputation-Free Context-Independent KV Caching for LLMs Paper • 2604.13226 • Published 5 days ago • 5
LeapAlign: Post-Training Flow Matching Models at Any Generation Step by Building Two-Step Trajectories Paper • 2604.15311 • Published 3 days ago • 5
LongAct: Harnessing Intrinsic Activation Patterns for Long-Context Reinforcement Learning Paper • 2604.14922 • Published 3 days ago • 5
Don't Retrieve, Navigate: Distilling Enterprise Knowledge into Navigable Agent Skills for QA and RAG Paper • 2604.14572 • Published 3 days ago • 4
Boosting Visual Instruction Tuning with Self-Supervised Guidance Paper • 2604.12966 • Published 5 days ago • 5
Representations Before Pixels: Semantics-Guided Hierarchical Video Prediction Paper • 2604.11707 • Published 6 days ago • 7
Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems Paper • 2604.14228 • Published 5 days ago • 9
TRACER: Trace-Based Adaptive Cost-Efficient Routing for LLM Classification Paper • 2604.14531 • Published 3 days ago • 6
Switch-KD: Visual-Switch Knowledge Distillation for Vision-Language Models Paper • 2604.14629 • Published 3 days ago • 8
UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards Paper • 2604.14967 • Published 3 days ago • 8