DIFFA-2: A Practical Diffusion Large Language Model for General Audio Understanding • Paper • 2601.23161 • Published 3 days ago • 5
ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought • Paper • 2601.23184 • Published 3 days ago • 12
PaperBanana: Automating Academic Illustration for AI Scientists • Paper • 2601.23265 • Published 3 days ago • 18
NativeTok: Native Visual Tokenization for Improved Image Generation • Paper • 2601.22837 • Published 3 days ago • 7
DINO-SAE: DINO Spherical Autoencoder for High-Fidelity Image Reconstruction and Generation • Paper • 2601.22904 • Published 3 days ago • 6
Self-Improving Pretraining: using post-trained models to pretrain better models • Paper • 2601.21343 • Published 5 days ago • 13
Scaling Embeddings Outperforms Scaling Experts in Language Models • Paper • 2601.21204 • Published 5 days ago • 95
DeepSearchQA: Bridging the Comprehensiveness Gap for Deep Research Agents • Paper • 2601.20975 • Published 5 days ago • 8
WorldBench: Disambiguating Physics for Diagnostic Evaluation of World Models • Paper • 2601.21282 • Published 5 days ago
Linear representations in language models can change dramatically over a conversation • Paper • 2601.20834 • Published 5 days ago • 20
OmegaUse: Building a General-Purpose GUI Agent for Autonomous Task Execution • Paper • 2601.20380 • Published 6 days ago • 8
SketchDynamics: Exploring Free-Form Sketches for Dynamic Intent Expression in Animation Generation • Paper • 2601.20622 • Published 5 days ago • 1
UI Remix: Supporting UI Design Through Interactive Example Retrieval and Remixing • Paper • 2601.18759 • Published 7 days ago • 2
SAGE: Steerable Agentic Data Generation for Deep Search with Execution Feedback • Paper • 2601.18202 • Published 8 days ago • 8