KWBench: Measuring Unprompted Problem Recognition in Knowledge Work • Paper • 2604.15760 • Published 11 days ago
R-HORIZON • Collection • The training and evaluation datasets for the paper "How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?" • 6 items • Updated Oct 22, 2025