Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning Paper • 2605.14386 • Published 4 days ago • 50
view article Article Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step FINAL-Bench • 2 days ago • 13
view article Article FINAL Bench: The Real Bottleneck to AGI Is Self-Correction FINAL-Bench • Feb 21 • 20
Korean BEST Leaderboard Collection A curated collection of the best Korean-developed spaces and models released on Hugging Face • 221 items • Updated Apr 24, 2025 • 100