view article Article Phare LLM benchmark V2: Reasoning models don't guarantee better security 21 days ago • 10
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 265
SmolLM-Smashed: Tiny Giants, Optimized for Speed Collection SmolLM-Smashed is a collection of optimized language models. Each model is quantized and compiled for maximum efficiency while preserving performance. • 5 items • Updated Oct 4, 2025 • 1
NanoBEIR 🍺 Collection A collection of smaller versions of BEIR datasets with 50 queries and up to 10K documents each. • 13 items • Updated Sep 11, 2024 • 23
view article Article Building the Open Agent Ecosystem Together: Introducing OpenEnv +8 Oct 23, 2025 • 139
view article Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data +7 Jun 3, 2025 • 305
gpt-oss-safeguard Collection gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss • 2 items • Updated Oct 29, 2025 • 58
💧 LFM2 Collection LFM2 is a new generation of hybrid models, designed for on-device deployment. • 26 items • Updated about 15 hours ago • 131
view article Article Vibe coding for data science: how to label a dataset with Kimi K2 Jul 22, 2025 • 21
view article Article RealPerformance, A Dataset of Language Model Business Compliance Issues Jul 21, 2025 • 4
view article Article LLM Hallucinations: bug or feature? The US Supreme Court 2025 cases experiment Jul 8, 2025 • 19
view article Article LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs Jul 2, 2025 • 16