WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing Paper • 2603.11593 • Published 16 days ago • 25
Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation Paper • 2603.12247 • Published 16 days ago • 23
view article Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 Jul 5, 2024 • 317
UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation Paper • 2408.11305 • Published Aug 21, 2024 • 1
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper • 2504.21776 • Published Apr 30, 2025 • 59
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12, 2025 • 493
Running 3.75k The Ultra-Scale Playbook 🌌 3.75k The ultimate guide to training LLM on large GPU Clusters
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2, 2025 • 61
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 187
view post Post 2106 New smolagents example landed on Hugging Face cookbook 🤠Learn how to create an inventory managing multi-agent system with smolagents, MongoDB and DeepSeek Chat 📖 https://huggingface.co/learn/cookbook/mongodb_smolagents_multi_micro_agents See translation 🔥 7 7 🤗 4 4 😎 2 2 + Reply