Community Blog & Articles

Community Articles

How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs

BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders

Using OCR models with llama.cpp

KV Caching Explained: Optimizing Transformer Inference Efficiency

Darwin V6: Diagnostic-Guided Evolutionary Model Merging

Uncensor any LLM with abliteration

Building Harvey-style tabular review from scratch, but better

"Darwin-27B-Opus: Surpassing the Foundation Model Without Training"

about 7 hours ago

Mastering Tensor Dimensions in Transformers

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

How I contributed a new model to the Transformers library using Codex

YC-Bench: Can Your AI Agent Run a Startup Without Going Bankrupt?

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons

ArmBench-LLM 1.0: Benchmarking LLMs on Armenian Language Tasks

Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

From GRPO to DAPO and GSPO: What, Why, and How

Projected Abliteration

announcementdiffusionworld-model

Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs

+1

multimodalnlpcommunity

Multimodal Embedding & Reranker Models with Sentence Transformers

ALTK‑Evolve: On‑the‑Job Learning for AI Agents

open-source-collabpartnershipsopen-source

Safetensors is Joining the PyTorch Foundation

multimodalon-devicegemma4

Welcome Gemma 4: Frontier multimodal intelligence on device

+3

Holo3: Breaking the Computer Use Frontier

Falcon Perception

gradioserveropen-source

Any Custom Frontend with Gradio's Backend

Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents

Training mRNA Language Models Across 25 Species for $165

trlreinforcement-learningannouncement

TRL v1.0: Post-Training Library Built to Move with the Field

guideagentsinference-providers

Liberate your OpenClaw

+4

A New Framework for Evaluating Voice Agents (EVA)

Build a Domain-Specific Embedding Model in Under a Day

Community Articles

NEW Articles from Team or Enterprise organizations will get promoted to the main section.

How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs

BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders

Using OCR models with llama.cpp

KV Caching Explained: Optimizing Transformer Inference Efficiency

Darwin V6: Diagnostic-Guided Evolutionary Model Merging

Uncensor any LLM with abliteration

Building Harvey-style tabular review from scratch, but better

"Darwin-27B-Opus: Surpassing the Foundation Model Without Training"

about 7 hours ago

Mastering Tensor Dimensions in Transformers

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

How I contributed a new model to the Transformers library using Codex

YC-Bench: Can Your AI Agent Run a Startup Without Going Bankrupt?

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons

ArmBench-LLM 1.0: Benchmarking LLMs on Armenian Language Tasks

Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

From GRPO to DAPO and GSPO: What, Why, and How

Projected Abliteration

View all articles