AI Safety Research's picture

AI Safety Research

AISafety

·

https://humanaligned.ai

AI & ML interests

LLMs, planning, EA

Recent Activity

liked a model 7 days ago

Tesslate/OmniCoder-9B

liked a model 7 days ago

bartowski/zed-industries_zeta-2-GGUF

liked a dataset 7 days ago

MaziyarPanahi/Synthia-Coder-v1.5-I-sharegpt

View all activity

Organizations

upvoted a collection 7 days ago

ShareGPT Datasets

22 items • Updated Mar 2 • 16

upvoted a collection 17 days ago

Mistral Small 4

A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated 19 days ago • 63

upvoted 2 articles 3 months ago

Article

Exploring Environments Hub: Your Language Model needs better (open) environments to learn

Sep 4, 2025

•

30

Article

HUMAINE: A Rigorous Framework for Understanding AI Through Human Experience

Sep 16, 2025

•

7

upvoted an article 4 months ago

Article

From GRPO to DAPO and GSPO: What, Why, and How

Aug 9, 2025

•

108

upvoted a collection 4 months ago

Olmo 3

Artifacts for the Olmo 3 release. • 7 items • Updated Mar 2 • 167

upvoted an article 4 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

618

upvoted a collection 4 months ago

Transformers.js demos

A collection of my favorite WebML demos, built with Transformers.js! • 30 items • Updated Jul 11, 2024 • 141

upvoted a paper 4 months ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 93

upvoted a paper 5 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 133

upvoted a collection 5 months ago

The Bestiary

Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 6 items • Updated Nov 16, 2025 • 109

upvoted an article 6 months ago

Article

EuroLLM-9B

Dec 2, 2024

•

139

upvoted a collection 6 months ago

🎯 Liquid Nanos

Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices • 26 items • Updated 21 days ago • 111

upvoted an article 6 months ago

Article

SOTA OCR with Core ML and dots.ocr

Oct 2, 2025

•

64

upvoted a collection 6 months ago

DeepSeek-V3.2

4 items • Updated Dec 1, 2025 • 536

upvoted a paper 6 months ago

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23, 2025 • 67

upvoted a collection 7 months ago

InternVL3.5

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 45 items • Updated Mar 2 • 107

upvoted a collection 8 months ago

DeepSeek-V3.1

3 items • Updated Mar 2 • 261

upvoted an article 8 months ago

Article

Introducing AI Sheets: a tool to work with datasets using open AI models!

+4

Aug 8, 2025

•

108

upvoted a paper 8 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 210