Estimated Performance Frontiers of Open LLM Leaderboard Tasks 🚀 Space • Estimate LLM task performance from pretraining compute
Prescriptive Scaling Reveals the Evolution of Language Model Capabilities Paper • 2602.15327 • Published 12 days ago • 2
Discovering Hierarchical Latent Capabilities of Language Models via Causal Representation Learning Paper • 2506.10378 • Published Jun 12, 2025 • 2
The Ultra-Scale Playbook 🌌 Space • The ultimate guide to training LLMs on large GPU clusters
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark Paper • 2304.03279 • Published Apr 6, 2023 • 2
CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training Paper • 2406.10670 • Published Jun 15, 2024 • 4
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17, 2024 • 55
Eliminating Position Bias of Language Models: A Mechanistic Approach Paper • 2407.01100 • Published Jul 1, 2024 • 8
Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models Paper • 2412.02674 • Published Dec 3, 2024