ldwang's picture

ldwang

ldwang

·

ftgreat

AI & ML interests

LLM, MLLM, Infra

Recent Activity

liked a model 1 day ago

ai9stars/AutoTriton

upvoted a paper 1 day ago

mHC: Manifold-Constrained Hyper-Connections

updated a collection 8 days ago

View all activity

Organizations

upvoted a paper 1 day ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 3 days ago • 135

upvoted a collection 9 days ago

Molmo2 Data

Artifacts for the Molmo2 data release • 16 items • Updated 11 days ago • 27

upvoted 4 papers about 1 month ago

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

Paper • 2512.02551 • Published Dec 2, 2025 • 12

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 280

Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch

Paper • 2512.02395 • Published Dec 2, 2025 • 47

FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions

Paper • 2509.17177 • Published Sep 21, 2025 • 13

upvoted 3 papers about 2 months ago

Motif 2 12.7B technical report

Paper • 2511.07464 • Published Nov 7, 2025 • 39

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 131

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 128

upvoted a collection 2 months ago

Emu3.5

Native Multimodal Models are World Learners 🌍 • 4 items • Updated 9 days ago • 72

upvoted 2 papers 2 months ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 108

Uniform Discrete Diffusion with Metric Path for Video Generation

Paper • 2510.24717 • Published Oct 28, 2025 • 40

upvoted a collection 2 months ago

Reasoning Efficiency Research

Ultra-efficient reasoning model! SOTA Accuracy / CoT Length trade-offs • 3 items • Updated 11 days ago • 11

upvoted an article 2 months ago

Article

`LeRobotDataset:v3.0`: Bringing large-scale datasets to `lerobot`

+9

Sep 16, 2025

•

47

upvoted a paper 2 months ago

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20, 2025 • 67

upvoted an article 2 months ago

Article

Supercharge your OCR Pipelines with Open Models

+5

Oct 21, 2025

•

289

upvoted a paper 3 months ago

CommonForms: A Large, Diverse Dataset for Form Field Detection

Paper • 2509.16506 • Published Sep 20, 2025 • 19

upvoted a collection 3 months ago

The Ultimate Collection of Code Classifiers

🔥 15 classifiers, 124M parameters, one per programming language— for assessing the educational value of GitHub code • 15 items • Updated May 5, 2025 • 15

upvoted a paper 3 months ago

EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling

Paper • 2509.23909 • Published Sep 28, 2025 • 32

upvoted a collection 3 months ago

DataDecide

A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 358 items • Updated 11 days ago • 21