CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning
  Paper • 2603.00889 • Published 3 days ago • 30

The Art of Efficient Reasoning: Data, Reward, and Optimization
  Paper • 2602.20945 • Published 7 days ago • 6

ProFit: Leveraging High-Value Signals in SFT via Probability-Guided Token Selection
  Paper • 2601.09195 • Published Jan 14 • 15

X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests
  Paper • 2601.06953 • Published Jan 11 • 45

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model
  Paper • 2510.19871 • Published Oct 22, 2025 • 30

Revisiting Model Interpolation for Efficient Reasoning
  Paper • 2510.10977 • Published Oct 13, 2025 • 10

Timber: Training-free Instruct Model Refining with Base via Effective Rank
  Paper • 2509.23595 • Published Sep 28, 2025 • 1

A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code
  Paper • 2508.18106 • Published Aug 25, 2025 • 349

Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models
  Paper • 2404.02657 • Published Apr 3, 2024 • 2

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?
  Paper • 2505.15929 • Published May 21, 2025 • 49

LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models
  Paper • 2411.06839 • Published Nov 11, 2024 • 1

LLM-Neo Collection
  Model hub for LLM-Neo, including Llama3.1-Neo-1B-100w and Minitron-4B-Depth-Neo-10w. • 3 items • Updated Nov 20, 2024 • 6