Hanlin Wang

Henrywang

https://wanghanlinhenry.github.io/

AI & ML interests

LLM Agent, Reinforcement Learning, Embodied AI

Recent Activity

upvoted a paper 9 days ago

ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning

upvoted a paper 9 days ago

Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models

upvoted a paper 14 days ago

AT^2PO: Agentic Turn-based Policy Optimization via Tree Search

View all activity

Organizations

None yet

upvoted 2 papers 9 days ago

ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning

Paper • 2507.16815 • Published Jul 22, 2025 • 40

Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models

Paper • 2601.08955 • Published 10 days ago • 13

upvoted a paper 14 days ago

AT^2PO: Agentic Turn-based Policy Optimization via Tree Search

Paper • 2601.04767 • Published 16 days ago • 27

upvoted a paper 2 months ago

The Station: An Open-World Environment for AI-Driven Discovery

Paper • 2511.06309 • Published Nov 9, 2025 • 37

authored 3 papers 8 months ago

E2CL: Exploration-based Error Correction Learning for Embodied Agents

Paper • 2409.03256 • Published Sep 5, 2024 • 1

Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning

Paper • 2505.16782 • Published May 22, 2025 • 1

SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution

Paper • 2505.20732 • Published May 27, 2025 • 1

upvoted a paper 8 months ago

SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution

Paper • 2505.20732 • Published May 27, 2025 • 1

authored a paper 8 months ago

STeCa: Step-level Trajectory Calibration for LLM Agent Learning

Paper • 2502.14276 • Published Feb 20, 2025 • 1

upvoted a paper 8 months ago

STeCa: Step-level Trajectory Calibration for LLM Agent Learning

Paper • 2502.14276 • Published Feb 20, 2025 • 1

upvoted a paper 9 months ago

OTC: Optimal Tool Calls via Reinforcement Learning

Paper • 2504.14870 • Published Apr 21, 2025 • 35

upvoted a paper 10 months ago

E2CL: Exploration-based Error Correction Learning for Embodied Agents

Paper • 2409.03256 • Published Sep 5, 2024 • 1

upvoted a paper 11 months ago

Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region

Paper • 2502.13946 • Published Feb 19, 2025 • 10

updated a model over 3 years ago

Henrywang/dummy-model

Fill-Mask • Updated May 8, 2022 • 2

Hanlin Wang

AI & ML interests

Recent Activity

Organizations

Henrywang's activity