Young-Jun Lee PRO

passing2961

https://sites.google.com/view/passing2961/home

AI & ML interests

Social Dialogue System, Multi-Modal Dialogue

Recent Activity

upvoted a paper 1 day ago

AcademiClaw: When Students Set Challenges for AI Agents

upvoted a paper 1 day ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

upvoted a paper 1 day ago

AI Co-Mathematician: Accelerating Mathematicians with Agentic AI

Organizations

upvoted 5 papers 1 day ago

updated a bucket 6 days ago

passing2961/cobench-openevolve-results

384 MB

published a bucket 6 days ago

passing2961/cobench-openevolve-results

384 MB

upvoted a paper 9 days ago

Co-Director: Agentic Generative Video Storytelling

Paper • 2604.24842 • Published 13 days ago • 16

upvoted a paper 10 days ago

AutoResearchBench: Benchmarking AI Agents on Complex Scientific Literature Discovery

Paper • 2604.25256 • Published 12 days ago • 29

upvoted a paper 11 days ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published 13 days ago • 116

upvoted 2 papers 16 days ago

Scaling Test-Time Compute for Agentic Coding

Paper • 2604.16529 • Published 24 days ago • 11

Abstain-R1: Calibrated Abstention and Post-Refusal Clarification via Verifiable RL

Paper • 2604.17073 • Published 22 days ago • 9

upvoted 4 papers 18 days ago

MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and Retrieval

Paper • 2604.18584 • Published 20 days ago • 14

SkillFlow: Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents

Paper • 2604.17308 • Published 21 days ago • 22

When Can LLMs Learn to Reason with Weak Supervision?

Paper • 2604.18574 • Published 20 days ago • 25

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published 20 days ago • 84

upvoted 2 papers 19 days ago

PRL-Bench: A Comprehensive Benchmark Evaluating LLMs' Capabilities in Frontier Physics Research

Paper • 2604.15411 • Published 24 days ago • 4

The Amazing Agent Race: Strong Tool Users, Weak Navigators

Paper • 2604.10261 • Published 23 days ago • 7

upvoted a paper 24 days ago

Toward Autonomous Long-Horizon Engineering for ML Research

Paper • 2604.13018 • Published 26 days ago • 34

upvoted a paper 25 days ago

QuanBench+: A Unified Multi-Framework Benchmark for LLM-Based Quantum Code Generation

Paper • 2604.08570 • Published Mar 25 • 125
