Chinese University of Hong Kong, Shenzhen

university

https://www.cuhk.edu.cn/

Activity Feed Request to join this org

AI & ML interests

NLP, CV

Recent Activity

yeyeyewang submitted a paper 17 days ago

Janus: Disaggregating Attention and Experts for Scalable MoE Inference

Eric3200 submitted a paper 20 days ago

DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry

Eric3200 authored a paper 3 months ago

Can Multimodal LLMs See Materials Clearly? A Multimodal Benchmark on Materials Characterization

View all activity

Papers

Janus: Disaggregating Attention and Experts for Scalable MoE Inference

View all Papers

yeyeyewang

submitted a paper to Daily Papers 17 days ago

Janus: Disaggregating Attention and Experts for Scalable MoE Inference

Paper • 2512.13525 • Published 19 days ago • 5

Eric3200

submitted a paper to Daily Papers 20 days ago

DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry

Paper • 2512.11558 • Published 22 days ago • 41

weipang142857

authored 4 papers about 2 months ago

Rethinking Spectral Augmentation for Contrast-based Graph Self-Supervised Learning

Paper • 2405.19600 • Published May 30, 2024

DREAM: Improving Video-Text Retrieval Through Relevance-Based Augmentation Using Large Foundation Models

Paper • 2404.05083 • Published Apr 7, 2024

LazyVLM: Neuro-Symbolic Approach to Video Analytics

Paper • 2505.21459 • Published May 27, 2025

The Underappreciated Power of Vision Models for Graph Structural Understanding

Paper • 2510.24788 • Published Oct 27, 2025 • 35

Eric3200

authored a paper 3 months ago

Can Multimodal LLMs See Materials Clearly? A Multimodal Benchmark on Materials Characterization

Paper • 2509.09307 • Published Sep 11, 2025 • 6

Kullpar

authored a paper 7 months ago

StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs

Paper • 2506.03077 • Published Jun 3, 2025 • 17

weipang142857

authored 4 papers 7 months ago

ClavaDDPM: Multi-relational Data Synthesis with Cluster-guided Diffusion Models

Paper • 2405.17724 • Published May 28, 2024

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

Paper • 2405.19327 • Published May 29, 2024 • 48

GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks

Paper • 2504.12764 • Published Apr 17, 2025 • 41

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published May 27, 2025 • 109

IranQin

authored 7 papers 10 months ago

MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception

Paper • 2312.07472 • Published Dec 12, 2023 • 2

SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection

Paper • 2309.07084 • Published Sep 13, 2023 • 1

MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control

Paper • 2403.12037 • Published Mar 18, 2024 • 1

WorldSimBench: Towards Video Generation Models as World Simulators

Paper • 2410.18072 • Published Oct 23, 2024 • 19

GameFactory: Creating New Games with Generative Interactive Videos

Paper • 2501.08325 • Published Jan 14, 2025 • 67

T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation

Paper • 2501.12612 • Published Jan 22, 2025

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints

Paper • 2503.16408 • Published Mar 20, 2025 • 42

whatlegequ

authored a paper 10 months ago

Inducing Neural Collapse in Deep Long-tailed Learning

Paper • 2302.12453 • Published Feb 24, 2023