Janus: Disaggregating Attention and Experts for Scalable MoE Inference Paper • 2512.13525 • Published 19 days ago • 5
DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry Paper • 2512.11558 • Published 22 days ago • 41
Rethinking Spectral Augmentation for Contrast-based Graph Self-Supervised Learning Paper • 2405.19600 • Published May 30, 2024
DREAM: Improving Video-Text Retrieval Through Relevance-Based Augmentation Using Large Foundation Models Paper • 2404.05083 • Published Apr 7, 2024
The Underappreciated Power of Vision Models for Graph Structural Understanding Paper • 2510.24788 • Published Oct 27, 2025 • 35
Can Multimodal LLMs See Materials Clearly? A Multimodal Benchmark on Materials Characterization Paper • 2509.09307 • Published Sep 11, 2025 • 6
StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs Paper • 2506.03077 • Published Jun 3, 2025 • 17
ClavaDDPM: Multi-relational Data Synthesis with Cluster-guided Diffusion Models Paper • 2405.17724 • Published May 28, 2024
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series Paper • 2405.19327 • Published May 29, 2024 • 48
GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks Paper • 2504.12764 • Published Apr 17, 2025 • 41
Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers Paper • 2505.21497 • Published May 27, 2025 • 109
MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception Paper • 2312.07472 • Published Dec 12, 2023 • 2
SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection Paper • 2309.07084 • Published Sep 13, 2023 • 1
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control Paper • 2403.12037 • Published Mar 18, 2024 • 1
WorldSimBench: Towards Video Generation Models as World Simulators Paper • 2410.18072 • Published Oct 23, 2024 • 19
GameFactory: Creating New Games with Generative Interactive Videos Paper • 2501.08325 • Published Jan 14, 2025 • 67
T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation Paper • 2501.12612 • Published Jan 22, 2025
RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints Paper • 2503.16408 • Published Mar 20, 2025 • 42