David Fan

davidfan97

dfan

AI & ML interests

Visual representation learning, videos, vision-language

Recent Activity

liked a model 6 days ago

facebook/webssl-dino300m-full2b-224

upvoted a collection 6 days ago

Scale RAE

upvoted a collection 3 months ago

RAE

View all activity

Organizations

upvoted a collection 6 days ago

Scale RAE

Collection

Collection for "Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders" • 7 items • Updated 15 days ago • 3

upvoted a collection 3 months ago

RAE

Collection

Collection for Diffusion Transformers with Representation Autoencoders • 1 item • Updated Oct 14, 2025 • 11

upvoted a paper 4 months ago

OneFlow: Concurrent Mixed-Modal and Interleaved Generation with Edit Flows

Paper • 2510.03506 • Published Oct 3, 2025 • 15

upvoted a paper 5 months ago

Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training

Paper • 2509.26625 • Published Sep 30, 2025 • 43

upvoted a paper 6 months ago

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21, 2025 • 90

upvoted a collection 9 months ago

V-JEPA 2

Collection

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 192

upvoted 4 collections 10 months ago

upvoted 4 papers about 1 year ago

Text-Guided Video Masked Autoencoder

Paper • 2408.00759 • Published Aug 1, 2024 • 1

Motion-Guided Masking for Spatiotemporal Representation Learning

Paper • 2308.12962 • Published Aug 24, 2023 • 1

MetaMorph: Multimodal Understanding and Generation via Instruction Tuning

Paper • 2412.14164 • Published Dec 18, 2024 • 4

Video Token Merging for Long-form Video Understanding

Paper • 2410.23782 • Published Oct 31, 2024 • 2

David Fan

AI & ML interests

Recent Activity

Organizations

davidfan97's activity