OpenGVLab

community

https://github.com/opengvlab

Activity Feed Request to join this org

AI & ML interests

Computer Vision

Recent Activity

cuierfei authored a paper 4 minutes ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Rayment authored a paper about 3 hours ago

MetaCaptioner: Towards Generalist Visual Captioning with Open-source Suites

Rayment authored a paper about 3 hours ago

ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework

View all activity

Papers

RIVER: A Real-Time Interaction Benchmark for Video LLMs

InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

View all Papers

authored a paper 4 minutes ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published 1 day ago • 76

authored 2 papers about 3 hours ago

MetaCaptioner: Towards Generalist Visual Captioning with Open-source Suites

Paper • 2510.12126 • Published Oct 14, 2025 • 1

ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework

Paper • 2603.20644 • Published 6 days ago • 2

authored a paper about 3 hours ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published 1 day ago • 76

authored a paper about 3 hours ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published 1 day ago • 76

posted an update 12 days ago

Post

6263

We should really have a release date range slider on the /models page. Tired of "trending/most downloaded" being the best way to sort and still seeing models from 2023 on the first page just because they're embedded in enterprise pipelines and get downloaded repeatedly. "Recently Created/Recently Updated" don't solve the discovery problem considering the amount of noise to sift through.

Slight caveat: Trending actually does have some recency bias, but it's not strong/precise enough.

3 replies

·

authored a paper 14 days ago

GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing

Paper • 2603.12264 • Published 15 days ago • 14

authored a paper 14 days ago

GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing

Paper • 2603.12264 • Published 15 days ago • 14

authored a paper 14 days ago

GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing

Paper • 2603.12264 • Published 15 days ago • 14

authored a paper 14 days ago

Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation

Paper • 2603.12247 • Published 15 days ago • 23

submitted a paper to Daily Papers 15 days ago

GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing

Paper • 2603.12264 • Published 15 days ago • 14

authored a paper 15 days ago

RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback

Paper • 2603.08561 • Published 18 days ago • 12

in OpenGVLab/InternVideo2-Stage2_1B-224p-f4 16 days ago

Error when using model

#2 opened about 2 months ago by

authored a paper 16 days ago

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Paper • 2603.09877 • Published 17 days ago • 47

authored a paper 16 days ago

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Paper • 2603.09877 • Published 17 days ago • 47

authored a paper 16 days ago

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Paper • 2603.09877 • Published 17 days ago • 47

authored a paper 16 days ago

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Paper • 2603.09877 • Published 17 days ago • 47

submitted a paper to Daily Papers 21 days ago

Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline

Paper • 2603.05484 • Published 22 days ago • 4

updated a dataset 22 days ago

OpenGVLab/RIVER

Updated 22 days ago • 42

published a dataset 22 days ago

OpenGVLab/RIVER

Updated 22 days ago • 42