vision - a shobbs Collection

shobbs 's Collections

papers

think and learn

NSFW

bio

vision

video llm llava

arm

vision

updated 9 days ago

google/paligemma2-28b-pt-896

Image-Text-to-Text • 28B • Updated Dec 5, 2024 • 205 • 50
lmstudio-community/olmOCR-7B-0225-preview-GGUF

Image-Text-to-Text • 8B • Updated Feb 25, 2025 • 231 • 12
vidore/colqwen2.5-v0.2

Visual Document Retrieval • Updated Jun 16, 2025 • 37.5k • 92
vidore/colpali-v1.3

Visual Document Retrieval • Updated Mar 14, 2025 • 33.6k • 84
vidore/colSmol-500M

Visual Document Retrieval • Updated Mar 14, 2025 • 1.83k • 20
deepseek-ai/deepseek-vl2

Image-Text-to-Text • 27B • Updated Dec 18, 2024 • 12.6k • 376
Running on Zero

5

gen2seg: Generative Models Enable Generalizable Instance Segmentation

🚀

5

A demo of our gen2seg SD and MAE-H models.
nvidia/NitroGen

Updated about 4 hours ago • 455
naver-hyperclovax/HyperCLOVAX-SEED-Omni-8B

Text Generation • 11B • Updated 1 day ago • 774 • 131