Skier8402's Collections

Computer vision

updated Mar 25, 2025

Image and video models

  • Runtime error
    Agents
    Featured
    198

    Better Florence 2

    🔥

    Analyze images to detect objects, generate captions, or perform OCR


  • Runtime error
    Agents
    34

    EfficientSAM vs SAM

    ⚔


  • Sleeping
    Agents
    31

    Llava Interleave

    🌋

    Generate answers by uploading images or videos


  • Running on Zero
    Agents
    1.79k

    DALLE 3 XL v2

    🔥

    Generate high‑resolution images from your text prompt


  • Running on Zero
    Agents
    143

    Segment Anything 2

    🔥

    Segment images with prompts or automatic masks


  • Running on Zero
    Agents
    Featured
    517

    Florence2 + SAM2

    🔥

    Segment objects in images or videos using text prompts


  • Running on T4
    Agents
    143

    RF-DETR

    🔥

    State-of-the-art (SOTA) real-time object detection model
