Video-Text-to-Text
Transformers
Safetensors
English
moss_vl
feature-extraction
Base
Video-Understanding
Image-Understanding
MOSS-VL
OpenMOSS
multimodal
video
vision-language
custom_code
Instructions to use OpenMOSS-Team/MOSS-VL-Base-0408 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenMOSS-Team/MOSS-VL-Base-0408 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("OpenMOSS-Team/MOSS-VL-Base-0408", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
update transformers to 5.5.4
Browse files- requirements.txt +1 -1
requirements.txt
CHANGED
|
@@ -3,7 +3,7 @@
|
|
| 3 |
|
| 4 |
torch==2.8.0+cu128
|
| 5 |
torchvision==0.23.0+cu128
|
| 6 |
-
transformers==
|
| 7 |
accelerate==1.12.0
|
| 8 |
flash-attn==2.8.1
|
| 9 |
torchcodec==0.7.0
|
|
|
|
| 3 |
|
| 4 |
torch==2.8.0+cu128
|
| 5 |
torchvision==0.23.0+cu128
|
| 6 |
+
transformers==5.5.4
|
| 7 |
accelerate==1.12.0
|
| 8 |
flash-attn==2.8.1
|
| 9 |
torchcodec==0.7.0
|