Video-Text-to-Text
Transformers
Safetensors
English
video_mllama
text-generation
multimodal
video
vision-language
mllama
streaming
realtime
low-latency
custom_code
Instructions to use OpenMOSS-Team/moss-video-preview-realtime-sft with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenMOSS-Team/moss-video-preview-realtime-sft with Transformers:
# Load model directly
from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("OpenMOSS-Team/moss-video-preview-realtime-sft", trust_remote_code=True, dtype="auto")
- Notebooks
- Google Colab
- Kaggle
{
  "bos_token_id": 128000,
  "do_sample": true,
  "eos_token_id": [
    128001,
    128008,
    128009
  ],
  "pad_token_id": 128004,
  "temperature": 0.6,
  "top_p": 0.9,
  "transformers_version": "4.47.1"
}
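
To make the sampling fields above concrete, here is a minimal sketch in plain Python of what `do_sample`, `temperature`, `top_p`, and the `eos_token_id` list typically mean at decode time. It is an illustration only and not the Transformers implementation: the helper names `top_p_probs`, `sample_next`, and `is_eos` are hypothetical, and the toy logits in the usage note are made up for the example.

```python
import json
import math
import random

# Sampling-related values copied from the generation config shown above.
GENERATION_CONFIG = json.loads("""
{
  "bos_token_id": 128000,
  "do_sample": true,
  "eos_token_id": [128001, 128008, 128009],
  "pad_token_id": 128004,
  "temperature": 0.6,
  "top_p": 0.9
}
""")


def top_p_probs(logits, temperature, top_p):
    """Hypothetical helper: temperature-scale the logits, then keep the
    smallest set of tokens whose cumulative probability reaches top_p.
    Returns {token_index: renormalized_probability}."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for a numerically stable softmax
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Walk tokens from most to least likely until the nucleus is filled.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    kept_mass = sum(probs[i] for i in kept)
    return {i: probs[i] / kept_mass for i in kept}


def sample_next(logits, config, rng):
    """Hypothetical helper: pick the next token index under the config."""
    if not config["do_sample"]:
        # Greedy decoding: take the highest-scoring token.
        return max(range(len(logits)), key=lambda i: logits[i])
    dist = top_p_probs(logits, config["temperature"], config["top_p"])
    ids = list(dist)
    return rng.choices(ids, weights=[dist[i] for i in ids], k=1)[0]


def is_eos(token_id, config):
    """Generation stops when any of the configured EOS ids is produced."""
    eos = config["eos_token_id"]
    eos_ids = eos if isinstance(eos, list) else [eos]
    return token_id in eos_ids
```

For example, with toy logits `[2.0, 1.0, 0.0, -5.0]`, `sample_next(logits, GENERATION_CONFIG, random.Random(0))` can only return index 0 or 1, because a temperature of 0.6 sharpens the distribution enough that the two most likely tokens already cover more than 90% of the probability mass. In actual use these values are applied by the library during `generate`; the sketch only shows the arithmetic they imply.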