*Example outputs (courtesy of dotsimulate)*
# zeroscope_v2 1111 models
A collection of watermark-free, Modelscope-based video models capable of generating high-quality video at 448x256, 576x320, and 1024x576. These models were trained from the original weights with offset noise, using 9,923 clips and 29,769 tagged frames.
This collection makes it easy to switch between models with the new dropdown menu in the 1111 extension.
## Using it with the 1111 text2video extension
Download the contents of this repo to `stable-diffusion-webui\models\text2video`, or manually download just the model folders you want, along with `VQGAN_autoencoder.pth`.
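If you prefer to script the download instead of grabbing files by hand, a minimal sketch using the `huggingface_hub` library is shown below. The repo id here is a placeholder, not confirmed by this card; substitute the actual id of this repository, and adjust the target path to wherever your webui is installed.

```python
# Minimal sketch: download this repo's contents into the 1111 text2video models folder.
# Assumes huggingface_hub is installed (pip install huggingface_hub).
from pathlib import Path

from huggingface_hub import snapshot_download

# Adjust to your stable-diffusion-webui install location.
target_dir = Path(r"stable-diffusion-webui\models\text2video")
target_dir.mkdir(parents=True, exist_ok=True)

snapshot_download(
    repo_id="cerspense/zeroscope_v2_1111models",  # placeholder: use this repo's actual id
    local_dir=target_dir,
)
```

After the download finishes, the individual model folders should appear in the extension's model dropdown the next time the webui starts.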
Thanks to dotsimulate for the config files.
Thanks to camenduru, kabachuha, ExponentialML, VANYA, polyware, and tin2tin.