sethforsgren commited on
Commit
9e9f9de
2 Parent(s): 8d982a5 25801bf

Merge branch 'main' of https://huggingface.co/riffusion/riffusion-model-v1

Browse files
Files changed (1) hide show
  1. README.md +46 -0
README.md ADDED
@@ -0,0 +1,46 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: creativeml-openrail-m
3
+ tags:
4
+ - stable-diffusion
5
+ - stable-diffusion-diffusers
6
+ - text-to-image
7
+ - text-to-audio
8
+ inference: true
9
+ extra_gated_prompt: |-
10
+ This model is open access and available to all, with a CreativeML OpenRAIL-M license further specifying rights and usage.
11
+ The CreativeML OpenRAIL License specifies:
12
+
13
+ 1. You can't use the model to deliberately produce nor share illegal or harmful outputs or content
14
+ 2. Riffusion claims no rights on the outputs you generate, you are free to use them and are accountable for their use which must not go against the provisions set in the license
15
+ 3. You may re-distribute the weights and use the model commercially and/or as a service. If you do, please be aware you have to include the same use restrictions as the ones in the license and share a copy of the CreativeML OpenRAIL-M to all your users (please read the license entirely and carefully)
16
+ Please read the full license carefully here: https://huggingface.co/spaces/CompVis/stable-diffusion-license
17
+
18
+ extra_gated_heading: Please read the LICENSE to access this model
19
+ ---
20
+
21
+ # Riffusion v1 Model Card
22
+
23
+ Riffusion is a latent text-to-image diffusion model capable of generating spectrogram images given any text input. These spectrograms can be converted into audio clips.
24
+
25
+ The model was created by [Seth Forsgren](https://sethforsgren.com/) and [Hayk Martiros](https://haykmartiros.com/) as a hobby project.
26
+
27
+ You can use the Riffusion model directly, or try the [Riffusion web app](https://www.riffusion.com/). To learn more about the Riffusion project, please look at [our about page](https://www.riffusion.com/about).
28
+
29
+ The Riffusion model was created by fine-tuning the **Stable-Diffusion-v1-5** checkpoint. Read about Stable Diffusion here [🤗's Stable Diffusion blog](https://huggingface.co/blog/stable_diffusion).
30
+
31
+ ## Model Details
32
+ - **Developed by:** Seth Forsgren, Hayk Martiros
33
+ - **Model type:** Diffusion-based text-to-image generation model
34
+ - **Language(s):** English
35
+ - **License:** [The CreativeML OpenRAIL M license](https://huggingface.co/spaces/CompVis/stable-diffusion-license) is an [Open RAIL M license](https://www.licenses.ai/blog/2022/8/18/naming-convention-of-responsible-ai-licenses), adapted from the work that [BigScience](https://bigscience.huggingface.co/) and [the RAIL Initiative](https://www.licenses.ai/) are jointly carrying in the area of responsible AI licensing. See also [the article about the BLOOM Open RAIL license](https://bigscience.huggingface.co/blog/the-bigscience-rail-license) on which our license is based.
36
+ - **Model Description:** This is a model that can be used to generate and modify images based on text prompts. It is a [Latent Diffusion Model](https://arxiv.org/abs/2112.10752) that uses a fixed, pretrained text encoder ([CLIP ViT-L/14](https://arxiv.org/abs/2103.00020)) as suggested in the [Imagen paper](https://arxiv.org/abs/2205.11487).
37
+
38
+ ## Direct Use
39
+ The model is intended for research purposes only. Possible research areas and
40
+ tasks include
41
+
42
+ - Generation of artworks, audio, and use in creative processes.
43
+ - Applications in educational or creative tools.
44
+ - Research on generative models.
45
+
46
+ *This model card is based on the [DALL-E Mini model card](https://huggingface.co/dalle-mini/dalle-mini).*