[KIT] Music to Image • v1 - a fffiloni Collection

updated Oct 3

Everything you need to reproduce my Music-to-Image demo

🎵🎵🎵 Lp Music Caps • Running • 154

Note: Describes the mood of the music.

⚡ Demucs • Runtime error • 201

Note: Optional. Separates the audio into individual tracks; used here to isolate the vocals, which are then passed to Whisper.
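
The optional vocal-isolation step could be sketched by calling the Demucs command-line tool from Python. This is a minimal sketch under stated assumptions, not the code the Space itself runs: it assumes the `demucs` executable is installed, and the helper names, the `separated` output folder and the `htdemucs` model name are illustrative choices based on Demucs defaults.

```python
import subprocess
from pathlib import Path


def demucs_command(audio_path, out_dir="separated"):
    """Build the CLI call that splits a track into vocals and accompaniment."""
    return ["demucs", "--two-stems", "vocals", "-o", str(out_dir), str(audio_path)]


def expected_vocals_path(audio_path, out_dir="separated", model="htdemucs"):
    """Where the vocal stem should land.

    Assumption: Demucs writes <out_dir>/<model name>/<track name>/vocals.wav.
    """
    return Path(out_dir) / model / Path(audio_path).stem / "vocals.wav"


def isolate_vocals(audio_path, out_dir="separated"):
    """Run Demucs (requires it to be installed) and return the vocal stem path."""
    subprocess.run(demucs_command(audio_path, out_dir), check=True)
    return expected_vocals_path(audio_path, out_dir)
```

Calling `isolate_vocals("song.mp3")` would then give a path that can be handed to the transcription step.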

🤫 Whisper Large V2 • Running on T4 • 185

Note: Whisper transcribes the lyrics.
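
The transcription step might look like the following when run locally instead of through the Space. It is a hedged sketch: it assumes the `transformers` library (plus `ffmpeg` for decoding audio files) is installed, that the model is loaded from the `openai/whisper-large-v2` checkpoint, and the function names are hypothetical.

```python
def clean_transcript(text):
    """Collapse runs of whitespace and newlines into single spaces."""
    return " ".join(text.split())


def transcribe_lyrics(vocals_path, model_id="openai/whisper-large-v2"):
    """Transcribe an audio file with Whisper and return tidy text."""
    # Imported lazily so the helper above stays usable without transformers.
    from transformers import pipeline

    asr = pipeline(
        "automatic-speech-recognition",
        model=model_id,
        chunk_length_s=30,  # process long songs in 30-second chunks
    )
    result = asr(str(vocals_path))
    return clean_transcript(result["text"])
```

Feeding it the isolated vocals rather than the full mix is what the Demucs step is for; the transcript may still be empty or noisy for instrumental tracks.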

meta-llama/Llama-2-7b

Text Generation • Updated Apr 17 • 4.13k

Note: Llama does most of the work. It takes the LP-Music-Caps description from the first step, plus the optional lyrics transcription, and writes an image description that should match your music input.
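
One way to combine the two inputs into a single instruction for Llama is sketched below. The prompt wording and the function name `build_llama_instruction` are assumptions made for illustration; the demo's actual prompt is not shown in this collection.

```python
def build_llama_instruction(music_caption, lyrics=None):
    """Merge the music caption and optional lyrics into one text prompt."""
    parts = [
        "Write a one-sentence image description that matches this music.",
        f"Music description: {music_caption.strip()}",
    ]
    # Lyrics are optional: skip the line when nothing usable was transcribed.
    if lyrics and lyrics.strip():
        parts.append(f"Lyrics: {lyrics.strip()}")
    parts.append("Image description:")
    return "\n".join(parts)
```

The returned string is what you would send to the language model; its completion becomes the image description used in the next step.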

stabilityai/stable-diffusion-xl-base-1.0

Text-to-Image • Updated Oct 30, 2023 • 2.85M • 5.94k

Note: Llama has produced an image description; use it as the prompt to generate an image with the SDXL model.
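
The final step could be run locally along these lines. This is a sketch, assuming the `diffusers` and `torch` packages and a CUDA GPU are available; the 60-word budget is an arbitrary illustrative value (SDXL's CLIP text encoders truncate prompts beyond 77 tokens), and both function names are hypothetical.

```python
def sdxl_prompt(description, max_words=60):
    """Normalize whitespace and keep at most max_words words."""
    words = description.split()
    return " ".join(words[:max_words])


def generate_image(description, model_id="stabilityai/stable-diffusion-xl-base-1.0"):
    """Generate one image from the description (needs diffusers, torch, a GPU)."""
    # Imported lazily so sdxl_prompt() works without the heavy dependencies.
    import torch
    from diffusers import DiffusionPipeline

    pipe = DiffusionPipeline.from_pretrained(
        model_id,
        torch_dtype=torch.float16,
        variant="fp16",
        use_safetensors=True,
    )
    pipe.to("cuda")
    return pipe(prompt=sdxl_prompt(description)).images[0]
```

The result is a PIL image, so `generate_image(text).save("cover.png")` would write it to disk.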

🎶🌅 Music To Image • Paused • 266