News from the "Real World": Riffusion

Thursday, December 29, 2022

Riffusion

www.riffusion.com: This is the Stable Diffusion v1.5 model with no modifications, just fine-tuned on images of spectrograms paired with text. Audio processing happens downstream of the model: the generated spectrogram image is converted to sound afterwards. It can generate infinite variations of a prompt by varying the seed. All the same web UIs and techniques like img2img, inpainting, negative prompts, and interpolation work out of the box.
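To make "audio processing happens downstream of the model" concrete, here is a minimal NumPy sketch of the round trip between audio and a spectrogram image. It is not Riffusion's actual code. The frame size, hop size, log scaling, and the function names (`stft`, `istft`, `magnitude_to_image`, `image_to_magnitude`, `griffin_lim`) are all assumptions chosen for illustration. Because an image stores only magnitudes, the phase has to be estimated on the way back; the sketch uses the Griffin-Lim algorithm for that step.

```python
import numpy as np

N_FFT = 512  # samples per analysis frame (assumed value)
HOP = 128    # samples between frame starts (assumed value)


def stft(signal):
    """Short-time Fourier transform, returned as (freq_bins, frames)."""
    window = np.hanning(N_FFT)
    n_frames = 1 + (len(signal) - N_FFT) // HOP
    frames = np.stack([signal[i * HOP : i * HOP + N_FFT] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1).T


def istft(spec):
    """Inverse STFT by windowed overlap-add."""
    window = np.hanning(N_FFT)
    frames = np.fft.irfft(spec.T, n=N_FFT, axis=1) * window
    length = N_FFT + HOP * (len(frames) - 1)
    audio = np.zeros(length)
    coverage = np.zeros(length)
    for i, frame in enumerate(frames):
        audio[i * HOP : i * HOP + N_FFT] += frame
        coverage[i * HOP : i * HOP + N_FFT] += window ** 2
    # Edge samples with very little window coverage are left at zero.
    return np.divide(audio, coverage, out=np.zeros(length), where=coverage > 0.1)


def magnitude_to_image(magnitude):
    """Log-compress magnitudes into an 8-bit grayscale image."""
    scaled = np.log1p(magnitude)
    peak = max(scaled.max(), 1e-12)  # guard against an all-silent input
    return np.round(255 * scaled / peak).astype(np.uint8), peak


def image_to_magnitude(image, peak):
    """Undo magnitude_to_image, up to 8-bit quantization error."""
    return np.expm1(image.astype(np.float64) / 255 * peak)


def griffin_lim(magnitude, n_iter=32, phase_seed=0):
    """Estimate the missing phase, then return a waveform."""
    rng = np.random.default_rng(phase_seed)
    phase = np.exp(2j * np.pi * rng.random(magnitude.shape))
    for _ in range(n_iter):
        audio = istft(magnitude * phase)
        # Keep the target magnitudes, adopt the phase of the re-analysis.
        phase = np.exp(1j * np.angle(stft(audio)))
    return istft(magnitude * phase)
```

In use, a training clip would pass through `magnitude_to_image(np.abs(stft(clip)))` to become an image the model can learn from, and a generated image would pass through `griffin_lim(image_to_magnitude(image, peak))` to become audio. Note that `phase_seed` only sets the starting guess for the phase and has nothing to do with the diffusion seed that produces different variations of a prompt.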

CMU School of Drama
