Infinite Stable Diffusion Videos

  • Sorry about the bugs, I've just released an update. The site's music should no longer shatter your eardrums until after you touch the unmute button.

    The videos are generated from random https://lexica.art prompts. Each video holds a single prompt fixed and linearly interpolates between two random seeds, then loops the result with an ffmpeg filter_complex reverse/concat filter. Music comes from various Creative Commons / free sources.

    Source code at https://github.com/lwneal/duckrabbit/

    Hosted on a single $7 node at https://www.digitalocean.com
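    The interpolation step described above can be sketched roughly like this. This is a minimal illustration, not the repo's actual code: `seed_latent` and `interpolate_latents` are hypothetical names, and in a real pipeline each interpolated latent would be passed to the diffusion model (with the same fixed prompt) to render one frame.

    ```python
    import numpy as np

    def seed_latent(seed, shape=(4, 64, 64)):
        """Initial noise latent for a given seed (typical SD latent shape)."""
        return np.random.default_rng(seed).standard_normal(shape)

    def interpolate_latents(seed_a, seed_b, num_frames):
        """Linearly interpolate between the noise latents of two seeds.

        Each returned latent corresponds to one video frame; denoising
        every latent with the same prompt yields the morphing effect.
        """
        a, b = seed_latent(seed_a), seed_latent(seed_b)
        ts = np.linspace(0.0, 1.0, num_frames)
        return [(1.0 - t) * a + t * b for t in ts]

    frames = interpolate_latents(seed_a=123, seed_b=456, num_frames=30)
    ```

    The reverse/concat loop mentioned above then plays the rendered clip forward and backward so the video ends where it began.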

  • Bug report:

    Sound permanently on after hitting next button once on Chromium 104.

    The next button does move to the next video, but it also enables sound. This breaks the internal state: the button still shows the muted symbol even though the sound is now on. Hitting the mute button switches the symbol to unmuted, with the sound still on as before. Hitting it again to mute doesn't work and doesn't change the symbol back to muted.

  • Holy fuck, be sure to turn down your sound before you visit. That should be muted by default.

  • I know this is silly, but I can't wait for games to have automatically generated "levels" that look like this. I guess 3D training and output is probably minimally researched at this point, though there is NeRF research... at some point all of this research will truly show off its potential beyond pretty pictures.

  • This is neat. Some text that describes how this was made would be useful. Also the mute button doesn't work.

  • As an aside, it would be cool if music could also be an input to these kinds of generative models, such that the generated image somehow matches the feeling or mood of the music.

  • This looks almost silly now. But I'd bet that in a few years, we will see a full movie, created mostly with the equivalent of Stable Diffusion, win an Oscar.

    My bet is that this will happen 8-9 years from now, but it's just a guess.

    I think it's hard to challenge the fact that it WILL happen, at some point in our lifetimes.

  • Well, at least it's a side-project AI site that didn't crash immediately.

  • How long before we have interactive, full-fidelity generated games/films?

  • What if there was a way to "increase frame rate" by adding some kind of logic checker between two generated images? Kind of like a comparison between two generated frames that would lead to more generated images that mimic movement: a filler between frames that predicts how something got from one shape to another, using a set of properties that a generated object has. Those properties could be weight, speed, gravity, etc. -- it just depends on what object it is conceptualizing or constructing.

  • How does this work?

  • Not so “stable” then /jk

    But please change the music!

  • Cool idea! How do you get the prompts from lexica? I cannot find that in the repository.

  • Calling these "videos" is a bit of a stretch I think.

  • I've been following r/StableDiffusion on Reddit for a while and was wondering whether this can also be used for anything that doesn't look like a cheap fantasy or science fiction novel cover.

    This is an honest question; I haven't seen any examples of anything else, so I have to wonder whether the models they're using are specialized for the sci-fi and fantasy "airbrush/digital" style. Why?