this post was submitted on 23 Feb 2024
183 points (97.9% liked)
Stable Diffusion
4287 readers
1 users here now
Discuss matters related to our favourite AI Art generation technology
Also see
Other communities
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Well, Copilot tried I guess. Same prompt, though I had to add “generate a” to get it to make an image and for some reason it cropped out “photo of” in the final result:
Let's see how SDXL does @[email protected] draw for me a red sphere on top of a blue cube. Behind them is a green triangle, on the right is a dog, on the left is a cat
Here are some images matching your request
Prompt: a red sphere on top of a blue cube. Behind them is a green triangle, on the right is a dog, on the left is a cat
Style: fustercluck
It's ok SDXL, you tried
What a fustercluck
Yeah... SD up till now has been just really good at people but terrible at multiple concepts. I've been pretty impressed with Dall-E 3, hoping SD 3 catches up or surpasses it.
Given the infinite pockets of OpenAI, I doubt this is possible, But if they get close enough, the FOSS community is having weekly breakthroughs and can take it much further. Just look at how good the SD 1.5 finetunes and customization is by now
SD 1.5 needs something like controlnet and inpaint to get close to Dall-E 3. I'm just amazed how Dall-E can do all that without any extra work.
But yeah, really hoping 3 has the community friendly tunability with at least some of that power that Dall-E has.
Heh, that third picture with the blue cat face. Funny, the other cat has the colors of the dog it wanted, but turned it into a cat.
I see they trained their AI on Word clipart.