this post was submitted on 05 Oct 2023
170 points (89.4% liked)
Technology
The avocado had real text. Is Dall-E 3 capable of creating legible text?
Yes, it's the only model that manages to get text right, and the results are usually pretty consistent. It's a big step forward.
Base SDXL and SD1.5 with the help of ControlNet can both do text too. I forgot that DeepFloyd IF can as well.
ControlNets are kind of "cheating", though: they're a form of image-to-image where you provide something to trace over or otherwise guide the model. I think the open-source field has (briefly) fallen behind in this area, and we'll need another round of catch-up. That's fine, though. Let competition drive hard.
It is, yeah
Kind of. It can generate readable text, but not all the time. It will frequently turn parts of your prompt that aren't meant to be text into text, or mix perfectly readable text with AI gibberish.