this post was submitted on 09 Jul 2023
500 points (97.0% liked)
Technology
59583 readers
3037 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Scraping the web is legal and training AI on data is also legal.
Reusing the content you scraped, if copyright protected, is not.
Edit: unless you get the authorization of the original authors but OpenAI didn't even asked, that's why it's a crime.
Sounds like fair use to me.
That really will be the question at hand. Is the ai producing work that could be considered transformative, educational, or parody? The answer is of course yes, it is capable of doing all three of those things, but it's also capable of being coaxed into reproducing things exactly.
I don't know if current copyright laws are capable of dealing with the ai Renaissance.
Yeah it is. The only protection in copyright is called derivative works, and an AI is not a derivative of a book, No more than your brain is after you've read one.
The only exception would be if you manage to overtrain and encode the contents of the book inside of the model file. That's not what happened here because I'll chat GPT output was a summary.
The only valid claim here is the fact that the books were not supposed to be on the public internet and it's likely that the way open AI the books in the first place was through some piracy website through scraping the web.
At that point you just have to hold them liable for that act of piracy, not the fact that the model release was an act of copyright violation.