this post was submitted on 08 Jan 2024
334 points (96.1% liked)

Technology


Microsoft, OpenAI sued for copyright infringement by nonfiction book authors in class action claim::The new copyright infringement lawsuit against Microsoft and OpenAI comes a week after The New York Times filed a similar complaint in New York.

[–] [email protected] 7 points 10 months ago (1 children)

There's a big difference between borrowing inspiration and just using entire paragraphs of text or images wholesale. If GRRM used entire paragraphs of JK Rowling with just the names changed, and the same cover with a few different colors, you'd have the same fight. An LLM can do the first, but it also does the second.

The "in the style of" is a different issue that's being debated, as style isn't protected by law. But apparently if you ask for something in the style of an author, the LLM can get lazy and produce parts of the (copyrighted) source material instead of something original.

[–] [email protected] 4 points 10 months ago (1 children)

Just as with the right query you could get an LLM to output a paragraph of copyrighted material, you can with the right query get Google to give you a link to copyrighted material. Does that make all search engines illegal?

[–] [email protected] 7 points 10 months ago* (last edited 10 months ago) (1 children)

Legally it's very different. One is a link, the other content. It's the same difference as pointing someone to the street where the dealers hang out or opening your coat and asking how many grams they want.

[–] [email protected] 4 points 10 months ago (1 children)

Websites that provide links to copyrighted material are illegal in the US. It's why torrent sites are taken down and need to be hosted in countries with different copyright laws.

So Google can be used to pirate, but that's not its intention. It requires careful queries to get Google to show pirate links. Outlawing every tool that could be used for unintentional copyright violation would make all search engines illegal.

It could even make all programming languages illegal. I could use C to write a program to add two numbers or to crawl the web and return illegal movies.

[–] [email protected] 4 points 10 months ago

Oh. Linking and even downloading torrents is legal in my place. Hosting and sharing is not. My bad.

As I understand it, the copyright holders want the LLM to do at least what Google is doing against torrents: check that no parts of the source material are in the output.
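
For what it's worth, a check like that could be sketched roughly as below. This is only an illustration under my own assumptions, not how any actual LLM vendor or Google does it: the function names and the 8-word threshold are made up, and a real system would need to handle paraphrase, punctuation, and a far larger corpus.

```python
def word_ngrams(text, n):
    """Return the set of all runs of n consecutive lowercase words in text."""
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def shares_long_passage(output, source, n=8):
    """Return True if any run of n consecutive words in output also appears in source.

    n=8 is an arbitrary, hypothetical threshold chosen for this sketch.
    """
    return not word_ngrams(output, n).isdisjoint(word_ngrams(source, n))


if __name__ == "__main__":
    source = "the quick brown fox jumps over the lazy dog near the old river bank"
    copied = "he wrote that the quick brown fox jumps over the lazy dog again"
    original = "a small grey cat sleeps under the warm kitchen table every afternoon"
    print(shares_long_passage(copied, source))    # verbatim run of 8+ words
    print(shares_long_passage(original, source))  # no shared run
```

This only catches word-for-word copying of at least `n` words, so "names changed" copying like the GRRM example earlier in the thread would slip through.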