this post was submitted on 10 Jul 2023
219 points (100.0% liked)

Technology

[–] [email protected] 28 points 1 year ago* (last edited 1 year ago) (15 children)

People keep taking issue with this article's use of "summarizing" and its link to Wikipedia... Summaries of copyrighted work are obviously not illegal.

This article is oversimplified and does a crummy job of explaining the problem. Ars Technica does a much better job explaining.

The fact that the AI can summarize these works in detail is proof that it was trained on copyrighted material without permission (which is not fair use). Sarah Silverman is obviously not going to be hurt financially by this, but there are hundreds of thousands of authors who definitely will be affected. They have every right to sue.

[–] [email protected] 12 points 1 year ago (14 children)

Why does "fair use" even fall into it? I'm not familiar with their specific license, but the general definition of copyright is:

A copyright is a type of intellectual property that gives its owner the exclusive right to copy, distribute, adapt, display, and perform a creative work, usually for a limited time.

Nothing was copied, or distributed (in a form that anybody can consider "The Work"), or displayed, or performed. The only possible legal argument they have is adapting as a derivative work. And anybody who is familiar with how an LLM works knows that the form that results from reading in content is completely different from the source.

LLMs/LDMs are not taking in billions of books and putting them into a database. Training is a very lossy process. Stable Diffusion was trained on billions of images, yet the resulting model is about 4 GB. There is no universe where you can store billions of images in a mere 4 GB. Stable Diffusion cannot and will not, pixel-by-pixel, reproduce a Van Gogh. It can make something that kind of looks like a Van Gogh, but styles are not copyrightable.
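To put rough numbers on that, here is a back-of-the-envelope sketch. The 4 GB figure is the one quoted above; the image count of 2 billion is only a stand-in for "billions", and the 100 kB size for a compressed image is likewise an assumption for illustration, not a measured value.

```python
# Back-of-the-envelope check of the storage argument above.
MODEL_SIZE_BYTES = 4 * 10**9        # "4 GB" from the comment (decimal gigabytes)
TRAINING_IMAGES = 2 * 10**9         # ASSUMPTION: stand-in for "billions of images"
TYPICAL_IMAGE_BYTES = 100 * 10**3   # ASSUMPTION: ~100 kB for one compressed image


def bytes_per_image(model_bytes: int, image_count: int) -> float:
    """Average number of model bytes available per training image."""
    return model_bytes / image_count


budget = bytes_per_image(MODEL_SIZE_BYTES, TRAINING_IMAGES)
shortfall = TYPICAL_IMAGE_BYTES / budget

print(f"{budget:.1f} bytes of model per training image")
print(f"a {TYPICAL_IMAGE_BYTES // 1000} kB image is {shortfall:,.0f}x larger than that budget")
```

With those assumed numbers, the model has about 2 bytes per training image on average, which is tens of thousands of times too small to hold even a compressed copy of each one.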

The same applies to an LLM like ChatGPT. It cannot reproduce entire books, or anywhere close to that. If you ask it to recreate Page 25 of Silverman's book, it can't do it. If it doesn't even contain a minor portion of the original material, it can't even be considered a derivative work.

They don't have a case. They have a lot of publicity and noise, but they will lose to inevitability.

[–] [email protected] 12 points 1 year ago* (last edited 1 year ago) (13 children)

You make a lot of excellent points, but I think the main issue of contention is just using copyrighted work to train generative AI without the author's permission regardless.

If they did ask permission, there would be no problem. But an author or artist should be given the choice of whether their work is used to train an AI.

[–] [email protected] 4 points 1 year ago (1 children)

I think the main issue of contention is just using copyrighted work to train generative AI without the author’s permission regardless.

You must define that in legal terms. This is a lawsuit, after all. It's not illegal to "just use" copyrighted work. The words "generative AI" are not in a federal or state bill anywhere in the US.

They can have an "issue of contention" all they want, but if they can't prove anything legally, they have nothing.

[–] [email protected] 1 points 1 year ago

Exactly! You can't just be like "AI bad" in front of the judge ._.
