this post was submitted on 10 Jul 2023
124 points (91.9% liked)

Technology

59298 readers
4979 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

Sarah Silverman, Christopher Golden, and Richard Kadrey are suing OpenAI and Meta over violation of their copyrighted books. The trio says their works were pulled from illegal “shadow libraries” without their consent.

top 48 comments
sorted by: hot top controversial new old
[–] [email protected] 13 points 1 year ago (2 children)

Good. Artists should get paid extra for AIs being trained on their stuff. Doing it for free is our job.

[–] [email protected] 19 points 1 year ago (2 children)

Should those artists pay the other artists they studied?

[–] [email protected] 0 points 1 year ago* (last edited 1 year ago) (2 children)

in design school, I had to pay for the books I bought which contained the images of the art. whomever owns those images got paid for the license to appear in the book. when I go to museums, I had to pay (by admission price or by the tax dollars that go into paying for the museum’s endowment), and that pays for the paintings/sculptures/etc.

whenever I saw or see art, in one context or another, there’s some compensatory arrangement (or it’s being/has been donated— in which case, it’s tax-deductible).

edit: then again, my work is not a remixed amalgam of all of the prior art I consumed— unlike AI, I am capable of creating new unique works which do not contain any of the elements of original works I may be seen or learned from previously. I am able to deconstruct, analyze, and implement nuanced constructs such as style, method, technique, and tone and also develop my own in the creation of an original work without relying on the assimilation and reuse of other original works in part or whole. AI cannot.

for this reason I find this a flawed premise— comparing what an artist does to what LLMs or AI do is logically flawed because they aren’t the same thing. LLMs can only ever create derivative works, whereas human artists are capable of creating truly original works.

[–] [email protected] 7 points 1 year ago* (last edited 1 year ago) (1 children)

In all those instances you paid for the physical resources.

These AI are just automating remix culture.

Human creations should be free for all of us to use where possible (e.g. material costs for say a book).

[–] [email protected] 4 points 1 year ago (1 children)

In all those instances you paid for the physical resources.

not only the physical resources but also the licensing fees for the images and the labor required to research and assemble the books. none of that was free, either. some of those design texts were hundreds of dollars-- and, no, I'm not referring to your bullshit college textbooks that have meaningless markups.

while I agree with the philosophy that all human knowledge should be free and that we should all have free access to art and media and whatever, I'm simply explaining that I did, in fact, as an art/design student studying art/design pay for the material I learned from, including the art to which I was exposed (or there was some other form of compensation involved). i am not arguing whether or not that should or should not be the case.

i also made a clear distinction between what a human artist is capable of achieving and what an ai is capable of achieving. perhaps I should have continued to state that it for this season that I believe artists should be compensated when AIs train using their works/data due to the difference between how they use it and how a human artist uses it when creating original works vs ai-generated pieces.

[–] [email protected] -1 points 1 year ago (1 children)

Except you didn't pay for generations of art culture endemic to human nature that took us from cave paintings to lying buttresses to modern design

[–] [email protected] 0 points 1 year ago (1 children)

that's an abstract concept, not a quantifiable object, product, or service that can be measured in terms of monetary value. if you move the goal posts any further, you might as well suggest I pay for having a soul, lol.

[–] [email protected] -1 points 1 year ago

I'm not the one suggesting this absurd idea

[–] [email protected] 5 points 1 year ago* (last edited 1 year ago) (1 children)

Everything is a remix. Including all the work you’ve ever done. And everything I’ve ever done. Nothing is wholly original.

[–] [email protected] -2 points 1 year ago (1 children)

But it is partially original. With AI nothing is original.

[–] [email protected] 4 points 1 year ago (1 children)

No. This fundamentally misrepresents the ai models.

[–] [email protected] -4 points 1 year ago (2 children)

No it doesn’t.

AI doesn’t generate anything new. It uses mathematical models to rearrange and regurgitate what it’s already been given. There’s no creation, there’s nothing original. It’s simply stats.

[–] [email protected] 1 points 1 year ago (1 children)
[–] [email protected] 0 points 1 year ago (1 children)

Original interpretation and human input. There’s neither with AI. AI does not create anything. Period. Full stop. No question about it. It’s an objective fact that it doesn’t create anything new.

[–] [email protected] 2 points 1 year ago (1 children)

Ants create things. Creation isn’t some complex higher functioning organism trait.

If it didn’t exist before and it does after, it was created. It doesn’t matter if it’s a mash of other content or made by a human.

[–] [email protected] -2 points 1 year ago

I fundamentally disagree. Creation needs some degree of originality. Otherwise it’s just a rehash of what exists already, and that’s not creation of anything new.

I never said non-humans can’t create. I said AI can’t. And I stand by that. There is nothing original about anything a LLM does. It’s statistical analysis on a crazy amount of data. Not creating.

[–] [email protected] 0 points 1 year ago (1 children)
[–] [email protected] 1 points 1 year ago

It is

Neural Networks and by extension LLMs are simply statistical models reorganizing training data based on statistical probability. It’s not creating anything new, it’s taking the inputs and more or less “translating” it.

[–] [email protected] -3 points 1 year ago (3 children)
[–] [email protected] 14 points 1 year ago (1 children)

Unworkable copyright maximalist take that wouldn’t help artists but would further entrench corporate IP holders.

[–] [email protected] -3 points 1 year ago (1 children)

You want to try explaining how, or is throwing basic claims it?

[–] [email protected] 7 points 1 year ago (1 children)

What, explain why "artists should pay artists that they study" is an unworkable copyright maximalist take? No, that's self evident. How it won't actually help artists, but would further entrench the corporate IP hoarders? No, I won't do that either. It's self evident. If your position is literally that artists should pay the artists that inspire them and that they study, you're a deeply unserious person whose position doesn't deserve to be seriously debated.

[–] [email protected] -3 points 1 year ago (1 children)

Uh huh. So you don't actually want to discuss, you just want to be insulting and shut down conversation?

[–] [email protected] 1 points 1 year ago (1 children)

No it's just a nonsense suggestion.

[–] [email protected] 5 points 1 year ago (1 children)

At least you’re consistent!

[–] [email protected] -4 points 1 year ago* (last edited 1 year ago)

I find that a little bit of a specious argument actually. An LLM is not a person, it is itself a commercial derivative. Because it is created for profit and capable of outproducing any human by many orders of magnitude, I think comparing it to human training is a little simplistic and deceptive.

[–] [email protected] 4 points 1 year ago (1 children)
[–] [email protected] -3 points 1 year ago

Yes, quite. Why wouldn't I be?

[–] [email protected] 10 points 1 year ago (1 children)

But there's no evidence, in this case anyway, that it was trained using the entire book(s). Multiple summaries of the author's works are available on various sites in the public domain, and GPT is capable of amalgamating all of them and summarizing it.

Now if you asked it to reproduce an entire book, or say some random non-free chapter or excerpt exactly word-by-word, that would be a issue, but so far I haven't seen any evidence that it was able to do so.

[–] [email protected] 0 points 1 year ago (1 children)

That'll come out during the case. I assume they have evidence, otherwise suing would be a waste of time. Unless some lawyer is taking them for a ride.

You only do need 51% certainty to win in civil court, though, so maybe they think they can just argue it? Still though, I'd want some sound evidence before going to court. Unless it's just a slapp-style suit, but that doesn't really fit.

[–] [email protected] 4 points 1 year ago (1 children)

That’s an incredibly bold assertion.

[–] [email protected] 0 points 1 year ago

Do you never make those?

[–] [email protected] 12 points 1 year ago (2 children)

It's simple: if you have to pay "copyright holders" for anything you use your AI training on, there can be no AI training. They need to ingest all the data they can to become better and it would cost dozen of billions if you had to pay every single piece of content. So we have to pick between a future in which "copyright holders" fight to get their $ or a future where we can push AI to enter a new era

[–] [email protected] 12 points 1 year ago

I really don't think it should be considered copyright infringement to simply ingest data. It doesn't infringe on copyright for a person to read a book, why should it matter if it's a simple program or an AI doing it?

That said, if the AI produces something in the exact words or style of a creator without attribution, just like with a person, then it should count as copyright infringement.

It's all about perceived harm. A creator is not harmed by an AI reading their works. But they are harmed when the AI can produce their style to potentially take business away from them.

[–] [email protected] 2 points 1 year ago (1 children)

If it isn't copyright infringement to read a book and apply those ideas to make a product (which it isn't), then it isn't copyright infringement to train an LLM with the info in a piece of media.

Pretty cut and dry.

[–] [email protected] -1 points 1 year ago

But if you pirated a book to read it, then applied those ideas to make a product, you still committed a crime.

[–] [email protected] 7 points 1 year ago

https://www.techdirt.com/2019/04/01/getty-images-sued-yet-again-trying-to-license-public-domain-images/

The only good outcome is if copyright is asymmetrical and unfair to big companies. It destroys human culture if Disney sues everybody every time they hum 2 seconds of a cartoon song. It also destroys human culture if every time somebody posts something for free on the internet a deranged billionaire pops up and gloats about how he's going to bury your post at the bottom of google and copy your answer into his database and use it to scam $100/month out of everybody you were trying to help for free.

[–] [email protected] 5 points 1 year ago* (last edited 1 year ago) (1 children)

Sarah Silverman is going to lose a suit. News at 11. Scraping is protected. This is settled law.

[–] [email protected] 4 points 1 year ago* (last edited 1 year ago)

If you read the article, it called out that this is not protected by law. They are claiming open ai got access to her books and works through sites that had illegally obtained it.

This is not covered by previous rulings around scraping.

load more comments
view more: next ›