this post was submitted on 18 Dec 2023
Technology


Data poisoning: how artists are sabotaging AI to take revenge on image generators::As AI developers indiscriminately suck up online content to train their models, artists are seeking ways to fight back.

[–] [email protected] 3 points 10 months ago (2 children)

| using it to train their plagiarism machines

That's simply not how AI works. If you look inside the models after training, you will not see a shred of the original training data, just a bunch of numbers and weights.

[–] [email protected] 5 points 10 months ago

| Just a bunch of numbers and weights

I agree with your sentiment, but the point isn't just that the data is encoded as a model; it's that the encoding is extremely lossy. Compression, encoding, digital photography, etc. all just turn pictures into different numbers to be processed by some math machine. What makes a huge difference is that a huge amount of information is intentionally lost during training. If it were just compression, it would be a game-changing piece of tech for other reasons, and YouTube would be using it today. But it is not good at preserving the original training data.
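To put rough numbers on "lossy", here is a back-of-envelope sketch. Every figure in it is a made-up placeholder, not a measurement of any real model or dataset, and `bytes_per_training_item` is just a hypothetical helper name. It only computes how many bytes of weights exist per training image on average:

```python
def bytes_per_training_item(num_parameters: int,
                            bytes_per_parameter: int,
                            num_training_items: int) -> float:
    """Average number of bytes of model weights per training item."""
    if num_training_items <= 0:
        raise ValueError("num_training_items must be positive")
    return num_parameters * bytes_per_parameter / num_training_items


# Hypothetical placeholder values, chosen only for illustration.
ASSUMED_PARAMETERS = 1_000_000_000       # weights in the model
ASSUMED_BYTES_PER_PARAMETER = 2          # e.g. 16-bit weights
ASSUMED_TRAINING_IMAGES = 2_000_000_000  # images seen during training
ASSUMED_IMAGE_SIZE_BYTES = 100_000       # a typical compressed image file

budget = bytes_per_training_item(
    ASSUMED_PARAMETERS, ASSUMED_BYTES_PER_PARAMETER, ASSUMED_TRAINING_IMAGES
)
print(f"Average weight budget per image: {budget:.2f} bytes")
print(f"Assumed image size is {ASSUMED_IMAGE_SIZE_BYTES / budget:,.0f}x larger")
```

This is only an average budget. It says nothing about whether any particular image ends up memorized, but it shows why the weights can't be a general-purpose archive of the training set.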

Rant not really for you, but in case someone else nitpicks in the future :)

[–] [email protected] 1 points 10 months ago (1 children)

If the individual images are so unimportant then it won't be a problem to only train it on images you have the rights to.

[–] [email protected] 3 points 10 months ago (1 children)

They do have the rights because this falls under fair use. It doesn't matter if a picture is copyrighted as long as the outcome is transformative.

[–] [email protected] 3 points 10 months ago

I'm sure you know something the Valve lawyers don't.