this post was submitted on 05 Jun 2024
134 points (82.8% liked)

Technology

59583 readers
3626 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 2 points 5 months ago (1 children)

LLMs performance are getting closer to plateau due to lack of data easily available. OpenAi is going around trying to license some data, but it won't be enough.

The company with more touch points with users is better positioned to transform these into Data Probes. Msft has windows, Apple has iOS and Google... Well Google is fucked because the other two have OS level access and can restrict what Google collects.

Now that LLM Foundation models are out, the game will be "who can get the most data" to retrain, optimise and ultimately monetise these models. And there's another whole "can of worms" with the legality of training models with unlicensed data collected trough "system snapshots". I.e.: Collecting NY Times data through windows snapshots of users that visit the site.

[–] [email protected] 2 points 5 months ago (1 children)

I mean I’m cool with that. I get a lot of use out of it as is.

[–] [email protected] 3 points 5 months ago (1 children)
[–] [email protected] 3 points 5 months ago

Wow no warning.