this post was submitted on 26 May 2024
382 points (97.0% liked)
Not The Onion
12224 readers
686 users here now
Welcome
We're not The Onion! Not affiliated with them in any way! Not operated by them in any way! All the news here is real!
The Rules
Posts must be:
- Links to news stories from...
- ...credible sources, with...
- ...their original headlines, that...
- ...would make people who see the headline think, “That has got to be a story from The Onion, America’s Finest News Source.”
Comments must abide by the server rules for Lemmy.world and generally abstain from trollish, bigoted, or otherwise disruptive behavior that makes this community less fun for everyone.
And that’s basically it!
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Oh, it’s worse than that.
Google’s “AI” results feed you things for 10 year old Reddit posts that are subtle (but sometimes, also not so subtle) bullshit.
Whatever they’re using to curate training data is evidently pretty awful at detecting shitposts.
Bold of you to assume they're curating their training data.
Those underpaid Indians probably aren't very good at picking up irony, even if they give a shit.
Most of the curation or fine tuning is done in low income African countries so this is little surprising. They‘re cheap labour but you can‘t expect them to reliably detect sarcasm or notice mistakes in specialized fields. They basically give a thumbs up whenever the AI sounds convincing. Of course that includes instances where it‘s confidently wrong and that appears to be most of the time with this model.
It's not a training data issue, look up Retrieval Augmented Generation. It's basically serving up stuff on the web and taking it as gospel.
That's bullwhip why can't it just think for itself