this post was submitted on 17 May 2024
5 points (100.0% liked)
Technology
59020 readers
4241 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Neither of these things are true.
It does create world models (see the Othello-GPT papers, Chess-GPT replication, and the Max Tegmark world model papers).
And while it is trained on predicting the next token, it isn't necessarily doing it from there on out purely based on "most probable" as your sentence suggests, such as using surface statistics.
Something like Othello-GPT, trained to predict the next move and only fed a bunch of moves, generated a virtual Othello board in its neural network and kept track of "my pieces" and "opponent pieces."
And that was a toy model.
AKA Othello-GPT chooses moves based on statistics.
Ofc it's going to use a virtual board in this process. Why would a computer ever use a real one board?
There's zero awareness here.