this post was submitted on 15 Oct 2024
115 points (95.3% liked)
Technology
Scanning text is OCR, and OCR has never needed modern LLMs integrated to achieve amazing results.
Automated tagging is a closer fit, but there is a metric shit ton that can be done there with incredibly simple tools that neither use an egregious amount of energy nor hallucinate.
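To give a sense of how simple "incredibly simple" can be, here is a minimal sketch of frequency-based tagging using only the Python standard library. Everything in it is an assumption for illustration: the `suggest_tags` function, the tiny `STOPWORDS` set, and the sample record are made up, and this is not a description of what NARA actually runs.

```python
import re
from collections import Counter

# Hypothetical stopword list, far too short for real use.
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "for", "on", "by", "is", "was", "with"}


def suggest_tags(text, max_tags=3):
    """Return the most frequent non-stopword terms as candidate tags."""
    # Lowercase the text and keep alphabetic runs only (digits are dropped).
    words = re.findall(r"[a-z]+", text.lower())
    # Count every word that is not a stopword and is longer than two letters.
    counts = Counter(w for w in words if w not in STOPWORDS and len(w) > 2)
    return [word for word, _ in counts.most_common(max_tags)]


# Made-up sample record, standing in for OCR output.
record = "Letter to the Treasury on treasury bonds and bonds issued by the Treasury in 1863"
print(suggest_tags(record))
```

No model, no GPU, and it cannot invent a tag that is not in the text. Real pipelines would add things like stemming or a controlled vocabulary, but the point stands that the baseline is cheap.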
There is no way in hell that they aren't already doing these things. The best use cases for LLMs at NARA are edge cases of tasks that existing tech already mostly covers.
And you and I both know this is going to give Google exclusive access to National Archives data. New training data that isn't tainted by potentially being LLM output is an insanely valuable commodity now that the hype is dying down and algorithmic advances are slowing.