this post was submitted on 24 Oct 2024
29 points (100.0% liked)

Opensource

1366 readers
47 users here now

A community for discussion about open source software! Ask questions, share knowledge, share news, or post interesting stuff related to it!

CreditsIcon base by Lorc under CC BY 3.0 with modifications to add a gradient



founded 1 year ago
MODERATORS
 

It’s available via Google’s Responsible Generative AI Toolkit.

Repo: https://github.com/google-deepmind/synthid-text

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 5 points 3 weeks ago

It gives an example:

For example, with the phrase “My favorite tropical fruits are __.” The LLM might start completing the sentence with the tokens “mango,” “lychee,” “papaya,” or “durian,” and each token is given a probability score. When there’s a range of different tokens to choose from, SynthID can adjust the probability score of each predicted token, in cases where it won’t compromise the quality, accuracy and creativity of the output.

So I suppose with a larger text, if all lists of things are "LLM Sorted", it's an indicator.

That's probably not the only thing, if it can detect a bunch of these indicators, there's a higher likelihood it's LLM text