this post was submitted on 07 Aug 2024
32 points (92.1% liked)

Open Source

31220 readers
295 users here now

All about open source! Feel free to ask questions, and share news, and interesting stuff!

Useful Links

Rules

Related Communities

Community icon from opensource.org, but we are not affiliated with them.

founded 5 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[โ€“] [email protected] 4 points 3 months ago (1 children)

BERT and early versions of GPT were trained on copyright free datasets like Wikipedia and out of copyright books. Unsure if those would be big enough for the modern ChatGPT types

[โ€“] [email protected] 2 points 3 months ago (1 children)

@flamingmongoose @cmnybo

> copyright free datasets like Wikipedia

๐Ÿคฆโ€โ™‚๏ธ

[โ€“] [email protected] 1 points 3 months ago

What's up with that? Appreciate they're permissive rather than copyright free as such