this post was submitted on 29 Jun 2024
39 points (83.1% liked)
Asklemmy
43889 readers
947 users here now
A loosely moderated place to ask open-ended questions
Search asklemmy π
If your post meets the following criteria, it's welcome here!
- Open-ended question
- Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
- Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
- Not ad nauseam inducing: please make sure it is a question that would be new to most members
- An actual topic of discussion
Looking for support?
Looking for a community?
- Lemmyverse: community search
- sub.rehab: maps old subreddits to fediverse options, marks official as such
- [email protected]: a community for finding communities
~Icon~ ~by~ ~@Double_[email protected]~
founded 5 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
GPT-4 is apparently the model to beat. I haven't seen all that much difference in practice between GPT-4 and 4o. I've heard various claims about various other models outperforming it (notably including Claude) but I haven't seen the claims materialize over the long haul as yet.
I have however heard that Mistral can get quite close to GPT-4, run for free locally with the right hardware, if you build up a hand curated set of around 100 query/response pairs from GPT-4 that are what you want it to do, and then fine-tune Mistral against that training set. I haven't tried it but that's what I've heard.
And also, any recommendations on a specific GPT4 addon or is the base model pretty much perfect as is?
GPT-4 generally doesnβt need fine tuning or anything no