this post was submitted on 04 Oct 2023

27 points (96.6% liked)

Free Open-Source Artificial Intelligence

2889 readers

1 users here now

Welcome to Free Open-Source Artificial Intelligence!

We are a community dedicated to forwarding the availability and access to:

Free Open Source Artificial Intelligence (F.O.S.A.I.)

More AI Communities

LLM Leaderboards

Developer Resources

GitHub Projects

GitHub Stars

FOSAI Time Capsule

founded 1 year ago

MODERATORS

[email protected]

Mistral 7B Megathread (lemmy.world)

submitted 1 year ago* (last edited 1 year ago) by [email protected] to c/[email protected]

5 comments fedilink hide all child comments

Starting a Mistral Megathread to aggregate resources.

This is my new favorite 7B model. It is really good for what it is. I am excited to see what we can tune together. I will be using this thread as a living document, expect a lot of changes and notes, revisions and updates.

Let me know if there's something in particular you want to see here. I will be adding to this thread throughout my fine-tuning journey with Mistral.

Mistral Model Megathread

Key

Link #1 - Base Model
Link #2 - Instruct Model

Quantized Base Models from TheBloke

GPTQ

GGUF

AWQ

Quantized Samantha Models from TheBloke

GPTQ

GGUF

AWQ

Quantized Kimiko Models from TheBloke

GPTQ

https://huggingface.co/TheBloke/Kimiko-Mistral-7B-GPTQ

GGUF

https://huggingface.co/TheBloke/Kimiko-Mistral-7B-GGUF

AWQ

https://huggingface.co/TheBloke/Kimiko-Mistral-7B-AWQ

Quantized Dolphin Models from TheBloke

GPTQ

https://huggingface.co/TheBloke/dolphin-2.0-mistral-7B-GPTQ

GGUF

https://huggingface.co/TheBloke/dolphin-2.0-mistral-7B-GGUF

AWQ

https://huggingface.co/TheBloke/dolphin-2.0-mistral-7B-AWQ

Quantized Orca Models from TheBloke

GPTQ

https://huggingface.co/TheBloke/Mistral-7B-OpenOrca-GPTQ

GGUF

https://huggingface.co/TheBloke/Mistral-7B-OpenOrca-GGUF

AWQ

https://huggingface.co/TheBloke/Mistral-7B-OpenOrca-AWQ

Quantized Airoboros Models from TheBloke

GPTQ

https://huggingface.co/TheBloke/airoboros-mistral2.2-7B-GPTQ

GGUF

https://huggingface.co/TheBloke/airoboros-mistral2.2-7B-GGUF

AWQ

https://huggingface.co/TheBloke/airoboros-mistral2.2-7B-AWQ

If you like to run any of the quantized/optimized models from TheBloke, do visit the full model pages from each of the quantized model cards to see and support the developers of each fine-tuned model.

Mistral - Mistral.ai
Mistral Samantha - Eric Hartford
Mistral Kimiko - nRuaif
Mistral Dolphin - Eric Hartford
Mistral OpenOrca - OpenOrca/Alignment Lab
Mistral Airoboros - teknium

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 3 points 1 year ago (1 children)

Looks like an interesting model. I couldn't find it on their website, do you know what the training data was for this model?

[–] [email protected] 2 points 1 year ago* (last edited 1 year ago)

https://mistral.ai/news/announcing-mistral-7b/

They don't publish the training dataset. It's a secret. There are open bugreports on their Github, HuggingFace #8, #10, #38 and i think someone said so explicitly on their Discord.