this post was submitted on 04 Oct 2023
27 points (96.6% liked)

Free Open-Source Artificial Intelligence

2889 readers
1 users here now

Welcome to Free Open-Source Artificial Intelligence!

We are a community dedicated to forwarding the availability and access to:

Free Open Source Artificial Intelligence (F.O.S.A.I.)

More AI Communities

LLM Leaderboards

Developer Resources

GitHub Projects

FOSAI Time Capsule

founded 1 year ago
MODERATORS
27
submitted 1 year ago* (last edited 1 year ago) by [email protected] to c/[email protected]
 

Starting a Mistral Megathread to aggregate resources.

This is my new favorite 7B model. It is really good for what it is. I am excited to see what we can tune together. I will be using this thread as a living document, expect a lot of changes and notes, revisions and updates.

Let me know if there's something in particular you want to see here. I will be adding to this thread throughout my fine-tuning journey with Mistral.

Mistral Model Megathread


Key

  • Link #1 - Base Model
  • Link #2 - Instruct Model

Quantized Base Models from TheBloke

GPTQ

GGUF

AWQ


Quantized Samantha Models from TheBloke

GPTQ

GGUF

AWQ


Quantized Kimiko Models from TheBloke

GPTQ

GGUF

AWQ


Quantized Dolphin Models from TheBloke

GPTQ

GGUF

AWQ


Quantized Orca Models from TheBloke

GPTQ

GGUF

AWQ


Quantized Airoboros Models from TheBloke

GPTQ

GGUF

AWQ


If you like to run any of the quantized/optimized models from TheBloke, do visit the full model pages from each of the quantized model cards to see and support the developers of each fine-tuned model.

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 2 points 1 year ago* (last edited 1 year ago)

https://mistral.ai/news/announcing-mistral-7b/

They don't publish the training dataset. It's a secret. There are open bugreports on their Github, HuggingFace #8, #10, #38 and i think someone said so explicitly on their Discord.