overview for kwa

Guide to Self Hosting LLMs Faster/Better than Ollama in c/[email protected]

[–] [email protected] 1 points 1 month ago (1 children)

Well this is what I quite don’t understand: I was trying to run the q3_k_m which is 3.81GB and it was failing with OutOfMemory error. The one you provided IQ4_XS is 4.22GB and is working fine.

Guide to Self Hosting LLMs Faster/Better than Ollama in c/[email protected]

[–] [email protected] 2 points 1 month ago (3 children)

Indeed, this model is working on my machine. Can you explain the difference with the one I tried before?

Guide to Self Hosting LLMs Faster/Better than Ollama in c/[email protected]

[–] [email protected] 1 points 1 month ago (6 children)

I have a MacBook Pro M1 Pro with 16GB RAM. I closed a lot of things and managed to have 10GB free, but that seems to still not be enough to run the 7B model. For the answer being truncated, it seems to be a frontend issue. I tried open-webui connected to llama-server and it seems to be working great, thank you!

Guide to Self Hosting LLMs Faster/Better than Ollama in c/[email protected]

[–] [email protected] 1 points 1 month ago (9 children)

I tried llama.cpp with llama-server and Qwen2.5 Coder 1.5B. Higher parameters just output garbage and I can see an OutOfMemory error in the logs. When trying the 1.5B model, I have an issue where the model will just stop outputting the answer, it will stop mid sentence or in the middle of a class. Is it an issue with my hardware not being performant enough or is it something I can tweak with some parameters?

Guide to Self Hosting LLMs Faster/Better than Ollama in c/[email protected]

[–] [email protected] 2 points 1 month ago (11 children)

Thanks!

For people on MacOS, is there a better alternative than croco.cpp?

Please suggest some good self-hostable RAG for my LLM. in c/[email protected]

[–] [email protected] 4 points 1 month ago* (last edited 1 month ago) (1 children)

I’m new to this and I was wondering why you don’t recommend ollama? This is the first one I managed to run and it seemed decent but if there are better alternatives I’m interested

Edit: it seems the two others don’t have an API. What would you recommend if you need an API?

Jellyfin burning subtitles for AndroidTV in c/[email protected]

[–] [email protected] 0 points 2 months ago

I see, thank you 😊

23

Jellyfin burning subtitles for AndroidTV (lemmy.zip)

submitted 2 months ago by [email protected] to c/[email protected]

4 comments fedilink

Hey, I’m using Jellyfin and AndroidTV as client.

Everything is usually working fine using direct play. But for a few files, when using subtitles, I will get the message "burning subtitles" and the server will start transcoding it. I haven’t really figured out what format is causing it, but it only happens when enabling subtitles.

Anyone else having this issue? Can I do anything to fix it?