The Rule (lemmy.ml)
submitted 1 month ago by [email protected] to c/[email protected]
[-] [email protected] 87 points 1 month ago

I doubt this person actually had a computer that could run the 405B model. You need over 200 GB of RAM, let alone enough VRAM to run it with GPU acceleration.

[-] [email protected] 27 points 1 month ago* (last edited 1 month ago)

In terms of RAM it's not impossible; my current little server has 192 GB of RAM installed.

[Image: screenshot from TrueNAS]

The VRAM would be quite the hurdle though. I'm curious about its VRAM requirements.

Edit: Moving data in anticipation of a hardware migration ATM so basically none of the services are running.

[-] [email protected] 6 points 1 month ago

VRAM would be 810 GB / 403 GB / 203 GB for FP16 / FP8 / INT4 inference, according to their website.
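
For anyone wondering where numbers like that come from, here's a rough sanity check: parameter count times bytes per parameter. This is only a weights-only sketch under my own assumptions (405e9 parameters, decimal GB, and the hypothetical helper name `weight_memory_gb`); it ignores KV cache, activations, and runtime overhead, so it won't match the website's figures exactly.

```python
# Back-of-the-envelope, weights-only memory estimate.
# Assumption: storage cost per parameter for each precision.
BYTES_PER_PARAM = {"FP16": 2.0, "FP8": 1.0, "INT4": 0.5}


def weight_memory_gb(num_params: float, precision: str) -> float:
    """Approximate size of the model weights in decimal GB (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[precision] / 1e9


if __name__ == "__main__":
    # Assumption: "405b" means roughly 405e9 parameters.
    for precision in BYTES_PER_PARAM:
        print(f"{precision}: ~{weight_memory_gb(405e9, precision):.1f} GB")
```

That lands at about 810 GB for FP16, 405 GB for FP8, and 202.5 GB for INT4, which is in the same ballpark as the quoted requirements.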

[-] [email protected] 4 points 1 month ago

Hot damn that's a lot! They ain't messing around with that requirement.

My current server has 32 MB of VRAM. Yes, MB not GB. Once I finish the hardware migration it's going to 8GB but that's not even a drop in the bucket compared to that requirement.

this post was submitted on 25 Jul 2024
655 points (100.0% liked)

196


Be sure to follow the rule before you head out.

Rule: You must post before you leave.


founded 1 year ago