So here's the way I see it; with Data Center profits being the way they are, I don't think Nvidia's going to do us any favors with GPU pricing next generation. And apparently, the new rule is Nvidia cards exist to bring AMD prices up.
So here's my plan. Starting with my current system;
OS: Linux Mint 21.2 x86_64
CPU: AMD Ryzen 7 5700G with Radeon Graphics (16) @ 4.673GHz
GPU: NVIDIA GeForce RTX 3060 Lite Hash Rate
GPU: AMD ATI 0b:00.0 Cezanne
GPU: NVIDIA GeForce GTX 1080 Ti
Memory: 4646MiB / 31374MiB
I think I'm better off just buying another 3060 or maybe 4060ti/16. To be nitpicky, I can get 3 3060s for the price of 2 4060tis and get more VRAM plus wider memory bus. The 4060ti is probably better in the long run, it's just so damn expensive for what you're actually getting. The 3060 really is the working man's compute card. It needs to be on an all-time-greats list.
My limitations are that I don't have room for full-length cards (a 1080ti, at 267mm, just barely fits), also I don't want the cursed power connector. Also, I don't really want to buy used because I've lost all faith in humanity and trust in my fellow man, but I realize that's more of a "me" problem.
Plus, I'm sure that used P40s and P100s are a great value as far as VRAM goes, but how long are they going to last? I've been using GPGPU since the early days of LuxRender OpenCL and Daz Studio Iray, so I know that sinking feeling when older CUDA versions get dropped from support and my GPU becomes a paperweight. Maxwell is already deprecated, so Pascal's days are definitely numbered.
On the CPU side, I'm upgrading to whatever they announce for Ryzen 9000 and a ton of RAM. Hopefully they have some models without NPUs, I don't think I'll need them. As far as what I'm running, it's Ollama and Oobabooga, mostly models 32Gb and lower. My goal is to run Mixtral 8x22b but I'll probably have to run it at a lower quant, maybe one of the 40 or 50Gb versions.
My budget: Less than Threadripper level.
Thanks for listening to my insane ramblings. Any thoughts?
The 1080ti does a fairly decent job with ai. But you may need to compile your tools to target the older architecture.
Whatever cards have the most VRAM is my answer.
As long as you're working with open source tools it should be possible to continue supporting those older cards by setting the compile time information to match your gpus.
I'll admit that's probably over my head, but still good advice. The 1080ti works fine for now, but whenever it gets dropped from support, I have a good home for it.
I have a Win7 machine so I can run my older games that don't play well in linux. Nvidia Rocket Sled Demo on a 1080ti screams.