Selfhost an LLM

Shimitar@downonthestreet.eu · 3 days ago

ragingHungryPanda@piefed.keyboardvagabond.com · 2 days ago

I had to run a particular command to get the AMD GPU properly available in Docker. I can find it if you need it.
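For context, a minimal sketch of the device passthrough that AMD's ROCm container instructions commonly describe. This is not necessarily the command the poster meant. The helper name `rocm_docker_command` and the choice of the `ollama/ollama:rocm` image are assumptions made for illustration; the two device nodes are the ones ROCm containers normally need.

```python
import os
import shlex

# Device nodes a ROCm container normally needs from the host:
# /dev/kfd is the compute interface, /dev/dri holds the render nodes.
ROCM_DEVICES = ["/dev/kfd", "/dev/dri"]


def rocm_docker_command(image="ollama/ollama:rocm"):
    """Build a `docker run` argument list that passes the AMD GPU through.

    The image name is an assumption; substitute whatever ROCm image you use.
    """
    cmd = ["docker", "run", "-d"]
    for dev in ROCM_DEVICES:
        cmd.append(f"--device={dev}")
    cmd.append(image)
    return cmd


if __name__ == "__main__":
    # Warn early if the host does not expose the expected device nodes.
    missing = [d for d in ROCM_DEVICES if not os.path.exists(d)]
    if missing:
        print("Missing on this host:", ", ".join(missing))
    # Print the command rather than running it, so it can be reviewed first.
    print(shlex.join(rocm_docker_command()))
```

Depending on the distro, the user running the container may also need to be in the `video` or `render` group.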

eleitl@lemmy.zip · 1 day ago

Is Radeon V with 8 GB HBM worth using today?

ragingHungryPanda@piefed.keyboardvagabond.com · 1 day ago

Not for LLMs. I have a 16 GB card, and even what I can fit in there just isn't really enough to be useful. It can still do things, and quickly enough, but I can't fit models that are large enough to be useful.
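A rough back-of-the-envelope sketch of why VRAM is the limit: the weights alone take about (parameter count × bits per weight ÷ 8) bytes. The function names and the 2 GiB allowance for KV cache and runtime overhead are assumptions for illustration, not measured values.

```python
def weights_gib(params_billion, bits_per_weight):
    """Approximate size of the model weights in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


def fits(params_billion, bits_per_weight, vram_gib, overhead_gib=2.0):
    """True if weights plus an assumed overhead fit in the given VRAM.

    overhead_gib is a guess covering KV cache and runtime buffers;
    real usage grows with context length.
    """
    return weights_gib(params_billion, bits_per_weight) + overhead_gib <= vram_gib


if __name__ == "__main__":
    for params, bits in [(7, 4), (13, 4), (7, 16), (70, 4)]:
        size = weights_gib(params, bits)
        print(
            f"{params}B @ {bits}-bit: {size:.1f} GiB weights, "
            f"fits 8 GiB: {fits(params, bits, 8)}, "
            f"fits 16 GiB: {fits(params, bits, 16)}"
        )
```

Under these assumptions a 4-bit 7B model fits in 8 GiB, while a 4-bit 70B model does not fit in 16 GiB.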

I also don't know whether your GPU is compatible with ROCm.

eleitl@lemmy.zip · 7 hours ago

The GPU used to be, but they dropped ROCm support for Radeon V and VII some time ago. I'll have to look at that Strix Halo/AI Max thing, I guess.