I want to host some LLM’s locally and use more advanced models. Since new hardware is out of the question, I think I should be able to pull something off buying some yesteryear equipment on ebay etc. Did anybody attempt such a project? Does it scale horizontally? (I.e. can I connext two boxes to overcome single box slowness?)


I’ve just checked the Mac Studio on the site and lmao, they first ran out of 512gb uram and then of 256gb uram, now selling only 96gb version.