High-Speed Large Language Model Serving on PCs with Consumer-Grade GPUs (github.com)
Posted by bot@lemmy.smeargle.fans to Hacker News@lemmy.smeargle.fans · 11 months ago
Cross-posted to: localllama@sh.itjust.works, hackernews@derp.foo