I’m interested in hosting something like this, and I’d like to hear about your experiences with it.

The main reason to self-host is privacy, and I’d also like to integrate my own PKM data (mainly Markdown files).

Feel free to recommend videos, articles, other Lemmy communities, etc.

  • The Assman@sh.itjust.works · 9 months ago

Programming is my main use case for LLMs, and I think local models are too slow right now. In the 30 seconds it takes a local Llama to give me an answer, I’ve already figured it out myself.

I’m definitely keeping an eye on it though. Our org is really interested in training models on our code base and customer data.

    • SoleInvictus@lemmy.world · 9 months ago

It’s good for me because I’m piss-poor at programming. In my defense, I’m not a programmer or even programmer-adjacent. I do see how it wouldn’t be useful to a pro. It has also occasionally given me garbage advice that an expert would spot right away, while I had to figure out on my own that it was ‘hallucinating’ again. There’s nothing better for learning than troubleshooting, though!

      • bogo@sh.itjust.works · 9 months ago

I can absolutely see it being useful for a pro. It’s already a better version of IDE templates: if you have to write boilerplate code, it can already do that. It’s a huge time saver for the things you’d otherwise have to go look up and piece together yourself.

Example: today I wanted a quick way to serve my current working directory over HTTP so I could do some quick web work. I asked ChatGPT to write me a bash function I could stick in my profile to do this, and I told it to pick a random unused port. That would have taken me much longer had I gone to look up how to do all of that. The only hint I gave it was to use the Python built-in module for serving HTTP.
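
        A minimal version of the result might look something like this (the function name and port-picking approach are illustrative, not ChatGPT’s exact output):

        # serve the current directory over HTTP on a random unused port
        serve() {
            local port
            while :; do
                port=$(( (RANDOM % 16384) + 49152 ))   # ephemeral port range
                ss -tln | grep -q ":$port " || break   # retry if already in use
            done
            echo "Serving $PWD on http://localhost:$port"
            python3 -m http.server "$port"
        }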

      • The Assman@sh.itjust.works · 9 months ago

Don’t get me wrong, I use Copilot and it’s super handy, especially for menial things. I just don’t have all day to wait for a local LLM to spit out an answer.

I was using Ollama + Code Llama + the Continue extension in VS Code. The answers are pretty decent, but yeah, too slow, and no inline suggestions. I think (not an expert) a machine with a dedicated graphics card would be more performant, but we can’t just build a suitable PC for 100 engineers.

I also don’t own the code, so privacy isn’t a concern for me lol. Copilot suits my needs just fine.

    • exu@feditown.com · 9 months ago

I’ve found it’s pretty good for translating between formats, so to speak.

      Converted some bash to Python relatively quickly by giving it snippets and fixing errors as it made them.

      I also had success generating an Ansible playbook based on my own previously written install instructions for SillyTavern and llama.cpp.

      I could do both of those tasks myself, but that would be more difficult than starting from a mostly correct translation and fixing some errors.

    • amzd@kbin.social · 9 months ago

You should make sure you’re running a model that fits in your VRAM; for me it runs faster than any online LLM I’ve tried.
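
      As a rough sanity check: a GGUF model needs roughly its file size in VRAM, plus some headroom for context. The commands and numbers below are illustrative:

      # how much VRAM is there? (NVIDIA)
      nvidia-smi --query-gpu=memory.total,memory.used --format=csv
      # a 13B Q4 quant is typically a ~7-8 GB file, so it fits on a
      # 12 GB card with room to spare, but is a squeeze on 8 GB
      ls -lh ~/models/*.gguf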

  • Buffalobuffalo@lemmy.dbzer0.com · 9 months ago

Dbzero Lemmy has a relationship with the Horde AI shared LLM group. My primary use is chat roleplay, but they have streamlined guides to hosting your own models for personal or horde use. One of the primary interfaces is SillyTavern, but they integrate numerous models.

  • CubitOom@infosec.pub · 9 months ago

Check out Ollama.

There are a lot of models you can pull from the official library.
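
    Pulling and chatting with a library model is a one-liner each, e.g.:

    # download a model from the official library, then talk to it
    ollama pull llama2
    ollama run llama2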

Using Ollama, you can also run external GGUF models found on places like Hugging Face if you use a modelfile with something as simple as:

    echo "FROM ~/Documents/ollama/models/$model_filepath" >| ~/Documents/ollama/modelfiles/$model_name.modelfile
    
  • The Cooking Senpai@lemme.discus.sh · 9 months ago

Absolutely yes. You can try GPT4All, which works on any decent CPU (the minimum I managed to run it on is a 2018 6-core 2.0 GHz ARM64 processor) and has a lot of built-in models. You can also import uncensored models (like the TheBloke ones on Hugging Face).
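
    For example, grabbing one of TheBloke’s quantized models from the Hugging Face hub might look like this (the repo and file names are just an example):

    # fetch a GGUF quant with the Hugging Face CLI
    pip install -U "huggingface_hub[cli]"
    huggingface-cli download TheBloke/Llama-2-13B-chat-GGUF llama-2-13b-chat.Q4_K_M.gguf --local-dir ./models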

I also tried AutoGPT some time ago, which is quite complex and cool.

  • SuperiorOne · 9 months ago

I’m actively using Ollama with Docker to run the llama2:13b model. It generally works fine, but it’s heavy on resources, as expected.
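
    The exact setup isn’t shown, but the standard Docker invocation from the Ollama README is along these lines:

    # start the Ollama server in a container (add --gpus=all for an NVIDIA card)
    docker run -d -v ollama:/root/.ollama -p 11434:11434 --name ollama ollama/ollama
    # pull and chat with the 13B Llama 2 model inside the container
    docker exec -it ollama ollama run llama2:13b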

  • Haggunenons@lemmy.world · 9 months ago

Mixtral is an amazing one that isn’t super slow and doesn’t require incredible hardware for decent speed.

    In general this guy has really good videos/tutorials for the latest tools.

    • TCB13@lemmy.world · 9 months ago

“Uncensored” models are bullshit, anything but uncensored. Just ask one for a Windows XP Pro key and you’ll see how uncensored they really are.

  • Imacat@lemmy.dbzer0.com · 9 months ago

There’s a LocalLLaMA subreddit with a lot of good information, and 4chan’s /g/ board will usually have a good thread with a ton of helpful links in the first post. I don’t think there’s anything on Lemmy yet. You can run some good models on a decent home PC, but training and fine-tuning will likely require renting some cloud GPUs.

  • amzd@kbin.social · 9 months ago

Ollama + Code Llama works perfectly. I use it from Neovim with a plugin called gen.nvim, I think.

  • TCB13@lemmy.world · 9 months ago

Yes, mostly https://gpt4all.io/, only to find out that even the “uncensored” models are bullshit and won’t even provide you with a Windows XP Pro key. That’s kind of my benchmark for models nowadays. :P

  • beta_tester · 9 months ago

Not with much success, but I’ve been using Hugging Face for a couple of days now. You may want to have a look into it.