How to run LLaMA (and other LLMs) on Android.

llama@lemmy.dbzer0.com · edit-2 15 hours ago

How to run LLaMA (and other LLMs) on Android.

land · 13 hours ago

Is there an alternative Android app that enables downloading LLaMA locally (without using a terminal)?

llama@lemmy.dbzer0.com · 3 hours ago

There are a few. There’s Private AI. It is free (as in beer) but it’s not libre (or open source). The app is a bit sketchy too, so I would still recommend doing as the tutorial says.

Out of curiosity, why do you not want to use a terminal for that?

land · 2 hours ago

Thanks for the suggestion.

I’m like GUI AI such as ChatGPT. I’m currently in the process of running a local model that also allows me to connect with the internet and cross-platform.

llama@lemmy.dbzer0.com · 1 hour ago

I see. I don’t think there there are many solutions on that front for Android. For PC there are a few, such as LM Studio.

beastlykings@sh.itjust.works · 16 hours ago

Very cool! I got it running. Though apparently I didn’t need step 6 as it started running after I downloaded it. I was a bit confused, and do was the LLM as it started telling me how the run command works 🤦‍♂️

Good fun. Got me interested in running local LLM for the first time. What type of performance increase should I expect when I spin this up on my 3070 ti?

llama@lemmy.dbzer0.com · 15 hours ago

Though apparently I didn’t need step 6 as it started running after I downloaded it

Hahahha. It really is a little redundant, now that you mention it. I’ll remove it from the post. Thank you!

Good fun. Got me interested in running local LLM for the first time.

I’m very happy to hear my post motivated you to run an LLM locally for the first time! Did you manage to run any other models? How was your experience? Let us know!

What type of performance increase should I expect when I spin this up on my 3070 ti?

That really depends on the model, to be completely honest. Make sure to check the model requirements. For llama3.2:2b you can expect a significant performance increase, at least.

Rhaedas@fedia.io · 2 days ago

I’ve run a local LLM on my PC for a while, so I’m familiar enough with Ollama to understand what’s going on. I’ve tried this with my Samsung Tracfone, not really expecting a lot. Surprisingly I’ve gotten all the way to getting a prompt, but then things crash and I’m kicked back to the starting terminal. Pretty sure it’s memory, so I’m now trying to use virtual memory to bump it up to the 4GB you’ve had success with (the phone looks to have 3GB actual memory, plenty of storage though).

If it doesn’t work, I’ll try some of the others, perhaps they’re a bit smaller.

I did get the 0.5 Qwen to run well. I’m surprised how fast it is even using CPU mode, but maybe being smaller also helps with the processing.

Just a tip (maybe obvious to experienced users): while you do have to run the terminal, login to debian, start the server and then run the model, remember that you can use the arrow keys in the terminal to repeat past commands, so it’s pretty quick to do. I actually missed the arrow keys the first time around because they aren’t very distinct or highlighted, but then when I had to look for how to do CTRL, I realized they were right in front of me.

llama@lemmy.dbzer0.com · 1 day ago

I have tried on more or less 5 spare phones. None of them have less than 4 GB of RAM, however.

How to run LLaMA (and other LLMs) on Android.

How to run LLaMA (and other LLMs) on Android.

Step 1: Install Termux

Step 2: Set Up proot-distro and Install Debian

Step 3: Install Dependencies

Step 4: Install Ollama

Step 5: Download and run the Llama3.2:1B Model