PocketPal (Mobile App that Runs AI Locally)

I just found GitHub - a-ghorbani/pocketpal-ai: An app that brings language models directly to your phone.
It is available on Android and iOS and allows you to run small LLMs from Meta, Google, Alibaba, etc.

On my Pixel 9, it worked really well. I am curious to see how it runs on other devices!

2 Likes

Check it out
PocketPal is a mobile app to run LLMs locally!

3 Likes

I just tried it. An awesome app.

1 Like

What device are you using?

iOS

1 Like

I am more interested in the specs, like RAM and processor.

I tried running it on both the Pixel 8 Pro and the iPhone 16 Pro. Although the iPhone has 8GB of memory, the largest model I was able to load was around 4.5GB. All models larger than that would not load at all.

In contrast, the Pixel has more memory than my iPhone, which allowed me to run larger models (around 9GB memory, 8B parameters). However, the inference time was painfully slow.
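
For anyone wondering why ~4.5GB was the ceiling, here is the rough arithmetic, as a minimal sketch. The bits-per-weight figures and the overhead factor are my assumptions, not PocketPal internals; iOS also caps how much of the total RAM a single app may allocate, which likely explains the hard limit.

```python
# Rough estimate of the RAM needed to load a quantized LLM.
# The bits-per-weight values and the ~20% overhead factor
# (KV cache, runtime buffers) are assumptions, not app internals.

BITS_PER_WEIGHT = {"Q4": 4.5, "Q5": 5.5, "Q8": 8.5, "F16": 16.0}

def model_size_gb(params_billions: float, quant: str, overhead: float = 1.2) -> float:
    """Approximate in-memory size of a quantized model in GB."""
    bytes_per_weight = BITS_PER_WEIGHT[quant] / 8
    # 1e9 params times bytes/weight, divided by 1e9 bytes/GB, cancels out
    return params_billions * bytes_per_weight * overhead

# An 8B model at Q4 lands around 5.4 GB, already past the ~4.5 GB
# the iPhone would load; at Q8 it is far out of reach.
for quant in ("Q4", "Q5", "Q8"):
    print(f"8B @ {quant}: ~{model_size_gb(8, quant):.1f} GB")
```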

1 Like

I prefer to use the Duck chat now, but the app still seems very cool.

Perhaps when mobile hardware improves or larger models become more efficient, this will become a thing.

iPhone 15 (A16).

The biggest Qwen model (Qwen2.5 3B, Q5 quantization) is sometimes a bit slow to load. But apart from that, the app works fine.

Interesting. How was the speed with 3B models?

I guess it’s better to stick with SLMs. When I have time, I will open a PR to add PocketPal and also explain what you can and can’t do with AI on a smartphone.

On Pixel 9, I found Llama 3 8B speed to be OK actually. It’s not super fast, but it’s basically as fast as you can read.

Probably better to stick with Q4. Is the generation speed OK then?

Yes. It takes approximately 10 seconds to load, and the generation speed is 8-10 tokens per second.
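
That lines up with the usual back-of-the-envelope model: single-user decoding is roughly memory-bandwidth bound, since each token streams the full set of weights. A minimal sketch below; the bandwidth and efficiency numbers are assumptions, not measured Pixel 9 specs.

```python
# Back-of-the-envelope decode speed: each generated token reads
# (roughly) every weight once, so the token rate is bounded by
# effective memory bandwidth divided by model size.

def est_tokens_per_sec(model_gb: float, bandwidth_gbps: float, efficiency: float = 0.5) -> float:
    """Rough token-rate ceiling for single-batch decoding."""
    return bandwidth_gbps * efficiency / model_gb

# Assumed: ~60 GB/s of LPDDR5X bandwidth at 50% efficiency, and a
# ~4.5 GB Llama 3 8B Q4 file -> roughly 7 tokens/s, in the same
# ballpark as the 8-10 tokens/s reported above.
print(f"~{est_tokens_per_sec(4.5, 60.0):.0f} tokens/s")
```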

1 Like

I am really enjoying it, thank you!
Though it remains to be seen whether I have a real use case for it.

I am using a Pixel 7 (8GB RAM) and it’s usable.

1 Like

It’s pretty handy, or just fun for passing the time, when paired with e.g. Gemma 3 1B or 4B if you have 8GB+ RAM.

I do not object to this.
I’m aware of PocketPal; it’s a great app. It allows any local model from Hugging Face to be downloaded within the limits of system memory (it does not check, so you’ll have to do your own due diligence). I’ll check the PR; we could explain which models are usable based on the amount of system memory.
Edit: I did not realize PocketPal was on iOS, good to know!
Do I have permission to improve upon your PR? @Encounter5729

Go ahead. There is already a table with model sizes and corresponding hardware, I think.
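
In case it’s useful for that table, here is a hedged sketch of the RAM-to-model-size mapping. The usable-RAM fraction and GB-per-parameter figures are my guesses, not values from the existing table.

```python
# Map device RAM to the largest model likely to fit at Q4.
# Assumed: ~0.68 GB per billion parameters at Q4 (including
# runtime overhead) and only ~60% of total RAM being available
# to a single app on mobile. Both numbers are guesses.

GB_PER_BILLION_PARAMS_Q4 = 0.68

def max_params_billions(total_ram_gb: float, usable_fraction: float = 0.6) -> float:
    """Largest Q4 model (in billions of params) likely to fit."""
    return total_ram_gb * usable_fraction / GB_PER_BILLION_PARAMS_Q4

for ram in (4, 6, 8, 12, 16):
    print(f"{ram} GB RAM -> up to ~{max_params_billions(ram):.0f}B params at Q4")
```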

I have suggested changes.

Some time ago it was mentioned on this forum that PocketPal connected to Firebase and Google Analytics on each start. Did this change?

If the Exodus scan is anything to go by, probably. I have never seen this connect to Firebase or anything like that, though; I don’t know what the basis for that claim is.

Exodus report:

Though if I had a secondary Android device, I could use App Manager to inspect it, but I hope someone else does this instead.

[And even then, PocketPal works completely offline after you get the LLM(s)]

No.

Firebase is used for the benchmark feature, but the app inadvertently phones home on each start regardless.

Is this just a ping, or is other info sent?