
Llama2-7B mobile app crashes on Samsung S23 8GB RAM #3599

Open
salykova opened this issue May 14, 2024 · 8 comments
Labels
Android Android building and execution related. module: extension Related to extension built on top of runtime, e.g. pybindings, data loader, etc. triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module


salykova (Contributor) commented May 14, 2024

Hi all,

I've successfully compiled the .pte (default quantization parameters) and tokenizer.bin for Llama2-7B according to the tutorial. However, during inference the Android app crashes with no error message (I assume it's due to insufficient RAM). Is it currently possible to run 7B models on phones with 8GB RAM?

P.S. ~4GB of the 8GB RAM is available
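For a rough sense of why 8GB is borderline, here is a back-of-envelope estimate (my own arithmetic, not from the ExecuTorch docs) of the weight footprint of a 7B model under 4-bit, group-size-128 quantization. It already lands near the ~4GB that is free on the device, before counting the KV cache, activations, and the rest of the app:

```python
def quantized_weight_bytes(n_params, bits=4, group_size=128, scale_bytes=2):
    """Approximate in-memory size of group-wise quantized weights:
    packed low-bit weights plus one (e.g. fp16) scale per group."""
    packed = n_params * bits // 8                     # 4-bit packed weights
    scales = (n_params // group_size) * scale_bytes   # per-group scales
    return packed + scales

total = quantized_weight_bytes(7_000_000_000)
print(f"~{total / 2**30:.2f} GiB just for weights")   # ~3.36 GiB
```

If per-group zero points are stored as well the number grows slightly; either way there is very little headroom left on a device with ~4GB available.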

iseeyuan (Contributor) commented:

@kirklandsign Could you help look into it?

iseeyuan (Contributor) commented:

@salykova could you provide more details on how to reproduce it, including the phone model, the Android version, etc.?

iseeyuan added the module: extension, triaged, and Android labels on May 14, 2024
salykova (Contributor, Author) commented May 14, 2024

@iseeyuan

Device: Samsung S23, 8GB RAM, Android 14
ExecuTorch 0.2 stable branch (pytorch/executorch v0.2.0)
Java 17 JDK
Android SDK API Level 34
Android NDK 25.0.8775105

Steps to reproduce:

  1. Follow the guide https://github.com/pytorch/executorch/tree/v0.2.0/examples/models/llama2 to create .pte and tokenizer.bin models for llama2-7b-chat (default params, default quant 128 groupwise)
  2. Build android app following https://pytorch.org/executorch/0.2/llm/llama-demo-android.html
  3. The app crashes with no error message immediately after clicking "generate"
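For reference, the export in step 1 goes through the export_llama entry point; the invocation looks roughly like the sketch below (flags paraphrased from memory of the linked v0.2 README — treat the tutorial, not this comment, as authoritative, and fill in the placeholder paths):

```
python -m examples.models.llama2.export_llama \
    --checkpoint <path/to/consolidated.00.pth> \
    -p <path/to/params.json> \
    -kv -X -qmode 8da4w --group_size 128
```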

salykova (Contributor, Author) commented May 15, 2024

@iseeyuan @kirklandsign

I've also tested both Llama2-7B and Llama3-8B via the adb binary-based approach, and inference works. It seems the problem is with the app, or the app requires much more RAM than the adb binary.
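For anyone reproducing the comparison: the "adb binary-based approach" means pushing the model, tokenizer, and the natively built llama_main runner to the device and running it from a shell, roughly as sketched below (paths and flag names as I recall them from the executorch examples — double-check against the repo before relying on them):

```
adb push llama2.pte /data/local/tmp/llama/
adb push tokenizer.bin /data/local/tmp/llama/
adb push cmake-out-android/examples/models/llama2/llama_main /data/local/tmp/llama/
adb shell "cd /data/local/tmp/llama && ./llama_main \
    --model_path llama2.pte --tokenizer_path tokenizer.bin --prompt 'Hello'"
```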

mergennachin (Contributor) commented:

cc @digantdesai - regarding s23 :)

@salykova salykova changed the title Llama2-7B mobile app crashes on Android with 8GB RAM Llama2-7B mobile app crashes on Samsung S23 8GB RAM May 15, 2024
kirklandsign (Contributor) commented:

Hi @salykova, the app requires slightly more RAM for the Dalvik VM and graphics. However, I think the main issue is priority: the binary runs at high priority, so when the OOM killer starts killing processes, the binary is usually killed last. I have even seen SystemUI killed before the binary.

The app, on the other hand, is a normal user app, and it is usually killed before system processes.

Unfortunately I don't have a good solution at the moment. I do see 8GB RAM devices work sometimes, and 16GB devices work almost all the time.

salykova (Contributor, Author) commented May 15, 2024

Hi @kirklandsign, thanks for your response! Is it not possible to give the app a higher priority? Sorry, I'm not an Android developer and have no experience with this.

Also, I've found this option in the documentation: https://developer.android.com/guide/topics/manifest/application-element.html#largeHeap. Could it, in theory, improve the app's stability?
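For context, android:largeHeap is a one-line manifest attribute. A minimal sketch of where it goes (the label here is a placeholder; note it only raises the managed Dalvik heap limit, not native allocations):

```xml
<!-- AndroidManifest.xml: requests a larger Dalvik heap for the app. -->
<application
    android:label="LlamaDemo"
    android:largeHeap="true">
    <!-- activities, etc. -->
</application>
```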

kirklandsign (Contributor) commented:

Hi @salykova, unfortunately I don't have a good way to adjust the priority, especially on a non-rooted device. android:largeHeap only affects the Dalvik heap, but the RAM consumption here happens in the native layer, so it doesn't help. I also tried android:persistent, but that doesn't help either. So I'm not sure how to fix this kind of issue at the moment.
