Hello, I plan to deploy a model using ggml on a Qualcomm chip. I'm curious how running inference with ggml on an SoC (e.g. a Qualcomm SoC with CPU, GPU, NPU, etc.) compares with using the inference engine provided by the chip vendor (such as Qualcomm SNPE). Since ggml inference runs primarily on the CPU, whereas the vendor's engine can offload computation to the GPU or NPU, does using ggml lead to a significant increase in CPU memory usage and %CPU, potentially impacting other tasks? Has anyone run a similar comparative test?
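For anyone wanting to run such a comparison themselves, below is a minimal measurement sketch (not part of the ggml API) that records wall time, CPU time, and peak resident memory around a workload on a Linux/Android device. The `run_inference_placeholder` function is a stand-in; you would replace it with the actual ggml/SNPE inference call being compared.

```cpp
// Minimal sketch: measure CPU time and peak resident memory around a workload.
// The inference call here is a placeholder, not a real ggml or SNPE API.
#include <cstdio>
#include <sys/resource.h>
#include <sys/time.h>

// Placeholder workload. Replace with the real inference call
// (e.g. a ggml graph evaluation or an SNPE execute() loop).
static void run_inference_placeholder() {
    volatile double x = 0.0;
    for (long i = 0; i < 100000000L; ++i) x += 1.0 / (double)(i + 1);
}

int main() {
    struct timeval wall_start, wall_end;
    struct rusage ru_start, ru_end;

    gettimeofday(&wall_start, nullptr);
    getrusage(RUSAGE_SELF, &ru_start);

    run_inference_placeholder();

    getrusage(RUSAGE_SELF, &ru_end);
    gettimeofday(&wall_end, nullptr);

    double wall_s = (wall_end.tv_sec - wall_start.tv_sec) +
                    (wall_end.tv_usec - wall_start.tv_usec) / 1e6;
    double cpu_s  = (ru_end.ru_utime.tv_sec - ru_start.ru_utime.tv_sec) +
                    (ru_end.ru_utime.tv_usec - ru_start.ru_utime.tv_usec) / 1e6 +
                    (ru_end.ru_stime.tv_sec - ru_start.ru_stime.tv_sec) +
                    (ru_end.ru_stime.tv_usec - ru_start.ru_stime.tv_usec) / 1e6;

    // %CPU averaged over the run: values above 100% mean more than one core was busy.
    printf("wall time : %.2f s\n", wall_s);
    printf("cpu time  : %.2f s (~%.0f%% CPU)\n", cpu_s, 100.0 * cpu_s / wall_s);
    // ru_maxrss is the peak resident set size in KiB on Linux/Android.
    printf("peak RSS  : %ld KiB\n", ru_end.ru_maxrss);
    return 0;
}
```

Running the same harness once with the CPU-only ggml path and once with the vendor engine offloading to GPU/NPU would give a rough side-by-side view of the CPU load and memory footprint each approach leaves for other tasks.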