-
-
Notifications
You must be signed in to change notification settings - Fork 440
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Excessive Memory consumption. #759
Comments
Can you confirm you do not have any other tabs open? I can't seem to understand how this can be related to the application (not much is being rendered).
This makes sense since it's (most likely) running in fp16 mode. |
@xenova can confirm:
Navigated to https://huggingface.co/spaces/Xenova/experimental-phi3-webgpu and loaded the model:
To avoid any confusion this is the downloaded model: |
WebGPU currently only supports 16 and 32bit mode. |
same here |
System Info
Environment/Platform
Description
For the latest PHI-3 demo Chrome browser uses:
5.31 Gb for Renderer process and 4.16 Gb for GPU process, totaling almost 10 Gb while running ~2Gb model.
After first inference memory consumption jumps above 12Gb. That can't be normal.
Reproduction
load model
The text was updated successfully, but these errors were encountered: