Segmentation fault Windows 11 Docker #254

Open
jak6jak opened this issue Mar 24, 2023 · 6 comments

@jak6jak

jak6jak commented Mar 24, 2023

I tried installing dalai with Docker on Windows. Currently I am getting the following error when I try to generate a response with debug mode on:

root@7788cdbedf9c:~/dalai/alpaca# /root/dalai/alpaca/main --seed -1 --threads 4 --n_predict 200 --model models/30B/ggml-model-q4_0.bin --top_k 40 --top_p 0.9 --temp 0.8 --repeat_last_n 64 --repeat_penalty 1.3 -p "Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.
> 
> ### Instruction:
> >PROMPT
> 
> ### Response:
> "
main: seed = 1679656530
llama_model_load: loading model from 'models/30B/ggml-model-q4_0.bin' - please wait ...
llama_model_load: ggml ctx size = 25631.50 MB
Segmentation fault
root@7788cdbedf9c:~/dalai/alpaca# exit
exit

Looking at the llama.cpp project, it seems that they have tried to fix some segmentation faults but were unsuccessful. Perhaps this is the issue I am facing, but I do not know. ggerganov/llama.cpp@3cd8dde

Any tips on how to debug this or to get a better error would be appreciated.
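One generic way to get a more informative error than a bare "Segmentation fault" (a sketch, assuming gdb can be installed inside the container; the paths and arguments are taken from the command above) is to re-run the loader under gdb and capture a backtrace:

apt-get update && apt-get install -y gdb
gdb --args /root/dalai/alpaca/main --seed -1 --threads 4 --n_predict 200 --model models/30B/ggml-model-q4_0.bin
(gdb) run
(gdb) bt

The backtrace should at least show whether the crash happens inside ggml's model loading or somewhere else.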

@christopherorea

I have the exact same problem.
I tried running it in the terminal via Docker, and also cloning alpaca.cpp and running make chat, but without success. If I learn anything I will post it here.

mirroredkube pushed a commit to mirroredkube/dalai that referenced this issue Mar 26, 2023
This causes long prompts to parse very slowly.
@FrancescoGrazioso

Just downloaded the repo and installed the 30B model, having the same issue.
Here's the debug output:

root@81743ba9c2e2:~/dalai/alpaca# /root/dalai/alpaca/main --seed -1 --threads 4 --n_predict 200 --model models/30B/ggml-model-q4_0.bin --top_k 40 --top_p 0.9 --temp 0.8 --repeat_last_n 64 --repeat_penalty 1.3 -p "Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.

### Instruction:
>PROMPT

### Response:
"
main: seed = 1680109480
llama_model_load: loading model from 'models/30B/ggml-model-q4_0.bin' - please wait ...
llama_model_load: ggml ctx size = 25631.50 MB
Segmentation fault
root@81743ba9c2e2:~/dalai/alpaca# exit
exit

@glozachmeur

glozachmeur commented Mar 30, 2023

I also have this issue with alpaca 30B and llama 30B, exactly the same error (but the ggml ctx size is about 21000 MB for me).

I have 32 GB of RAM; Docker sometimes consumes a lot of it (via the vmmem process), so I sometimes don't have the ~22 GB needed, but even when I do have enough free RAM I still can't run the model...

So I suspect 32 GB of RAM is not enough for running the 30B model under Docker? 🤔 How much do you have?
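If the cap is coming from Docker Desktop's WSL 2 backend rather than from physical RAM, the limit can be raised in %UserProfile%\.wslconfig (a sketch; the values are illustrative and assume Docker Desktop is running on the WSL 2 backend):

# %UserProfile%\.wslconfig
[wsl2]
memory=28GB   # upper bound for the WSL 2 VM that Docker Desktop runs in
swap=16GB     # optional; swap may let a model that barely fits finish loading

After editing, run wsl --shutdown and restart Docker Desktop so the new limit takes effect.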

@toolchild

Here I described my experience running models on Windows 10:
#330 (comment)

@christopherorea

My assumption is that the issue comes from the fact that these models require a lot of RAM on your machine. Can anybody confirm or dismiss this? I believe the model is loaded entirely into RAM, and that is why it breaks.
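A quick way to check this on a given machine (standard Linux/Docker tooling, nothing dalai-specific) is to compare the memory visible inside the container against the ggml ctx size printed by the loader (~25 GB for the 30B q4_0 model in the logs above):

# inside the container: total / free memory visible to the process
free -h

# on the host: live memory usage of the running container
docker stats --no-stream

# follow the kernel log while reproducing; segfaults and OOM kills are usually logged here
dmesg | tail -n 20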

@pratyushtiwary

In my case the context size was causing this issue; I fixed it by adding a new config option to the UI that lets me adjust the context size.

I was using a server with 6 GB of RAM to try it, and in my case a context size below 1024 seems to work without any errors.

PR for the same: #424
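For anyone who wants to test the same idea without the UI change, the context size can also be reduced directly on the command line, assuming the bundled main binary accepts llama.cpp's context-size flag from that era (--ctx_size; the value 512 here is just an example):

/root/dalai/alpaca/main --seed -1 --threads 4 --n_predict 200 \
  --model models/30B/ggml-model-q4_0.bin \
  --ctx_size 512 \
  -p "Below is an instruction that describes a task. Write a response that appropriately completes the request."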
