convert-hf-to-gguf-update.py breaks #7207

Open
CrispStrobe opened this issue May 10, 2024 · 15 comments
@CrispStrobe
Contributor

Just realized that some recent changes seemingly make the script break on creating the llama-spm contents. It runs through without that line, which is my quick and lazy workaround atm (also in a quickly hacked Kaggle script that runs through the steps to fix the pre-tokenizer issue). Sorry I cannot look into this further; maybe it is just some intermediate inconsistency that gets resolved in the process of the current edits in the repo, or maybe you want to look into it.

@ProjectAtlantis-dev

What is the error? Was it trying to download a tokenizer from HF? I know that dbrx fails.

The older convert throws a NotImplementedError("BPE pre-tokenizer was not recognized - update get_vocab_base_pre()") when trying to do llama3 refuel

@ProjectAtlantis-dev

I get this error from convert-hf-to-gguf-update.py using Python 3.11 when trying to convert llama3 refuel:

OSError: models/tokenizers/llama-spm does not appear to have a file named config.json. Checkout 'https://huggingface.co/models/tokenizers/llama-spm/tree/None' for available files.

@CrispStrobe
Contributor Author

CrispStrobe commented May 10, 2024

Yes, the very same error, or also: FileNotFoundError: [Errno 2] No such file or directory: 'models/tokenizers/llama-spm/tokenizer.json'
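Both tracebacks point at the same underlying problem: the tokenizer files never landed in models/tokenizers/llama-spm, so later steps fail on whichever file they look for first. A minimal sketch of a pre-flight check (the file names are taken from the two error messages; the helper itself is hypothetical, not part of the script):

```python
from pathlib import Path

# Files the conversion scripts expect, per the errors quoted above.
REQUIRED_FILES = ("config.json", "tokenizer.json")

def missing_files(model_dir: str) -> list[str]:
    """Return the required tokenizer files absent from model_dir.

    An empty list means the download step completed; a non-empty list
    means the later conversion steps will raise OSError/FileNotFoundError.
    """
    base = Path(model_dir)
    return [name for name in REQUIRED_FILES if not (base / name).exists()]
```

For a gated or 404'd repo the download silently produces an empty (or missing) directory, so `missing_files("models/tokenizers/llama-spm")` would return both names.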

@ProjectAtlantis-dev

ProjectAtlantis-dev commented May 10, 2024

Tried downloading llama-spm from HF directly but got a 404 error - though I think we could also steal one from another llama SPM based model.

@CrispStrobe
Contributor Author

Why though? (a) You said you want to work with llama3, so for that you can ignore llama-spm. (b) You do not want the original HF files anyway; you want what the update script will build for you, if it works.

@ProjectAtlantis-dev

ProjectAtlantis-dev commented May 10, 2024

I don't understand all the logic tbh, but it seems to be pulling configs from HF on the fly. Also, I think llama 3 refuel is BPE, so yeah, why should I even care.

@CrispStrobe
Contributor Author

I just realized: maybe it will work if you just fill out the license form on https://huggingface.co/meta-llama/Llama-2-7b-hf
But I am 99% sure this was not an issue a few days ago, hm...

@ProjectAtlantis-dev

ProjectAtlantis-dev commented May 10, 2024

I just deleted the dbrx and llama-spm entries in the model list below line 61 and it seems to work - but then it also says I need to run a bunch of scripts to build vocabs, which is something the other script would do automagically.

I think your above license form is for llama2, which is SPM, not BPE.

@CrispStrobe
Contributor Author

CrispStrobe commented May 10, 2024

Yes, that is as intended by the devs atm. It sounds more difficult than it is; you only need the one vocab, actually. You can also check out the Kaggle script linked above, which does it all on the fly too.

@ProjectAtlantis-dev

From convert-hf-to-gguf.py line 367:

 # NOTE: if you get an error here, you need to update the convert-hf-to-gguf-update.py script
 #       or pull the latest version of the model from Huggingface
 #       don't edit the hashes manually!

So the entry for the BPE tokenizer presumably needs to be added to the convert-hf-to-gguf-update.py script.
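For context, the check that NOTE sits next to identifies the pre-tokenizer by hashing the token ids the tokenizer produces for a fixed probe string, and comparing the digest against a table of known values that the update script regenerates. A hedged sketch of the hashing part only (the function name here is made up; the actual probe string and the lookup table live in the scripts):

```python
from hashlib import sha256

def pre_tokenizer_hash(token_ids: list[int]) -> str:
    """Hash the token ids produced for a fixed probe string.

    Two tokenizers that pre-tokenize identically yield the same ids for
    the probe, hence the same digest; an unknown digest is what triggers
    the NotImplementedError mentioned earlier in this thread.
    """
    return sha256(str(token_ids).encode()).hexdigest()
```

This is why editing the hashes by hand is warned against: the digest is only meaningful if it was computed from a real tokenizer run by the update script.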

@CrispStrobe
Contributor Author

indeed so

@CrispStrobe
Contributor Author

CrispStrobe commented May 10, 2024

OK, I just checked it with license access, and that is most probably indeed the cause. The same goes for dbrx. So there are two options atm: either ask for access to both repos, or delete/comment out both lines. But I would rather change the update script so that this does not break it. OK, here is a PR for that.
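The shape of that fix is simply to catch per-model download failures instead of letting one gated or missing repo abort the whole run. A minimal sketch under that assumption (the `fetch` callable is a hypothetical stand-in for the script's actual per-model download logic):

```python
def download_all(models: list[str], fetch) -> list[str]:
    """Try to fetch each model's tokenizer; skip failures instead of aborting.

    `fetch(name)` is assumed to raise (e.g. OSError on a 404/403 from a
    gated repo) when the download fails. Returns the names that were skipped
    so the user can request access or comment them out.
    """
    skipped = []
    for name in models:
        try:
            fetch(name)
        except Exception as exc:
            print(f"WARNING: could not fetch {name}: {exc} -- skipping")
            skipped.append(name)
    return skipped
```

With this shape, users without access to the gated llama-spm and dbrx repos still get every other tokenizer processed, plus a warning naming the ones that need a license request.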

@ProjectAtlantis-dev

I think the overall intention is to emulate what python AutoTokenizer apply_chat_template() already does - it goes out to hf and pulls down the template automagically

@CrispStrobe
Contributor Author

The similarity ends after the pulling down, though.

@oldmanjk

I don't understand all the logic tbh

You and me both
