Speed up index construction by converting vocabulary types while loading the model #768

rlouf · 2024-03-25T07:49:12Z

Because we use Numba to compile the index, we need to convert the vocabulary types, which takes a non-negligible amount of time every time the script is run. A simple way to work around this is to run the conversion in a separate thread while the model is being loaded. We may also be able to make Numba cache the JIT-compiled functions by compiling the index for a trivial regex.
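
A minimal sketch of the threading idea, using only the standard library. `convert_vocabulary`, `load_model` and `load_model_and_vocabulary` are hypothetical stand-ins rather than functions from the codebase, and the `time.sleep` calls only simulate the cost of each step:

```python
import time
from concurrent.futures import ThreadPoolExecutor


def convert_vocabulary(vocabulary):
    """Hypothetical stand-in for the type conversion needed before compilation."""
    time.sleep(0.2)  # simulated conversion cost
    return [(token, token_id) for token, token_id in vocabulary.items()]


def load_model(name):
    """Hypothetical stand-in for loading the model."""
    time.sleep(0.2)  # simulated loading cost
    return {"name": name}


def load_model_and_vocabulary(name, vocabulary):
    """Start the conversion in a worker thread, then load the model meanwhile."""
    with ThreadPoolExecutor(max_workers=1) as executor:
        # The conversion starts running in the background right away.
        future = executor.submit(convert_vocabulary, vocabulary)
        # The model is loaded in the calling thread in the meantime.
        model = load_model(name)
        # Block until the conversion has finished (re-raises its exceptions).
        converted = future.result()
    return model, converted


model, converted = load_model_and_vocabulary("toy-model", {"a": 0, "b": 1})
```

Whether this yields a real speed-up depends on how much of each task releases the GIL. The simulated sleeps here do, but a pure-Python conversion loop would still compete with the loading code for the interpreter.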

rlouf added the "structured generation" label (Linked to structured generation) Mar 25, 2024

rlouf added the "optimization" label (Related to performance optimizations) Apr 12, 2024

rlouf pinned this issue Apr 12, 2024

brandonwillard mentioned this issue Apr 20, 2024

Use a trie for scanning during index construction #507

Closed

rlouf linked a pull request Apr 21, 2024 that will close this issue

Convert vocabulary types and load model concurrently #832

Open
