[Feature Request]: Ability to specify local_files_only #2039
Comments
Currently I have to turn off the wifi every time I start the script, otherwise it'll just hang forever at requesting https://huggingface.co/sentence-transformers/paraphrase-multilingual-mpnet-base-v2/resolve/main/modules.json
@simaotwx, let me try to clarify this. What you want to be able to do is use embedding models (let's say sentence transformers) locally, without having to go to the internet to fetch them every time you redeploy Chroma. We have a PR about this: #1799. Until that gets merged for sentence transformers, you can mount a dir for the model cache at Another alternative could be that you use Chroma base image |
@tazarov Not quite. I'm using the local embedded Chroma instance for development. The cache is there but it still goes to the internet to check if there are updates. If my internet is unstable (which, unfortunately, is quite common in Germany) it would prevent me from starting the script. |
@simaotwx, have you tried passing |
Where exactly do I pass this? See my example code, I'm using Chroma's embedding_functions.SentenceTransformerEmbeddingFunction |
The ST EF supports kwargs. You can call it like this:
ef = embedding_functions.SentenceTransformerEmbeddingFunction(
    model_name="sentence-transformers/paraphrase-multilingual-mpnet-base-v2",
    use_auth_token=False
) |
Yeah this is in |
Okay, I'll wait for the release |
@simaotwx, new version of Chroma released last week. Please update your deps and try again. |
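For anyone hitting the same hang who cannot upgrade yet, one possible workaround is to put the Hugging Face libraries into offline mode through environment variables, so that only the local cache is consulted. This is a minimal sketch, not something confirmed in this thread: it assumes the model is already in the local cache and that `HF_HUB_OFFLINE` / `TRANSFORMERS_OFFLINE` are honored by the installed `huggingface_hub` and `transformers` versions. The helper name `enable_hf_offline_mode` is made up for illustration.

```python
import os


def enable_hf_offline_mode() -> None:
    """Ask Hugging Face libraries to rely on the local cache only.

    Hypothetical helper (not part of Chroma). Call it before importing
    chromadb or sentence_transformers, since the Hugging Face libraries
    read these variables when they are first imported.
    """
    os.environ["HF_HUB_OFFLINE"] = "1"        # huggingface_hub: skip network calls
    os.environ["TRANSFORMERS_OFFLINE"] = "1"  # transformers: use cached files only


enable_hf_offline_mode()

# After this point, import Chroma and build the embedding function as usual,
# for example (assumes the model was downloaded on an earlier run):
#
# from chromadb.utils import embedding_functions
# ef = embedding_functions.SentenceTransformerEmbeddingFunction(
#     model_name="sentence-transformers/paraphrase-multilingual-mpnet-base-v2",
# )
```

Setting the same variables in the shell before launching the script should have the same effect. If the model is not in the cache, loading is expected to fail with an error rather than hang.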
Describe the problem
I'd like to be able to avoid making connections to random servers all over the world whenever I start my Python script to debug it. Not only does it make my script slower, but whenever a new version is downloaded, my previous tests are no longer valid.
Also, should the software being written go into production, the script's ability to function should not be dependent on subsequent requests across restarts to outside servers.
Describe the proposed solution
I would like to have the ability to specify the "local_files_only" parameter here:
Alternatives considered
No response
Importance
i cannot use Chroma without it
Additional Information
huggingface/transformers@a143d94