Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Concurrent batch vectorization requests #4827

Open
wants to merge 9 commits into
base: stable/v1.25
Choose a base branch
from

Conversation

dirkkul
Copy link
Contributor

@dirkkul dirkkul commented May 2, 2024

What's being changed:

Send batch requests concurrently if the rate limit is high enough, otherwise keep sequential batching. For example Cohere allows 10k requests/min and we can basically send as many requests in parallel as we want.

With this the 100k sphere dataset with 2 named vectors was vectorized in ~2min using cohere and collection.batch.fixed_size(concurrent_requests=20, batch_size=96).

To test use this python client branch weaviate/weaviate-python-client#981 and the profiling/test_sphere.py script.

Review checklist

  • Documentation has been updated, if necessary. Link to changed documentation:
  • Chaos pipeline run or not necessary. Link to pipeline:
  • All new code is covered by tests where it is reasonable.
  • Performance tests have been run or not necessary.

@dirkkul dirkkul changed the title Concurrent batch requests Concurrent batch vectorization requests May 2, 2024
@dirkkul dirkkul changed the base branch from main to stable/v1.25 May 3, 2024 18:35
Copy link

sonarcloud bot commented May 16, 2024

Quality Gate Passed Quality Gate passed

Issues
2 New issues
0 Accepted issues

Measures
0 Security Hotspots
No data about Coverage
0.0% Duplication on New Code

See analysis details on SonarCloud

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants