Torch Batcher

Serve batched PyTorch inference requests using Redis; throughput scales linearly by increasing the number of workers per device and across devices.
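The repository's exact wire format isn't reproduced here, but the general pattern — clients enqueue requests onto a Redis list and a worker drains them in batches for a single forward pass — can be sketched as follows. The queue name, result-key prefix, pickled payload encoding, and the stand-in model are all illustrative assumptions, not the project's actual API.

    # Hypothetical batching worker; queue/key names and the model are
    # illustrative assumptions, not the repository's actual API.
    import pickle

    import redis
    import torch

    QUEUE_KEY = "torch_batcher:requests"      # assumed request queue
    RESULT_PREFIX = "torch_batcher:result:"   # assumed result key prefix
    MAX_BATCH = 32

    r = redis.Redis()
    device = "cuda" if torch.cuda.is_available() else "cpu"
    model = torch.nn.Linear(128, 10).eval().to(device)  # stand-in model

    while True:
        # Block for the first request, then drain up to MAX_BATCH - 1
        # more without blocking.
        items = [r.blpop(QUEUE_KEY)[1]]
        while len(items) < MAX_BATCH:
            raw = r.lpop(QUEUE_KEY)
            if raw is None:
                break
            items.append(raw)

        ids, tensors = zip(*(pickle.loads(item) for item in items))
        batch = torch.stack(tensors).to(device)

        with torch.no_grad():
            out = model(batch).cpu()

        # Fan results back out, one key per request id.
        pipe = r.pipeline()
        for req_id, row in zip(ids, out):
            pipe.set(RESULT_PREFIX + req_id, pickle.dumps(row), ex=60)
        pipe.execute()

Draining the queue without blocking after the first item keeps latency low under light load while still forming full batches under heavy load.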

Dependencies

The commands below assume the following are installed:

  • PyTorch
  • Redis (with the redis Python client)
  • Supervisor

Usage

  • For linear scaling, start nvidia-cuda-mps-control; see Section 2.1.1 (GPU Utilization) of the NVIDIA MPS documentation for details.

    nvidia-cuda-mps-control -d # Start the MPS daemon
    
    # To exit MPS after stopping the server:
    nvidia-cuda-mps-control # Enters the MPS command prompt
    quit # Enter this command to quit
  • Start Redis

    redis-server --save "" --appendonly no # Disable persistence; the queue is ephemeral
  • Start Batch-Serving

# Start 3 workers on a single GPU
    supervisord -c supervisor.conf # Start 3 workers on a single GPU
  • Start the batch benchmark (a minimal client sketch follows this list)

    python3 bench_batched.py
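For reference, a client in the style of bench_batched.py could pair with the hypothetical worker sketched above like this; the key names and the pickled (request id, tensor) payload are the same assumptions as in the worker sketch:

    # Hypothetical client matching the worker sketch above.
    import pickle
    import time
    import uuid

    import redis
    import torch

    QUEUE_KEY = "torch_batcher:requests"
    RESULT_PREFIX = "torch_batcher:result:"

    r = redis.Redis()

    def infer(x, timeout=5.0):
        """Enqueue one tensor, then poll until its result key appears."""
        req_id = uuid.uuid4().hex
        r.rpush(QUEUE_KEY, pickle.dumps((req_id, x)))
        deadline = time.monotonic() + timeout
        while time.monotonic() < deadline:
            raw = r.get(RESULT_PREFIX + req_id)
            if raw is not None:
                r.delete(RESULT_PREFIX + req_id)
                return pickle.loads(raw)
            time.sleep(0.001)
        raise TimeoutError(f"no result for request {req_id}")

    if __name__ == "__main__":
        print(infer(torch.randn(128)).shape)  # one result row from the worker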
