Performance on large number of small files #11960
Unanswered
shcheklein
asked this question in
Q&A
Replies: 1 comment 5 replies
-
Hi. My real example. MiniO version RELEASE.2021-03-17T02-33-02Z
MiniO volume is located on NVME ssd (3Gb/s, INTEL SSDPE2KX040T8) I have one bucket which I plan to use with DVC. The bucket contains 455,605 files and the size is about 36Gb. ".minio.sys" dir size is 73Mb and contains 455k files also). LAN is 1Gbit/sec. Both machines are connected to one switch. I used an awscli for testing I got about 15Mbits per second. I also used warp, here is its output:
|
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi, I'm one of the maintainers at DVC - github.com/iterative/dvc. A lot of users use Minio as a S3 on-prem alternative and from time to time we see complaints like this:
It doesn't help to change level of parallelism,
aws s3 sync
is also slow:aws --endpoint-url http://<ip>:9000 s3 sync s3://myminio .
with any of these settings:
gives no more than 15MiB/s.
Question: Is it known and expected or it's a client problem? what can be the root cause for this? Is there a known overhead of creating objects?
Beta Was this translation helpful? Give feedback.
All reactions