
Utterly crazy cpu usage #998

Open
Wildcarde opened this issue Nov 9, 2023 · 6 comments
Labels
bug Something isn't working

Comments

@Wildcarde

Describe the bug
I converted an existing Dropbox folder on Fedora Linux to a Maestral Docker container. Everything went well from a sync perspective until it considered itself 'done' with the sync and went into standard running mode. At that point it pins the OS at around 80-90% CPU across all cores; I've got a 6-core/12-thread laptop and the process is currently sitting at 910% CPU usage.

To Reproduce
I'm not sure there's anything specific needed to replicate this beyond running the Docker version as a background process. The only additional thing I've done is add a KDE widget that monitors the output of docker exec -t maestral maestral status; this runs every 5 seconds.

Expected behaviour
This shouldn't be using more than 2-3% CPU when idle, ideally far less than that.

System:

  • Maestral version: docker latest image
  • Python version:
  • OS: Docker image running on a Fedora 39 laptop
  • Desktop environment: KDE - but unused by Maestral
Wildcarde added the bug label Nov 9, 2023
@Wildcarde
Author

Quick follow-up after looking through the code a bit: it seems like the filesystem monitoring defaults to polling only, instead of using an OS-native implementation when available? If observer.polling were replaced with the standard observer (https://pythonhosted.org/watchdog/api.html#module-watchdog.observers), it should reduce the processing overhead. It will still fall back to the polling implementation if the native ones aren't available.
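
For illustration, something like this is what I mean (a minimal sketch; the watch path is a placeholder). The top-level watchdog Observer auto-selects a native backend and only falls back to polling when none is available:

import time

from watchdog.events import LoggingEventHandler
from watchdog.observers import Observer  # picks inotify/FSEvents when available, polling otherwise

observer = Observer()
# "/path/to/dropbox" is a placeholder for the local Dropbox folder.
observer.schedule(LoggingEventHandler(), path="/path/to/dropbox", recursive=True)
observer.start()
try:
    while True:
        time.sleep(1)
finally:
    observer.stop()
    observer.join()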

@Wildcarde
Author

I've noticed the indexing mode also chews up a ton of CPU time. I haven't checked yet, but if all files are being hashed using the approach here: https://pythonhosted.org/watchdog/api.html#module-watchdog.observers, it may be a good idea to swap that out, for files below a specific size threshold, for a function that reads the whole file in one go and hashes it once it's in memory, instead of reading 1024 bytes at a time. That should reduce the number of filesystem actions taken.
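
Something like this rough sketch (the 1 MiB threshold and SHA-256 are arbitrary placeholders for illustration, not Maestral's actual values):

import hashlib
import os

SMALL_FILE_THRESHOLD = 1024 * 1024  # 1 MiB; arbitrary cut-off for illustration

def hash_file(path: str, chunk_size: int = 1024) -> str:
    hasher = hashlib.sha256()
    if os.path.getsize(path) <= SMALL_FILE_THRESHOLD:
        # Small file: one read call, then hash the whole buffer at once.
        with open(path, "rb") as f:
            hasher.update(f.read())
    else:
        # Large file: stream it in chunks to keep memory usage bounded.
        with open(path, "rb") as f:
            while chunk := f.read(chunk_size):
                hasher.update(chunk)
    return hasher.hexdigest()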

@samschott
Owner

samschott commented Nov 9, 2023

I'm not very familiar with the Docker setup; this was an external contribution. If run natively, Maestral will use either FSEvents on macOS or inotify on Linux, and the idle CPU usage is close to 0%. It will only fall back to polling if the platform is neither of those:

if platform.is_linux():
    from watchdog.observers.inotify import InotifyObserver as Observer
elif platform.is_darwin():
    from watchdog.observers.fsevents import FSEventsObserver as Observer
else:
    from .polling import OrderedPollingObserver as Observer

Regarding hashing, this is done in chunks of 65536 bytes, which has been a good tradeoff between CPU and memory usage. Note that multiple files may be hashed in parallel to better distribute load across CPU cores.
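
For readers following along, chunked hashing of this sort looks roughly like the following (a generic sketch, not Maestral's actual implementation; the SHA-256 choice and file paths are placeholders):

import hashlib
from concurrent.futures import ThreadPoolExecutor

CHUNK_SIZE = 65536  # 64 KiB chunks, as mentioned above

def content_hash(path: str) -> str:
    hasher = hashlib.sha256()
    with open(path, "rb") as f:
        while chunk := f.read(CHUNK_SIZE):
            hasher.update(chunk)
    return hasher.hexdigest()

# Hash several files in parallel to spread the load across CPU cores.
files = ["/placeholder/a.txt", "/placeholder/b.txt"]
with ThreadPoolExecutor() as pool:
    hashes = dict(zip(files, pool.map(content_hash, files)))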

Finally, the config file has settings for max bandwidth and CPU usage, and Maestral will throttle its work and transfer speeds to stay below both.

Could you check if you can replicate this behavior outside of a Docker image?

@Wildcarde
Author

Ah, I hadn't noticed that in the init area and had found the polling observer elsewhere. My bad! I had been trying to avoid setting up an environment locally, but I'll replicate my Docker config natively to test; if things behave more normally there, I'll see if I can figure out what's going squirrely with the Docker one.

@Wildcarde
Author

OK, I've got Maestral configured as a local install in a venv now and it is working. Unfortunately, even with max CPU time set to 5%, the indexing step is now running at 1100% CPU usage. I'm going to try using a cgroup to manage the max CPU and see if that works.

@Wildcarde
Author

Wildcarde commented Nov 10, 2023

OK, so I wrote a new systemd user unit and forced it to restrain CPU usage via a cgroup, and my laptop is no longer trying to burn holes in my desk!

[Unit]
Description="Maestral Service"
After=network.target

[Service]
Type=forking
ExecStart=/home/<user>/maestral-venv/bin/maestral start
ExecStop=/home/<user>/maestral-venv/bin/maestral stop
# CPUQuota puts the service in a cgroup limited to 5% of a single CPU core.
CPUQuota=5%

[Install]
WantedBy=default.target

I may consider bumping up the total amount of CPU it can use, but for now it's working in the background and chugging along. I'll dig into the Docker container in a bit; there are ways to do something similar there, and a compose file would be a more robust way of handling things long term (something like the sketch below).
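
Untested sketch of what I mean with a compose file (the image name and mount paths are placeholders; recent Docker Compose honours deploy.resources.limits.cpus even outside swarm mode):

services:
  maestral:
    image: <maestral-image>   # placeholder; use whatever image you're already running
    volumes:
      - /home/<user>/Dropbox:/dropbox   # placeholder paths
    deploy:
      resources:
        limits:
          cpus: "0.20"   # ~20% of one core, similar to CPUQuota=20%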

Edit: a quick note for anybody using this: it is a user-space systemd unit, so it goes into ~/.local/share/systemd/user/maestral.service. Run systemctl --user daemon-reload to reparse the folder, then systemctl --user start maestral to start the service; you can also use systemctl --user enable maestral to have it start automatically as a user service.

Second note: I moved the quota to 20%; the sync and update finished in about an hour and it's now just idling around, not really hurting anything at all. Working solid.
