FEAT-#4574: Warn users when pre-initialized Ray cluster is not using all available memory #4575

Open
wants to merge 3 commits into
base: master
from

Conversation

RehanSD
Collaborator

@RehanSD RehanSD commented Jun 14, 2022

Signed-off-by: Rehan Durrani [email protected]

What do these changes do?

@RehanSD RehanSD requested a review from a team as a code owner June 14, 2022 18:46
@@ -219,6 +219,17 @@ def initialize_ray(
            if not GPU_MANAGERS:
                for i in range(GpuCount.get()):
                    GPU_MANAGERS.append(GPUManager.remote(i))
    else:
        ray_obj_store_mem = ray.available_resources()["object_store_memory"]
        virtual_memory = psutil.virtual_memory().total
Collaborator


Actually, in this context you might want to use `psutil.virtual_memory().available` rather than `.total`, since other running processes could already be consuming memory?

Collaborator


Free vs. available memory in Linux: https://haydenjames.io/free-vs-available-memory-in-linux/
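
To make the total-versus-available point concrete, here is a minimal sketch of the kind of check being discussed. It is not the code from this PR: the helper name `object_store_warning`, the 0.6 threshold, and the message wording are all assumptions chosen for illustration, and the numbers in the example are made up.

```python
import warnings
from typing import Optional


def object_store_warning(
    object_store_bytes: float,
    system_bytes: float,
    min_fraction: float = 0.6,  # assumed threshold, not taken from the PR
) -> Optional[str]:
    """Return a warning message if the object store looks undersized, else None."""
    if system_bytes <= 0:
        # Nothing sensible to compare against.
        return None
    fraction = object_store_bytes / system_bytes
    if fraction < min_fraction:
        return (
            f"Ray object store is using {fraction:.0%} of system memory; "
            f"expected at least {min_fraction:.0%}."
        )
    return None


# Made-up numbers: a 2 GB object store on a machine with 16 GB total memory,
# of which 10 GB is currently available to new allocations.
GB = 10**9
msg_total = object_store_warning(2 * GB, 16 * GB)      # compared against total
msg_available = object_store_warning(2 * GB, 10 * GB)  # compared against available
no_msg = object_store_warning(8 * GB, 10 * GB)         # large enough, no warning

if msg_available is not None:
    warnings.warn(msg_available)
```

In real use, `object_store_bytes` would come from `ray.available_resources()["object_store_memory"]` as in the diff, and `system_bytes` from either `psutil.virtual_memory().total` or `psutil.virtual_memory().available`. Comparing against `available` gives a smaller denominator when other processes hold memory, so the same object store is less likely to trigger the warning.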

@codecov

codecov bot commented Jun 14, 2022

Codecov Report

Merging #4575 (d98fed0) into master (4ec7f63) will increase coverage by 3.18%.
The diff coverage is n/a.

❗ Current head d98fed0 differs from pull request most recent head 3286ccc. Consider uploading reports for the commit 3286ccc to get more accurate results

@@            Coverage Diff             @@
##           master    #4575      +/-   ##
==========================================
+ Coverage   86.22%   89.40%   +3.18%     
==========================================
  Files         228      229       +1     
  Lines       18451    18727     +276     
==========================================
+ Hits        15909    16743     +834     
+ Misses       2542     1984     -558     
Impacted Files                                         Coverage Δ
modin/core/execution/ray/common/utils.py               96.87% <ø> (ø)
modin/experimental/batch/test/test_pipeline.py         100.00% <0.00%> (ø)
modin/pandas/base.py                                   94.81% <0.00%> (+0.08%) ⬆️
modin/core/storage_formats/pandas/parsers.py           90.22% <0.00%> (+1.26%) ⬆️
...mentations/pandas_on_ray/partitioning/partition.py  93.57% <0.00%> (+1.83%) ⬆️
...tations/pandas_on_python/partitioning/partition.py  93.75% <0.00%> (+2.08%) ⬆️
...entations/pandas_on_dask/partitioning/partition.py  91.46% <0.00%> (+2.43%) ⬆️
modin/pandas/__init__.py                               69.69% <0.00%> (+3.03%) ⬆️
...dataframe/pandas/partitioning/partition_manager.py  90.00% <0.00%> (+3.44%) ⬆️
modin/config/envvars.py                                86.63% <0.00%> (+3.46%) ⬆️
... and 15 more


… using all available memory

Signed-off-by: Rehan Durrani <[email protected]>
Signed-off-by: Rehan Durrani <[email protected]>
Signed-off-by: Rehan Durrani <[email protected]>
@vnlitvinov
Collaborator

@RehanSD what's the status of this PR?

@RehanSD
Collaborator Author

RehanSD commented Sep 11, 2022

@vnlitvinov it's ready for review!

Development

Successfully merging this pull request may close these issues.

Warn users that if ray is already initialized it may not be using all available memory for object store
3 participants