Add classloader diagnostics to initShuffleManager error message #10871

Merged 18 commits on May 29, 2024

Collaborator

@zpuller zpuller commented May 23, 2024

Closes #10172

As noted in the issue above, this error is presumably hit when the RapidsShuffleManagerLike interface referenced in the case clause was loaded by a different classloader than the one that loaded the shuffle manager implementation. This change logs both classloader instances to help with troubleshooting.
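
To illustrate the failure mode described above, here is a minimal Java sketch of this kind of diagnostic. It is not the code from this PR (the plugin is written in Scala and uses a case clause rather than `isInstance`); the names `ClassLoaderDiagnostics`, `ShuffleManagerLike`, `GoodManager`, `describe`, and `check` are hypothetical and exist only for this example. The idea is that when the type check fails, the message reports the class name together with the identity of the classloader that defined it, for both the expected interface and the actual object.

```java
public class ClassLoaderDiagnostics {
    // Hypothetical stand-in for the interface matched in the case clause.
    interface ShuffleManagerLike {}

    // Hypothetical implementation that satisfies the interface.
    static class GoodManager implements ShuffleManagerLike {}

    /** Describes a class together with the identity of the loader that defined it. */
    static String describe(Class<?> cls) {
        ClassLoader cl = cls.getClassLoader(); // null means the bootstrap loader
        String loader = (cl == null)
            ? "bootstrap"
            : cl.getClass().getName() + "@" + Integer.toHexString(System.identityHashCode(cl));
        return cls.getName() + " (loader: " + loader + ")";
    }

    /** Returns "ok" on a match, otherwise a message naming both classes and their loaders. */
    static String check(Object candidate, Class<?> expected) {
        if (expected.isInstance(candidate)) {
            return "ok";
        }
        return "Expected an instance of " + describe(expected)
            + " but got " + describe(candidate.getClass());
    }

    public static void main(String[] args) {
        // Matching case: the object implements the expected interface.
        System.out.println(check(new GoodManager(), ShuffleManagerLike.class));
        // Mismatch case: the message shows which loader defined each class.
        System.out.println(check("not a shuffle manager", ShuffleManagerLike.class));
    }
}
```

If the same class name appears on both sides of such a message but with different loader identities, that would point to the class having been loaded twice by different classloaders, which is the situation this PR aims to make visible.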

@mattahrens mattahrens marked this pull request as draft May 23, 2024 16:38
@sameerz sameerz added the "ease of use" label (Makes the product simpler to use or configure) May 23, 2024
@zpuller zpuller changed the base branch from branch-24.06 to branch-24.08 May 24, 2024 17:57
Signed-off-by: Zach Puller <[email protected]>
zpuller and others added 2 commits May 24, 2024 16:35
Signed-off-by: Zach Puller <[email protected]>
gerashegalov previously approved these changes May 25, 2024
Collaborator

@gerashegalov gerashegalov left a comment

LGTM, minor formatting comments

@zpuller
Collaborator Author

zpuller commented May 25, 2024

build

@jlowe
Member

jlowe commented May 28, 2024

@zpuller curious why this is in draft? I didn't find any explanation text.

@zpuller zpuller changed the title from "[DRAFT] add log line" to "Add classloader diagnostics to initShuffleManager error message" May 28, 2024
@zpuller
Collaborator Author

zpuller commented May 28, 2024

> @zpuller curious why this is in draft? I didn't find any explanation text.

You're right, it's really no longer a draft. Fixed the title/description.

@zpuller zpuller marked this pull request as ready for review May 28, 2024 18:19
@ttnghia
Collaborator

ttnghia commented May 28, 2024

build

@abellina
Collaborator

build

abellina previously approved these changes May 28, 2024
Signed-off-by: Zach Puller <[email protected]>
@jlowe
Member

jlowe commented May 28, 2024

build

@zpuller zpuller merged commit dfcde72 into NVIDIA:branch-24.08 May 29, 2024
44 checks passed
@zpuller zpuller deleted the 10172_log_shuffle_manager_class branch May 29, 2024 20:53
Labels
ease of use: Makes the product simpler to use or configure
Development

Successfully merging this pull request may close these issues.

[FEA] Log the shuffle manager class when shuffle manager is misconfigured
6 participants