Maximum number of artifacts exceeded. How to aggregate artifacts from >100 ParallelFor iterations #7513

Open
bronevet-abc opened this issue Mar 14, 2024 · 2 comments

Comments

@bronevet-abc

/kind bug

I am running a ParallelFor loop with more than 100 iterations and aggregating the resulting artifacts with dsl.Collected, but I'm getting the following error:
com.google.cloud.ai.platform.common.errors.AiPlatformException: code=INVALID_ARGUMENT, message=Maximum number of artifacts exceeded. Maximum allowed: 100, requested: 270, cause=null

Is there a way to raise this maximum so I can run larger ParallelFor iterations?

  • Kubeflow version: 2.4
  • Running on the GCP Vertex Pipelines service
@siddeshas07

siddeshas07 commented Mar 27, 2024

Hi @bronevet-abc, I'm not sure, but this may help: you could try creating a custom container image for your ParallelFor loop execution. This image could handle artifact management internally, bypassing the 100-artifact limit.
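One possible reading of "handling artifact management internally" is to have each ParallelFor iteration process a batch of work items and emit a single artifact, so the number of aggregated artifacts stays at or below the limit. The sketch below only shows the batching step in plain Python. The helper name `make_batches` and the constant `MAX_ARTIFACTS` are hypothetical, the value 100 is taken from the error message above rather than from any documented setting, and whether Vertex Pipelines counts artifacts this way is an assumption that has not been verified.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")

# Assumption: limit copied from the error message in this issue,
# not from a documented Vertex Pipelines constant.
MAX_ARTIFACTS = 100


def make_batches(items: Sequence[T], max_batches: int = MAX_ARTIFACTS) -> List[List[T]]:
    """Split items into at most max_batches contiguous, near-equal batches."""
    if max_batches < 1:
        raise ValueError("max_batches must be at least 1")
    n_batches = min(max_batches, len(items))
    if n_batches == 0:
        return []
    # The first `extra` batches get one additional item each.
    base, extra = divmod(len(items), n_batches)
    batches: List[List[T]] = []
    start = 0
    for i in range(n_batches):
        size = base + (1 if i < extra else 0)
        batches.append(list(items[start:start + size]))
        start += size
    return batches


# 270 matches the "requested" count in the error message.
work_items = list(range(270))
batches = make_batches(work_items)
print(len(batches), max(len(b) for b in batches))  # prints: 100 3
```

Each batch could then be passed as one item to `dsl.ParallelFor`, with the component inside the loop iterating over its batch and writing one combined output artifact. The downstream task would receive at most 100 artifacts and would need to unpack each one.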

@bronevet-abc
Author

Interesting! I'm not sure of the exact mechanics here. Are you suggesting a custom container for the task running within the ParallelFor, or for the task receiving the aggregated data objects? Can you point to any examples of this or to the relevant APIs? Thank you!


2 participants