Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Switching authorization from ConfigMap to API and ConfigMap breaks fargate (or is it EKS bug) #2972

Closed
cdenneen opened this issue Mar 14, 2024 · 4 comments

Comments

@cdenneen
Copy link

So I think this might be a larger EKS bug but it seems when switching from ConfigMap to api & ConfigMap the module switches from the old ConfigMap entries to the new access_entries but it complains that the fargate profile is misconfigured:

  Warning  FailedScheduling  40m   fargate-scheduler  Misconfigured Fargate Profile: fargate profile karpenter blocked for new launches due to: Pod execution role is not found in auth config or does not have all required permissions for launching fargate pods.

However I have seen this on a brand new cluster too. I destroyed the new cluster created which had this complaint and recreated exactly same and "second" new cluster worked. I'm not sure why some times clusters have this issues and some don't...

I'm seeing this on existing cluster upgrades as well where I have 2 fargate_profiles defined using this module and the coredns one worked fine and pod came up but the Karpenter one had this error. The module creates 2 access entries that except for the name look the same so maybe its the EKS managed backend that all regions or something haven't been properly updated and that's why sometimes this works and other times it doesn't?

Not sure where to raise this as EKS bug though so I figured I'd start here since the developers are on the EKS team.

@cdenneen cdenneen changed the title Switching authorization from ConfigMap to API and ConfigMap breaks fargate Switching authorization from ConfigMap to API and ConfigMap breaks fargate (or is it EKS bug) Mar 14, 2024
@cdenneen
Copy link
Author

I deleted the fargate profile from the console, deleted the role, deleted the access entry and allowed module to recreate them and issue was gone this time. Definitely something strange on backend whether it's the conversion or managed control plane itself.

@bryantbiggs
Copy link
Member

duplicate of #2912

@cdenneen can you open an AWS support case with your cluster ARN and the time period when you encountered this behavior, please

@bryantbiggs
Copy link
Member

closing in favor of #2912

Copy link

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues. If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Apr 17, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants