Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Kubelite not starting after power failure unless cgroups-per-qos=false #4378

Open
AlexGustafsson opened this issue Jan 24, 2024 · 2 comments
Labels
kind/bug Something isn't working

Comments

@AlexGustafsson
Copy link
Contributor

Summary

After the host had been shut down abruptly, microk8s (kubelite) would no longer start due to the following error:

Jan 24 19:02:35 bernd microk8s.daemon-kubelite[2373]: E0124 19:02:35.011772    2373 kubelet.go:1542] "Failed to start ContainerManager" err="failed to initialize top level QOS containers: root container [kubepods] doesn't exist"

After having applied the workaround mentioned by @neoaggelos in #4301 (comment), microk8s started.

Now microk8s cannot start without those changes.

What Should Happen Instead?

Microk8s should start without having to disable cgroups per qos.

Reproduction Steps

None.

Introspection Report

inspection-report-20240124_193532.tar.gz

@AlexGustafsson
Copy link
Contributor Author

During the last boot, before the power outage, the host had been running for a long time. microk8s had been updated from 1.26, through 1.27, 1.28 to 1.29 without a reboot. So the power cycle might just have exposed issues that would otherwise have shown.

I haven't found anything in the patch notes that suggest that there's some change in how cgroups works lately. The computer haven't been configured any different since it was working. So I'm unsure what would make cgroups misbehave (as suggested in #4301).

@neoaggelos
Copy link
Member

Hi @AlexGustafsson, thank you for raising this. This has been an issue we are seeing with MicroK8s 1.29 recently, see also #4361. I wonder if you are bumping into the same problem.

@neoaggelos neoaggelos added the kind/bug Something isn't working label Jan 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants