Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug: memphis statefulset cant start after fail over #1271

Open
1 task done
waruwat-kkp opened this issue Aug 23, 2023 · 2 comments
Open
1 task done

Bug: memphis statefulset cant start after fail over #1271

waruwat-kkp opened this issue Aug 23, 2023 · 2 comments
Assignees
Labels
bug Something isn't working

Comments

@waruwat-kkp
Copy link

Describe the bug

I test fail over by kill all pod of memphis, memphis can't start back and got following error

container : memphis

nats-server: invalid padding size

container : reloader

2023/08/23 05:03:34 Starting Memphis Config Reloader vversion-not-set
2023/08/23 05:03:34 Error: open /var/run/memphis/memphis.pid: no such file or directory
2023/08/23 05:03:36 Error: open /var/run/memphis/memphis.pid: no such file or directory
2023/08/23 05:03:38 Error: open /var/run/memphis/memphis.pid: no such file or directory
2023/08/23 05:03:40 Error: open /var/run/memphis/memphis.pid: no such file or directory
2023/08/23 05:03:42 Error: open /var/run/memphis/memphis.pid: no such file or directory
2023/08/23 05:03:44 Error: open /var/run/memphis/memphis.pid: no such file or directory
Error: Too many errors attempting to find server process

Steps to reproduce

  1. kill all pod of memphis : statefulset, deployment
  2. waiting for pod re-creation

Affected services

Broker

Broker version

v1.1.1-stable

Platforms

Kubernetes

If UI - Browsers

No response

Environment

No response

Additional context

No response

Code of Conduct

  • I agree to follow this project's Code of Conduct
@waruwat-kkp waruwat-kkp added the bug Something isn't working label Aug 23, 2023
@yanivbh1
Copy link
Contributor

Hey @waruwat-kkp ,
It will be reviewed as soon as possible.
Which flavor and version of K8S are you using? If it is managed - by which vendor?
Have you deployed Memphis in cluster mode?

@idanasulin2706
Copy link
Contributor

Hi @waruwat-kkp this issue you described is happening when memphis secret keys are getting changed.
That means that some encrypted values that have been encrypted with the older keys can not been decrypted with the new ones.
Now this situation might happen in one of two options:

  1. You changed the memphis secrets manually.
  2. You didn't set them to be constant, in this case memphis is randomly generate them on your first helm install and on helm upgrades.

So any chances that these values changed somehow?

And anyway I'm attaching the relevant section in our docs which explains about it.

Here you can make them constant for your choice on installation (creds section)

https://docs.memphis.dev/memphis/open-source-installation/kubernetes/1-installation#optional-helm-deployment-options

This one is for manual upgrade

https://docs.memphis.dev/memphis/open-source-installation/kubernetes/how-to-upgrade#upgrade-memphis-cluster-with-helm-upgrade-using-a-manual-rolling-upgrade

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants