K8SSAND-1284 ⁃ Control Plane HA Design #435

Open
sync-by-unito bot opened this issue Feb 24, 2022 · 2 comments · May be fixed by #433
Labels
blocked Issues in the state 'blocked'

Comments


sync-by-unito bot commented Feb 24, 2022

Generate an initial design document for review, proposing a methodology for providing HA for the control plane within clusters deployed by K8ssandra Operator.

┆Issue is synchronized with this Jira Feature by Unito
┆Reviewer: Alexander Dejanovski
┆fixVersions: k8ssandra-operator-v1.2.0
┆friendlyId: K8SSAND-1284
┆priority: Medium

@jdonenine jdonenine linked a pull request Feb 24, 2022 that will close this issue
5 tasks
Author

sync-by-unito bot commented Mar 1, 2022

➤ Jeff DiNoto commented:

The design itself is already drafted; we’ll use this issue to plan out a review/discussion process to finalize the design.

So the goal here is a finalized design and a new Epic scoping out the work.

@sync-by-unito sync-by-unito bot changed the title K8SSAND-1284 ⁃ Control Plan HA Design K8SSAND-1284 ⁃ Control Plane HA Design Mar 1, 2022
Member

Miles-Garnsey commented Mar 1, 2022

One note on how we stage this piece of work. I think we should consider supporting a manual repair/migration process first, then coming up with the fancy leader election process second.

We can do this by adding functionality that allows a control plane k8ssandra-operator instance to take control of an existing CassDC. This kills two birds with one stone:

  1. Facilitating a migration pathway for existing DCs.
  2. Allowing a new control plane to take over from an old one if it breaks somehow (by manually adding the DC).

That first step will probably throw up interesting problems around the way we've drawn boundaries between the control and data planes.
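
To make the manual takeover step concrete, here is a minimal sketch of what "taking control of an existing DC" might look like as an ownership check. Everything in it is hypothetical: the `Datacenter` class, the `example.invalid/managed-by` annotation key, and the `adopt_datacenter` function are illustrative names only and do not reflect k8ssandra-operator's actual API or data model.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional

# Hypothetical annotation key, invented for this sketch.
OWNER_ANNOTATION = "example.invalid/managed-by"


@dataclass
class Datacenter:
    """Stand-in for an existing DC living in a data plane cluster."""
    name: str
    annotations: Dict[str, str] = field(default_factory=dict)


class AlreadyManagedError(Exception):
    """Raised when another control plane already owns the DC."""


def adopt_datacenter(dc: Datacenter, control_plane: str,
                     force: bool = False) -> Optional[str]:
    """Record control_plane as the owner of dc.

    Returns the previous owner (or None). Refuses to replace a different
    owner unless force=True, which models the manual "take over from a
    broken control plane" case.
    """
    previous = dc.annotations.get(OWNER_ANNOTATION)
    if previous is not None and previous != control_plane and not force:
        raise AlreadyManagedError(
            f"{dc.name} is already managed by {previous}"
        )
    dc.annotations[OWNER_ANNOTATION] = control_plane
    return previous
```

The same call would cover both cases listed above: adopting an unowned DC during a migration, and forcing a takeover with `force=True` when the old control plane is gone.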

The second step will then be to do this via an inter-k8s-cluster leader election mechanism. That throws up additional questions about achieving distributed state consistency and such.
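
For reference, the core of a lease-based leader election can be sketched in a few lines. This is an in-memory illustration only, assuming a single shared `Lease` record and an explicit clock; the `Lease` class and `try_acquire` function are hypothetical names. A real inter-cluster mechanism would need the check-and-update to be atomic against a shared store, which is exactly where the consistency questions come in.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Lease:
    """Hypothetical shared lease record; None means nobody holds it."""
    holder: Optional[str] = None
    expires_at: float = 0.0


def try_acquire(lease: Lease, candidate: str, now: float, ttl: float) -> bool:
    """Acquire or renew the lease for candidate.

    Succeeds if the lease is free, already held by candidate (renewal),
    or expired. Otherwise the current holder keeps leadership.
    NOTE: not atomic; a real implementation needs compare-and-swap
    semantics on whatever store holds the lease.
    """
    free = lease.holder is None
    renewing = lease.holder == candidate
    expired = now >= lease.expires_at
    if free or renewing or expired:
        lease.holder = candidate
        lease.expires_at = now + ttl
        return True
    return False
```

A standby control plane would call `try_acquire` periodically and only start reconciling once it returns `True`; the active one keeps renewing before `expires_at`.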

Keeping these two phases segregated will hopefully avoid some headaches.

@adejanovski adejanovski added zh:Review Issues in the ZenHub pipeline 'Review' zh:Ready-For-Review Issues in the ZenHub pipeline 'Ready-For-Review' and removed zh:Review Issues in the ZenHub pipeline 'Review' labels Jul 21, 2022
@adejanovski adejanovski removed the zh:Ready-For-Review Issues in the ZenHub pipeline 'Ready-For-Review' label Sep 9, 2022
@adejanovski adejanovski added the blocked Issues in the state 'blocked' label Nov 22, 2022
Projects
Status: Blocked/Stale

2 participants