Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

k8ssandra repair taking days to complete #1631

Open
judefdo-1 opened this issue Nov 17, 2023 · 1 comment
Open

k8ssandra repair taking days to complete #1631

judefdo-1 opened this issue Nov 17, 2023 · 1 comment
Labels
question Further information is requested

Comments

@judefdo-1
Copy link

We had a major disk corruption(AWS Volume) on one of our rack instance. After deleting the PVC and PV, We restarted the rack1.

Once pods are online, there is a huge data mismatch between the query. At some point, it will return 0 row. To fix this issue we started the repair using the reaper operator. But it is taking days, even though we stopped all our services.

@judefdo-1 judefdo-1 added the question Further information is requested label Nov 17, 2023
@adejanovski
Copy link
Contributor

Hi,

sadly this is not how replacements of broken nodes should be done. Repair is efficient for small entropy levels but is not meant to rebuild an empty node because it will re-stream data between all replicas, not just the ones that are missing the data.

There's a "Replace" CassandraTask that is meant for replacing broken nodes, which will recreate a pod with a new PVC and re-bootstrap it by replacing the old one: https://github.com/k8ssandra/cass-operator/blob/master/tests/testdata/tasks/replace_node_task.yaml

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants