Instance Sizing #12

Open
chrislovecnm opened this issue Apr 20, 2021 · 4 comments
Labels: documentation (Improvements or additions to documentation), zh:Icebox (Issues in the ZenHub pipeline 'Icebox')

@chrislovecnm
Contributor

chrislovecnm commented Apr 20, 2021

An open issue for documenting GKE instance sizing

┆Issue is synchronized with this Jira Task by Unito
┆friendlyId: K8SSAND-175
┆priority: Medium

@bradfordcp bradfordcp added the documentation Improvements or additions to documentation label Apr 24, 2021
@bradfordcp
Member

@adejanovski can you provide some guidance here in line with what's being used for performance testing?

IIRC we were looking at 8-16 vCPUs (leaning towards 16 here) and 64 GB RAM.

@adejanovski

Currently we're using e2-highmem-8 (8 vCPUs and 64 GB RAM) for perf testing.
e2-highmem-16 (16 vCPUs and 128 GB RAM) is also a good fit for larger workloads/density.

General recommendations for running Cassandra in production:

  • 8 to 16 vCPUs
  • 32 GB to 128 GB RAM (keep the heap at 31 GB or less; the rest will be used for off-heap memory and the file cache)
  • ~2 TB of live data per node (not a hard limit: Cassandra can support much more, but time to recovery gets fairly high beyond that).
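
As a rough illustration of the recommendations above, here is a minimal Python sketch that checks a candidate node against those bounds. The names (`NodeSpec`, `sizing_warnings`, `non_heap_gb`) are hypothetical and not part of any tool; the thresholds are simply the numbers from the list, treated as guidelines rather than hard limits.

```python
from dataclasses import dataclass

# Guideline values copied from the recommendations above (not hard limits).
MIN_VCPUS, MAX_VCPUS = 8, 16
MIN_RAM_GB, MAX_RAM_GB = 32, 128
MAX_HEAP_GB = 31
SOFT_MAX_LIVE_DATA_TB = 2.0


@dataclass(frozen=True)
class NodeSpec:
    """Hypothetical description of a candidate node."""
    name: str
    vcpus: int
    ram_gb: int


def non_heap_gb(spec: NodeSpec, heap_gb: int) -> int:
    """RAM left over for off-heap memory and the file cache."""
    return spec.ram_gb - heap_gb


def sizing_warnings(spec: NodeSpec, heap_gb: int, live_data_tb: float) -> list[str]:
    """Return one warning per guideline the node falls outside of."""
    warnings = []
    if not MIN_VCPUS <= spec.vcpus <= MAX_VCPUS:
        warnings.append(f"{spec.vcpus} vCPUs is outside {MIN_VCPUS}-{MAX_VCPUS}")
    if not MIN_RAM_GB <= spec.ram_gb <= MAX_RAM_GB:
        warnings.append(f"{spec.ram_gb} GB RAM is outside {MIN_RAM_GB}-{MAX_RAM_GB} GB")
    if heap_gb > MAX_HEAP_GB:
        warnings.append(f"{heap_gb} GB heap exceeds {MAX_HEAP_GB} GB")
    if live_data_tb > SOFT_MAX_LIVE_DATA_TB:
        warnings.append(f"{live_data_tb} TB live data exceeds ~{SOFT_MAX_LIVE_DATA_TB} TB")
    return warnings


# The perf-testing machine type mentioned above, with an assumed 31 GB heap.
perf_node = NodeSpec("e2-highmem-8", vcpus=8, ram_gb=64)
print(sizing_warnings(perf_node, heap_gb=31, live_data_tb=1.5))
print(non_heap_gb(perf_node, heap_gb=31))
```

The 31 GB heap and 1.5 TB of live data in the usage lines are example inputs only, not measured values.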

@bradfordcp
Member

With zero copy streaming in 4.0 (and 10 GbE), I'm curious what density we could reach while matching the time it currently takes to stream 1-2 TB on 3.11.
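
For a sense of scale, the back-of-the-envelope arithmetic behind this question can be sketched in Python. This only computes the network line-rate lower bound (decimal units, 1 TB = 10^12 bytes) and a simple proportional scaling; the function names and the `efficiency` parameter are assumptions for illustration, and real streaming on either version will be slower than line rate.

```python
def min_stream_seconds(data_tb: float, link_gbps: float = 10.0,
                       efficiency: float = 1.0) -> float:
    """Lower bound on streaming time if the network link is the only limit.

    data_tb    -- data to stream, in decimal terabytes (10**12 bytes)
    link_gbps  -- link speed in gigabits per second (10 for 10 GbE)
    efficiency -- assumed fraction of line rate actually achieved (0-1]
    """
    data_bits = data_tb * 1e12 * 8
    return data_bits / (link_gbps * 1e9 * efficiency)


def density_for_same_time(baseline_tb: float, baseline_throughput: float,
                          new_throughput: float) -> float:
    """Density that streams in the same time, given two effective throughputs.

    Throughputs can be in any unit as long as both use the same one.
    """
    return baseline_tb * new_throughput / baseline_throughput


for tb in (1.0, 2.0):
    minutes = min_stream_seconds(tb) / 60
    print(f"{tb} TB over 10 GbE: at least {minutes:.1f} minutes")
```

If a benchmark showed 4.0 streaming at some multiple of the 3.11 throughput, `density_for_same_time` would scale the 1-2 TB figure by that multiple; the actual throughput numbers would have to come from measurement.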

@adejanovski

Zero copy streaming as implemented in 4.0 only works for tables using LCS and requires few vnodes (ideally none), which reduces the benefit we can expect from it.
Compaction and repair improvements should help with higher densities, though. Even without zero copy streaming, I'd expect the lighter memory footprint of 4.0 to reduce GC pressure during streaming and make it faster.

It could be interesting to benchmark this specific aspect to compare 4.0 to 3.11 time to recovery.

@adejanovski adejanovski added the zh:Icebox Issues in the ZenHub pipeline 'Icebox' label Jun 12, 2022
5 participants