AWS EKS Cluster Autoscaling

This repo demonstrates the cluster autoscaling feature of K8S.

  • To adjust to changing application demands, clusters often need a way to scale automatically. In AWS EKS, automatic scaling is done primarily in two ways: the Cluster Autoscaler (CA) and the Horizontal Pod Autoscaler (HPA).
    • The Cluster Autoscaler (CA) watches for pods that can't be scheduled on nodes because of resource constraints. If a node doesn't have sufficient compute resources to run a requested pod, that pod can't progress through the scheduling process and can't start until additional compute resources are available within the node pool. When the CA notices such resource constraints, it automatically increases the number of nodes.
    • Kubernetes uses the Horizontal Pod Autoscaler (HPA) to monitor resource demand and automatically scale the number of replicas. By default, the HPA checks the Metrics API, backed by the Metrics Server running in the cluster, to monitor the resource demand of pods. If an application needs more resources, the number of pods is automatically increased to meet the demand (a minimal HPA command sketch appears at the end of this list).
    • The diagrams below illustrate HPA and CA functionality.
      (diagrams: HPA and CA scaling workflows)
    • The cluster autoscaler and horizontal pod autoscaler can work together and are often both deployed in a cluster. When combined, the HPA focuses on running the number of pods required to meet application demand, while the CA focuses on running the number of nodes required to support the scheduled pods.
  • The official AWS documentation describes the steps to implement EKS cluster autoscaling. This walkthrough uses the Kubernetes Cluster Autoscaler.
  • Some steps in that documentation do not work properly as written; those are highlighted here.
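  • As a quick illustration of the HPA side (a hedged sketch, not one of the CA demo steps below), a deployment can be autoscaled imperatively with kubectl; php-apache refers to the deployment used later in this walkthrough, and the CPU target and replica bounds are illustrative:
    $ kubectl autoscale deployment php-apache --cpu-percent=50 --min=1 --max=10
    $ kubectl get hpa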

Steps for Cluster Autoscaling with Kubernetes Cluster Autoscaler

  • Region us-east-1 sometimes gives issues with availability zones, so it may be convenient to choose another AWS region. Here us-west-2 is chosen (aws configure).
  • Create a cluster with the command below. It creates two m5.large EC2 instances in the region configured in your CLI (aws configure).
    $ eksctl create cluster --name my-cluster --version 1.23 --managed --asg-access
  • Create or update a kubeconfig file for the cluster so that kubectl commands can be run against it. Replace region-code with the AWS Region that the cluster is in and replace my-cluster with the name of the cluster.
    $ aws eks update-kubeconfig --region region-code --name my-cluster
  • Complete the prerequisites mentioned in the official AWS documentation shared above.
  • (Optional) Create an IAM policy (AmazonEKSClusterAutoscalerPolicy) and paste the JSON code from the text file "AmazonEKSClusterAutoscalerPolicy.txt" in this repo.
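    If you prefer the CLI, a hedged sketch (assuming the JSON from the repo is saved locally as AmazonEKSClusterAutoscalerPolicy.json):
    $ aws iam create-policy --policy-name AmazonEKSClusterAutoscalerPolicy --policy-document file://AmazonEKSClusterAutoscalerPolicy.json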
  • Create an IAM role and service account with the command below (replace --attach-policy-arn with the ARN of the policy created by eksctl, or the policy created above).
  • If you created your node groups using the --asg-access option, use the name of the IAM policy that eksctl created for you. The policy name is similar to eksctl-my-cluster-nodegroup-ng-xxxxxxxx-PolicyAutoScaling. (Optionally) If you created the policy AmazonEKSClusterAutoscalerPolicy, use that instead.
    • Replace the account ID with the one that is being used.
    • Replace the policy name "eksctl-my-cluster-nodegroup-ng-ae84bd0e-PolicyAutoScaling" with the one eksctl created for you.
      $ eksctl create iamserviceaccount --cluster=my-cluster --namespace=kube-system --name=cluster-autoscaler --attach-policy-arn=arn:aws:iam::<<-your account id->>:policy/eksctl-my-cluster-nodegroup-ng-ae84bd0e-PolicyAutoScaling --override-existing-serviceaccounts --approve
  • Check the created service account and role. The annotation with the IAM role ARN should be present.
    $ kubectl describe sa -n kube-system cluster-autoscaler
  • Deploy the Cluster Autoscaler (see the sketch below).
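    The AWS documentation deploys the autoscaler from the example manifest in the kubernetes/autoscaler repo; a hedged sketch, assuming the cluster is named my-cluster (the manifest ships with a <YOUR CLUSTER NAME> placeholder that must be replaced):
    $ curl -O https://raw.githubusercontent.com/kubernetes/autoscaler/master/cluster-autoscaler/cloudprovider/aws/examples/cluster-autoscaler-autodiscover.yaml
    $ sed -i 's/<YOUR CLUSTER NAME>/my-cluster/' cluster-autoscaler-autodiscover.yaml
    $ kubectl apply -f cluster-autoscaler-autodiscover.yaml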
  • Annotate the cluster-autoscaler service account with the ARN of the IAM role that was created earlier, as described in the AWS documentation. Check the service account (kubectl describe sa) first; if the annotation is already present, skip this step.
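    A sketch of the annotate command; the account ID and the role name (AmazonEKSClusterAutoscalerRole) are placeholders for the role created earlier:
    $ kubectl annotate serviceaccount cluster-autoscaler -n kube-system eks.amazonaws.com/role-arn=arn:aws:iam::<<-your account id->>:role/AmazonEKSClusterAutoscalerRole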
  • To add the cluster-autoscaler.kubernetes.io/safe-to-evict annotation, execute the command below (the patch command given in the official documentation sometimes does not work).
    $ kubectl -n kube-system annotate deployment.apps/cluster-autoscaler cluster-autoscaler.kubernetes.io/safe-to-evict="false"
  • As mentioned in the official documentation, edit the Cluster Autoscaler deployment to add the options "--balance-similar-node-groups" and "--skip-nodes-with-system-pods=false" (see the sketch below).
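    A sketch of the edit; the flags are appended to the container's command list in the deployment spec (exact surrounding flags may differ in your deployment):
    $ kubectl -n kube-system edit deployment.apps/cluster-autoscaler
    # then, in the container's command list (spec.template.spec.containers), append:
    #   - --balance-similar-node-groups
    #   - --skip-nodes-with-system-pods=false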
  • (Optional) Verify the deployment.
    $ kubectl describe deployment -n kube-system cluster-autoscaler
  • Set the Cluster Autoscaler image tag to the release matching your cluster's Kubernetes version, as listed on the project's official GitHub page and mentioned in the official AWS documentation.
    $ kubectl set image deployment cluster-autoscaler -n kube-system cluster-autoscaler=k8s.gcr.io/autoscaling/cluster-autoscaler:v1.23.0
  • View the Cluster Autoscaler logs to verify that it is working properly and monitoring the cluster load (see the command below).
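    A standard way to tail the logs:
    $ kubectl -n kube-system logs -f deployment.apps/cluster-autoscaler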
  • Navigate to "Auto Scaling Groups" in the AWS EC2 console and update the maximum capacity of the node group's Auto Scaling group to 6 (eksctl initially sets desired, minimum, and maximum all to 2). Only the maximum value needs updating.
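    The same change can be made from the CLI; a hedged sketch, where the placeholder must be replaced with your node group's actual Auto Scaling group name:
    $ aws autoscaling update-auto-scaling-group --auto-scaling-group-name <<-your asg name->> --max-size 6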
  • Apply the deployment "php-apache.yaml" from this repo with the command below. It creates 1 replica.
    $ kubectl apply -f php-apache.yaml
  • At this point the cluster still has the original 2 m5.large EC2 instances.
  • Edit the "php-apache.yaml" file, update the replicas to 20, and reapply with the command below.
    $ kubectl apply -f php-apache.yaml
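    Alternatively (a sketch, equivalent in effect to editing the manifest), the deployment can be scaled imperatively:
    $ kubectl scale deployment php-apache --replicas=20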
  • Now 20 pods are requested with 500m CPU each, i.e. 10 vCPUs in total, which is equivalent to 5 m5.large EC2 instances (2 vCPUs each). Many pods will be in Pending status (not Running), which triggers the CA.
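    The pending pods can be observed while the scale-up is in progress:
    $ kubectl get pods --field-selector=status.phase=Pending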
  • Check the Activity tab of the Auto Scaling group; new instances can be seen being launched. The Cluster Autoscaler checks for unschedulable pods every 10 seconds by default.
  • In a few minutes the EC2 console shows 5 to 6 m5.large machines created by the Cluster Autoscaler. The same can be seen in the terminal by executing the command below.
    $ kubectl get nodes
  • Edit the "php-apache.yaml" file, update the replicas back to 1, and reapply with the command below. The scale-down operation takes about 10 minutes.
    $ kubectl apply -f php-apache.yaml
  • Clean up the AWS environment.
    $ eksctl delete cluster --name my-cluster
  • By default, the Cluster Autoscaler waits 10 minutes between scale-down operations; this can be adjusted using the --scale-down-delay-after-add, --scale-down-delay-after-delete, and --scale-down-delay-after-failure flags. For example, --scale-down-delay-after-add=5m decreases the scale-down delay to 5 minutes after a node has been added (see the sketch below).
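    These flags go in the same place as the earlier options, in the container's command list in the Cluster Autoscaler deployment; a sketch:
    $ kubectl -n kube-system edit deployment.apps/cluster-autoscaler
    # then, in the container's command list, append for example:
    #   - --scale-down-delay-after-add=5m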

