Add the `VPAAndHPAForAPIServer` feature gate for the gardener-operator #9735

ialidzhikov · 2024-05-10T15:21:30Z

How to categorize this PR?

/area auto-scaling
/kind enhancement

What this PR does / why we need it:

Which issue(s) this PR fixes:
Part of #9562
A follow-up of #9678

Special notes for your reviewer:

This PR is based on #9678 (Introduce a new autoscaling mode (VPAAndHPA) for Shoot Kubernetes API servers), hence it was kept in draft state until #9678 was merged. The PR is now rebased after the merge of #9678.

Release note:

The `VPAAndHPAForAPIServer` feature gate is now also implemented for the gardener-operator. When enabled, the virtual-garden-kube-apiserver and gardener-apiserver are scaled simultaneously by VPA and HPA on the same metrics (CPU and memory usage).

gardener-prow · 2024-05-10T15:21:34Z

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

voelzmo · 2024-05-17T11:41:05Z

Hey @ialidzhikov, thanks for the PR! While looking at the changes, I was wondering if we're missing the removal code for HVPA, HPA and VPA objects for the cases when the autoscalingMode is changed? This seems to have been broken already for any switch between HVPA enabled or HVPA disabled, but we never saw this?
Once we merge this and the mode gets changed from HVPA to VPAAndHPA, I guess we would still have an HVPA object and the corresponding VPA and HPA objects created by the hvpa-controller, right?

ialidzhikov · 2024-05-20T12:53:42Z

> While looking at the changes, I was wondering if we're missing the removal code for HVPA, HPA and VPA objects for the cases when the autoscalingMode is changed? This seems to have been broken already for any switch between HVPA enabled or HVPA disabled, but we never saw this?
> Once we merge this and the mode gets changed from HVPA to VPAAndHPA, I guess we would still have an HVPA object and the corresponding VPA and HPA objects created by the hvpa-controller, right?

The kubernetes apiserver component (pkg/component/kubernetes/apiserver, used for the Shoot kube-apiserver and the virtual-garden-kube-apiserver) is NOT deployed via GRM, but with a client. Hence, it uses explicit client invocations to delete the objects that are no longer needed:

gardener/pkg/component/kubernetes/apiserver/hvpa.go

Lines 32 to 36 in 6d6c06c

    if k.values.Autoscaling.Mode != apiserver.AutoscalingModeHVPA ||
        k.values.Autoscaling.Replicas == nil ||
        *k.values.Autoscaling.Replicas == 0 {
        return kubernetesutils.DeleteObject(ctx, k.client.Client(), hvpa)
    }

gardener/pkg/component/kubernetes/apiserver/verticalpodautoscaler.go

Lines 28 to 37 in 6d6c06c

    func (k *kubeAPIServer) reconcileVerticalPodAutoscaler(ctx context.Context, verticalPodAutoscaler *vpaautoscalingv1.VerticalPodAutoscaler, deployment *appsv1.Deployment) error {
        switch k.values.Autoscaling.Mode {
        case apiserver.AutoscalingModeHVPA:
            return kubernetesutils.DeleteObject(ctx, k.client.Client(), verticalPodAutoscaler)
        case apiserver.AutoscalingModeVPAAndHPA:
            return k.reconcileVerticalPodAutoscalerInVPAAndHPAMode(ctx, verticalPodAutoscaler, deployment)
        default:
            return k.reconcileVerticalPodAutoscalerInBaselineMode(ctx, verticalPodAutoscaler, deployment)
        }
    }

gardener/pkg/component/kubernetes/apiserver/horizontalpodautoscaler.go

Lines 38 to 50 in 6d6c06c

    func (k *kubeAPIServer) reconcileHorizontalPodAutoscaler(ctx context.Context, hpa *autoscalingv2.HorizontalPodAutoscaler, deployment *appsv1.Deployment) error {
        if k.values.Autoscaling.Mode == apiserver.AutoscalingModeHVPA ||
            k.values.Autoscaling.Replicas == nil ||
            *k.values.Autoscaling.Replicas == 0 {
            return kubernetesutils.DeleteObject(ctx, k.client.Client(), hpa)
        }

        if k.values.Autoscaling.Mode == apiserver.AutoscalingModeVPAAndHPA {
            return k.reconcileHorizontalPodAutoscalerInVPAAndHPAMode(ctx, hpa, deployment)
        }

        return k.reconcileHorizontalPodAutoscalerInBaselineMode(ctx, hpa, deployment)
    }
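To illustrate the pattern in the snippets above, here is a minimal, self-contained sketch. It is not the Gardener code: the `objectStore` map, the `mode` type and the `reconcileHPA` function are hypothetical stand-ins for the real client, the `apiserver.AutoscalingMode*` constants and the reconcile funcs. It only shows the idea that every reconcile either upserts the object or deletes it explicitly, so a mode switch cleans up after itself.

```go
package main

import "fmt"

// mode is a hypothetical stand-in for the component's autoscaling mode.
type mode string

const (
	modeBaseline  mode = "Baseline"
	modeHVPA      mode = "HVPA"
	modeVPAAndHPA mode = "VPAAndHPA"
)

// objectStore is a hypothetical in-memory replacement for a Kubernetes client.
type objectStore map[string]string

// hpaName is an illustrative object key, not a real resource name.
const hpaName = "hpa/kube-apiserver"

// reconcileHPA follows the shape of the snippets above: when the HPA is not
// wanted (HVPA mode, or no replicas), it is deleted explicitly; otherwise it
// is created or updated for the current mode.
func reconcileHPA(store objectStore, m mode, replicas *int32) {
	if m == modeHVPA || replicas == nil || *replicas == 0 {
		// Deleting a missing key is a no-op, so repeated reconciles are safe.
		delete(store, hpaName)
		return
	}
	store[hpaName] = string(m)
}

func main() {
	store := objectStore{}
	replicas := int32(3)

	// The HPA is created in VPAAndHPA mode.
	reconcileHPA(store, modeVPAAndHPA, &replicas)
	fmt.Println(store[hpaName])

	// Switching the mode to HVPA removes the HPA created above.
	reconcileHPA(store, modeHVPA, &replicas)
	fmt.Println(len(store))
}
```

Because the delete branch runs on every reconcile, no separate migration step is needed when the mode changes in this sketch.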

For the gardener apiserver (pkg/component/gardener/apiserver) - this is a component deployed via GRM:

gardener/pkg/component/gardener/apiserver/apiserver.go

Lines 145 to 153 in 876f6f0

    runtimeResources, err := runtimeRegistry.AddAllAndSerialize(
        g.podDisruptionBudget(),
        g.serviceRuntime(),
        g.horizontalPodAutoscaler(),
        g.verticalPodAutoscaler(),
        g.hvpa(),
        g.deployment(secretCAETCD, secretETCDClient, secretGenericTokenKubeconfig, secretServer, secretAdmissionKubeconfigs, secretETCDEncryptionConfiguration, secretAuditWebhookKubeconfig, secretVirtualGardenAccess, configMapAuditPolicy, configMapAdmissionConfigs),
        g.serviceMonitor(),
    )

Hence, for the gardener apiserver component it is enough to return nil from the verticalPodAutoscaler/horizontalPodAutoscaler/hvpa funcs: GRM takes care of deleting the objects that are no longer desired.

rfranzke · 2024-05-21T07:24:11Z

/assign

docs/deployment/feature_gates.md

pkg/component/gardener/apiserver/hpa.go

rfranzke · 2024-05-27T13:56:23Z

/lgtm

gardener-prow · 2024-05-27T13:56:28Z

LGTM label has been added.

Git tree hash: 3fb2e5501e072f30b500c92d4353875f1e590096

voelzmo

/lgtm

ialidzhikov · 2024-05-28T13:05:29Z

/approve

gardener-prow · 2024-05-28T13:06:04Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ialidzhikov, voelzmo

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [ialidzhikov]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

gardener-prow bot requested review from rfranzke and ScheererJ May 10, 2024 15:21

gardener-prow bot added the size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. label May 10, 2024

ialidzhikov force-pushed the enh/vpaandhpa-for-operator branch from a4651e6 to 25deeb4 Compare May 13, 2024 12:45

ialidzhikov added 5 commits May 15, 2024 15:05

Add the VPAAndHPAForAPIServer feature gate for the gardener-operator

307ec0b

Enable the VPAAndHPA autoscaling mode for the virtual-garden-kube-api…

22a40a8

…server

Enable the VPAAndHPA autoscaling mode for the gardener-apiserver

876f6f0

Add docs for virtual-garden-apiserver and gardener-apiserver autoscaling

f032b09

Address review comments from vlerenc

656589f

ialidzhikov force-pushed the enh/vpaandhpa-for-operator branch from 8391e51 to 656589f Compare May 15, 2024 12:13

gardener-prow bot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. labels May 15, 2024

ialidzhikov marked this pull request as ready for review May 15, 2024 13:16

gardener-prow bot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label May 15, 2024

gardener-prow bot requested review from ary1992 and oliver-goetz May 15, 2024 13:16

gardener-prow bot assigned rfranzke May 21, 2024

rfranzke reviewed May 21, 2024

docs/deployment/feature_gates.md (outdated, resolved)

pkg/component/gardener/apiserver/hpa.go (resolved)

pkg/component/gardener/apiserver/hpa.go (outdated, resolved)

Address review comments from rfranzke

2eb806d

gardener-prow bot added the lgtm Indicates that a PR is ready to be merged. label May 27, 2024

voelzmo approved these changes May 28, 2024

gardener-prow bot assigned voelzmo May 28, 2024

gardener-prow bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label May 28, 2024

gardener-prow bot merged commit c72cab2 into gardener:master May 28, 2024
18 checks passed

ialidzhikov deleted the enh/vpaandhpa-for-operator branch May 28, 2024 15:05
