What if containers take more than 30 secs to start? #39

itsmesuniljacob · 2021-02-09T05:59:24Z

Hi Team,

There is scenario where many of our containers may take more than 30 secs to start. In other words, 30 seconds (probably) will not be enough for new replicas to start when VMs receive preemption signal.

Is it possible to modify this draining_timeout_when_node_expired_ms values to 45 secs to solve the above problem?

The text was updated successfully, but these errors were encountered:

nrx-ops · 2021-02-11T17:58:36Z

Hi,
I have same issue.
According to documentation : https://cloud.google.com/compute/docs/instances/preemptible#preemption-process
Compute Engine sends a preemption notice to the instance in the form of an ACPI G2 Soft Off signal. You can use a shutdown script to handle the preemption notice and complete cleanup actions before the instance stops.
If the instance does not stop after 30 seconds, Compute Engine sends an ACPI G3 Mechanical Off signal to the operating system.

I think there is no way to override this 30s deadline after G2 ACPI call.

I reduce downtime with using replica and pod anti-affinity.

itsmesuniljacob · 2021-03-22T11:31:10Z

Sure

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What if containers take more than 30 secs to start? #39

What if containers take more than 30 secs to start? #39

itsmesuniljacob commented Feb 9, 2021

nrx-ops commented Feb 11, 2021

itsmesuniljacob commented Mar 22, 2021

What if containers take more than 30 secs to start? #39

What if containers take more than 30 secs to start? #39

Comments

itsmesuniljacob commented Feb 9, 2021

nrx-ops commented Feb 11, 2021

itsmesuniljacob commented Mar 22, 2021