Getting frequent restarts for Prometheus-msteams [BUG] #267

Open
firoshaq opened this issue Oct 12, 2022 · 4 comments
Labels
bug Something isn't working

Comments

@firoshaq

We are seeing prometheus-msteams being restarted at times, and it is showing OOM errors.

We have increased the CPU and memory to the values below as well, but the pods are still getting restarted.

resources:
  limits:
    cpu: 50m
    memory: 100Mi
  requests:
    cpu: 50m
    memory: 100Mi

Interestingly, we don't see the pod hitting the limits anywhere.

kubectl top pod prometheus-msteams-58bcd967fc-pdpw9
NAME                                  CPU(cores)   MEMORY(bytes)
prometheus-msteams-58bcd967fc-pdpw9   6m           50Mi
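
For what it's worth, kubectl top only shows a point-in-time sample, so a short spike around the restarts could be missed. A quick way to check whether the restarts are actually OOM kills rather than failed liveness probes (standard kubectl, using the pod name above):

kubectl describe pod prometheus-msteams-58bcd967fc-pdpw9 | grep -A 5 "Last State"
kubectl get events --field-selector involvedObject.name=prometheus-msteams-58bcd967fc-pdpw9

If the last state shows Reason: OOMKilled, the limit was hit between samples; if the events show the container being killed for failing its liveness probe, memory is likely not the problem.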

Here are some of the event logs:

13m         Warning   Unhealthy                pod/prometheus-msteams-58bcd967fc-pdpw9                                            Readiness probe failed: Get "http://IP:2000/config": dial tcp IP:2000: connect: connection refused
57m         Warning   Unhealthy                pod/prometheus-msteams-58bcd967fc-pdpw9                                            Liveness probe failed: Get "http://IP:2000/config": context deadline exceeded (Client.Timeout exceeded while awaiting headers)
57m         Warning   Unhealthy                pod/prometheus-msteams-58bcd967fc-pdpw9                                            Readiness probe failed: Get "http://IP:2000/config": context deadline exceeded (Client.Timeout exceeded while awaiting headers)
58m         Warning   Unhealthy                pod/prometheus-msteams-58bcd967fc-pdpw9                                            Liveness probe failed: Get "http://IP:2000/config": dial tcp IP:2000: connect: connection refused
58m         Warning   BackOff                  pod/prometheus-msteams-58bcd967fc-pdpw9                                            Back-off restarting failed container

Version
1.5.0

Expected behavior
A stable prometheus-msteams

@firoshaq firoshaq added the bug Something isn't working label Oct 12, 2022
@ckotzbauer
Collaborator

Can you try increasing the CPU limit? 50m is not that much; maybe the probes are not being answered in time.
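
If raising the CPU limit alone doesn't help, relaxing the probe timing can also give the process more headroom under throttling. A minimal sketch using the standard Kubernetes probe fields (the path and port are taken from the events above; the exact keys to set in the chart values may differ, so treat this as an illustration of the pod spec, not of the chart's API):

livenessProbe:
  httpGet:
    path: /config
    port: 2000
  timeoutSeconds: 5
  periodSeconds: 15
  failureThreshold: 3

A longer timeoutSeconds in particular avoids the "context deadline exceeded" failures when the container is briefly CPU-throttled.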

@firoshaq
Author

Sure, we can do that, but our understanding is that prometheus-msteams is a really lightweight Go web server that only makes API calls, so it shouldn't be consuming that much.

We even started with the default values mentioned here and then increased to the values mentioned above.

So we want to understand whether there is a memory leak that might be causing this issue.

Regards,
Firos Haq

@ckotzbauer
Collaborator

Resource consumption depends on the load and your setup. In general it should be low, but as I said, that depends on your specific setup, so the default values from the chart are only a suggestion.

If the OOMs are gone now with the increased memory, then I don't see where a massive memory leak would be.
Or do the OOMs appear after several days, with memory usage increasing over time?

The probe failures may be caused by the low CPU limit, as I described above.
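
One way to tell whether usage actually grows over time is to sample the pod periodically instead of relying on a single kubectl top reading, e.g.:

watch -n 60 kubectl top pod prometheus-msteams-58bcd967fc-pdpw9

If the memory column climbs steadily toward the limit over hours or days, that would point to a leak; if it stays flat and restarts still happen, the probe/CPU explanation above is more likely.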

@byroncollins

We had a similar problem with prometheus-msteams pods restarting occasionally; we adjusted the container resources and haven't had an issue since.

We overrode the default resources in our values.yml file with:

resources:
  limits:
    cpu: 50m
    memory: 64Mi
  requests:
    cpu: 25m
    memory: 25Mi 
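
For anyone applying the same override, these values go under the chart's top-level resources key and can be rolled out with a normal Helm upgrade; the release and chart names below are just an example and may differ in your setup:

helm upgrade prometheus-msteams prometheus-msteams/prometheus-msteams -f values.yml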
