prometheus metric shows Node as 100% unhealthy always #126

Open
surendarmsk1 opened this issue Nov 24, 2022 · 0 comments


surendarmsk1 commented Nov 24, 2022

I have a Grafana dashboard configured with the query below, but it consistently shows 100% unhealthy status for most of the nodes. I am not sure whether we are hitting a bug where the default ping check times out at 300ms. Can someone help explain why this occurs and how to triage it further?

[Screenshot: 6hr metric view]

Grafana Query:

sum(increase(goldpinger_nodes_health_total{cluster="$cluster",goldpinger_instance="$instance",status="unhealthy"}[15m])) by (goldpinger_instance) / (sum(increase(goldpinger_nodes_health_total{cluster="$cluster",goldpinger_instance="$instance",status="healthy"}[15m])) by (goldpinger_instance) + sum(increase(goldpinger_nodes_health_total{cluster="$cluster",goldpinger_instance="$instance",status="unhealthy"}[15m])) by (goldpinger_instance))
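For anyone triaging this, the query reduces to unhealthy / (healthy + unhealthy) over a 15m window, per goldpinger_instance. Here is a minimal Python sketch of that arithmetic. The function name and the sample numbers are hypothetical and only illustrate the formula; they are not values from the cluster.

```python
from typing import Optional


def unhealthy_ratio(healthy_increase: float, unhealthy_increase: float) -> Optional[float]:
    """Mirror the query: unhealthy / (healthy + unhealthy) for one instance.

    Both arguments stand in for increase(...[15m]) of the healthy and
    unhealthy counters. Returns None when there were no checks at all,
    since the ratio is undefined in that case.
    """
    total = healthy_increase + unhealthy_increase
    if total == 0:
        return None
    return unhealthy_increase / total


# Hypothetical numbers: every check in the window failed, so the ratio is 1.0 (100%).
all_failed = unhealthy_ratio(healthy_increase=0, unhealthy_increase=45)

# Hypothetical numbers: 10 failures out of 40 checks gives 0.25 (25%).
some_failed = unhealthy_ratio(healthy_increase=30, unhealthy_increase=10)
```

So a constant 100% means the healthy counter is not increasing at all in the window, which would be consistent with every ping timing out.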

Repeated warning message in the Goldpinger pod logs:
{"level":"warn","ts":1669303893.1442885,"caller":"goldpinger/pinger.go:151","msg":"Ping returned error","op":"pinger","name":"goldpinger","hostIP":"XX.XX.XX.XX","podIP":"XX.XX.XX.XX","responseTime":0.300629455,"error":"Get \"http://XX.XX.XX.XX:8080/ping\": context deadline exceeded"}
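One way to check whether these warnings line up with the timeout is to parse the JSON log lines and compare responseTime against the limit. This is only a rough sketch: the 0.300 second value is an assumption based on the 300ms default mentioned above, and the helper name is made up for illustration.

```python
import json

# Assumption: the ping timeout is the 300ms default mentioned in this issue.
ASSUMED_TIMEOUT_SECONDS = 0.300


def exceeded_timeout(log_line: str, timeout: float = ASSUMED_TIMEOUT_SECONDS) -> bool:
    """Return True if the line is a ping error whose responseTime reached the timeout."""
    entry = json.loads(log_line)
    if entry.get("msg") != "Ping returned error":
        return False
    return float(entry.get("responseTime", 0.0)) >= timeout


# The warning from the pod logs, with the inner quotes JSON-escaped.
sample = r'{"level":"warn","ts":1669303893.1442885,"caller":"goldpinger/pinger.go:151","msg":"Ping returned error","op":"pinger","name":"goldpinger","hostIP":"XX.XX.XX.XX","podIP":"XX.XX.XX.XX","responseTime":0.300629455,"error":"Get \"http://XX.XX.XX.XX:8080/ping\": context deadline exceeded"}'

hit_timeout = exceeded_timeout(sample)
```

For the line above, responseTime is 0.300629455, just over the assumed 300ms, which fits the "context deadline exceeded" error.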
 

surendarmsk1 changed the title from "prometheus metric of unhealthy node shows 100% unhealthy always" to "prometheus metric shows Node as 100% unhealthy always" on Nov 24, 2022