-
Notifications
You must be signed in to change notification settings - Fork 38.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Emit an event when the result of a probe for a container changes #115963
Emit an event when the result of a probe for a container changes #115963
Conversation
Hi @intUnderflow. Thanks for your PR. I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/ok-to-test |
Had some thoughts:
Given that I think this is ready to go |
/cc |
This PR may require API review. If so, when the changes are ready, complete the pre-review checklist and request an API review. Status of requested reviews is tracked in the API Review project. |
/retest |
As far as I can tell this will not work as you expect. I believe that what you expect to see is the Event going from below threshold to above threshold after some failures right? If that is the case, this is not what you will see on the apiserver side because of how we aggregate Events in the client. When merging similar Events together the client disregards the annotations and it is completely intended because annotations are not part of what we considered to be the structure of an Event. What will happen in practice is that the client will merge the existing emissions of below threshold Events for your Pod with the newer above threshold Event and emit a below threshold Event instead of an above threshold one. That said, have you perhaps considered emitting a new Event instead of reusing an existing one? |
I have very little context in this area, but if you have any ideas as to how we consider the design around new events / etc I can start to put together a KEP and refactor this PR around that? |
when I looked at it I imagined two events - one for failed probe under the threshold and another - after. Those should be different events so the first will not make the second to be throttled. As for tracking specific failure trends - it is indeed the best done via metrics. |
gentle ping @mrunalp @derekwaynecarr @dchen1107 @klueska |
@SergeyKanzhelev propose to move to priority/important-longterm, are you happy with this given you initially triaged the underlying issue? Basing off https://www.kubernetes.dev/docs/guide/issue-triage/ |
/priority important-longterm |
/remove-priority backlog |
d94a7fb
to
4aa51f5
Compare
(merge conflicts) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
LGTM label has been added. Git tree hash: a29a3294e7e615092f744c3d79c4faaef410f1cc
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: intUnderflow, SergeyKanzhelev The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/retest |
PR needs rebase. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs. This bot triages PRs according to the following rules:
You can:
Please send feedback to sig-contributor-experience at kubernetes/community. /close |
@k8s-triage-robot: Closed this PR. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/kind feature
/kind api-change
/sig node
/hold
What this PR does / why we need it:
This PR introduces a new event that is emitted by the probe worker when the result of the probe changes. End-users can watch this new event to learn when their probes entered the Sucess or Failure state and in the case of Failure they gain some contextual information in the form of the output from the last probe.
Fixes #115823 by dealing with the lack of information on when a probe fails (as users can now just look at this event)
Does this PR introduce a user-facing change?