Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Notification/alerting system in Modernisation Platform #6817

Open
5 tasks
ewastempel opened this issue Apr 17, 2024 · 2 comments
Open
5 tasks

Notification/alerting system in Modernisation Platform #6817

ewastempel opened this issue Apr 17, 2024 · 2 comments
Labels
alerting member request Feature requested by a member to enhance the platform experience

Comments

@ewastempel
Copy link
Contributor

ewastempel commented Apr 17, 2024

User Story

As a Modernisation Platform member user
I want to be notified about health of resources or upcoming events that need actioning (e.g. expiring certificates)
So that I can react and fix/prevent an issue.

Value / Purpose

Healthy application/system means no outages.

Additional Information

This want came as an ask channel request and it is to look if we can implement a new one or reuse our existing alerting system that can be easily consumed by the members.

Currently MP alerting workflow is made of: CloudWatch -> SNS -> PagerDuty -> Slack and using this solution is documented here.

This ticket is to remove the need of PagerDuty acting as a mid-man and to integrate with a variety of resources (CloudWatch, EventBridge, SNS) rather than being limited to one only (although it could start with one and then build on it).

The user that requested this, suggested EventBridge → SNS → e-mail → Slack as an approach described in here, which could be considered.

Definition of Done

  • New alerting system is implemented (ideally as a module) and it does not use PagerDuty
  • If applicable, tests are implemented
  • User docs have been updated
  • Another team member has reviewed
  • Pipeline runs green
@dms1981
Copy link
Contributor

dms1981 commented May 9, 2024

Is this potentially too broad? Is this ticket meant to cover the creation of a new alerting/notification module that we can use, or a one-off to cover alerting when certificates are reaching their expiration date that could later be extended to replace PagerDuty as a middleman?

Is this something that customers are presently empowered to do without us being involved?

@dms1981 dms1981 added the member request Feature requested by a member to enhance the platform experience label May 9, 2024
@dms1981
Copy link
Contributor

dms1981 commented May 9, 2024

As you noted in Slack, @ewastempel , maybe this is a better fit for enrolling with Observability Platform and getting the information through there?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
alerting member request Feature requested by a member to enhance the platform experience
Projects
Status: To Do
Development

No branches or pull requests

3 participants