-
Notifications
You must be signed in to change notification settings - Fork 293
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Investigate high Change Failure Rate #6989
Comments
To start on this I will look at issues on the team channel that indicate failures. This may require additional issues being raised as the calls are often different but in some cases an iterative approach may be needed - fix one, a different issue, fix that, yet another issue etc. Unlikely to start on this until either later tomorrow or Friday (milk monitor duties could impact this). |
One example is the daily Terraform Static Code Analysis which fails daily. There is a link on the modernisation platform workflow status which may provide additional information |
Change made to the static workflow check has worked and this no longer produces errors |
No other obvious regular errors popping up. |
@SteveLinden this is the code that checks for failures https://github.com/ministryofjustice/dora-the-explora/blob/main/cfr.py |
On a regular basis the dependabot changes fail on the Go tests. There are various reasons (mainly Error: No valid credential sources found) but due to the infrequency of these failures I have decided to ignore them here. Working through and I'll list those that need checks here.... Secure Code Analysis on modernisation-platform-terraform-ecs-cluster - 12 checkov failures Secure Code Analysis on https://github.com/ministryofjustice/modernisation-platform-terraform-s3-bucket/actions/runs/9184139276/job/25255974159 - unknown cause - sent to team Sent to the team modernisation-platform-security is regularly failing due to a feature existing that is not supported in this repository. An example is in here |
Apart from. the issues on the Go test, which can be alleviated by connecting, closing the PR and re-opening it and then running the job, there do not appear to be many issues other than those listed above. One to two of these have been fixed. |
Some issues were highlighted with closing this and they are listed below Following our discussions I did a check of the repo (thanks
There were also 5 routines that showed errors - first one was 4 days ago but the other more recent.
Asked the team for opinions on raising calls for the 5 above so additional ones may be raised. |
User Story
As a Product Owner
I want to invest time in investigating our failing changes
So that we improve the quality of our Change Failure Rate (CFR)
Value / Purpose
A high CFR isn't necessarily indicative of degraded services or poor quality changes; by investing time in looking at our commonly-failing changes we can make this metric more meaningful.
Useful Contacts
No response
Additional Information
No response
Proposal / Unknowns
Definition of Done
The text was updated successfully, but these errors were encountered: