Iterate through all revisions to find first good revision before bisecting. #3934

letitz · 2024-04-09T14:59:52Z

This change builds on #3927 to change the way the regression task operates.

GitHub does not support stacked PRs, so to review only the changes in this PR, this link should work: https://github.com/google/clusterfuzz/pull/3934/files/1bdca4468269ee897b5145f6dc24c395b6eb4d2d..dc74698caba2f0ce6512eca69d93d41d1692aef7

The overall goal is to be more persistent when looking for the first good revision to start bisecting from: instead of giving up after the first N revisions, try all of them. This may take too long for a single task, so the search respects the task deadline and can be resumed by a subsequent regression task. The regression task already supports resuming timed-out regressions later, as this is how the core bisection works.

Previously, the regression task would start its first run (detected by the absence of last_regression_min or last_regression_max metadata on the testcase) by:

- checking whether the regression happened in the last N revisions and, if so, ending early with the regression range found
- checking whether one of the first N revisions was good, and:
  - if not, ending early with an error (this is what is changing)
  - if so, and the first good revision crashed, ending early, having found the regression range to be [0, min)
  - if so, and the first good revision did not crash, going on to bisect

This change does a few things:

- splits the notion of "first run" into two, checking separately for last_regression_max and last_regression_min
- when last_regression_max is absent from the testcase:
  - checks the latest N revisions for a regression
  - updates last_regression_max on the testcase to record that this check ran
- when last_regression_min is absent from the testcase:
  - checks all revisions sequentially to find the first good revision
  - updates last_regression_min along the way to record progress
  - times out if this takes too long
- side benefit: since the progress made in these two steps is now recorded, we can avoid:
  - testing the same revision in both check_latest_revisions and find_earliest_good_revision
  - bisecting revisions that were already tested in check_latest_revisions or find_earliest_good_revision
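
To make the resumable search concrete, here is a minimal sketch of the idea, not the code in this PR. Only the names find_earliest_good_revision and last_regression_min come from the description above; the Outcome enum, the plain metadata dict, the is_bad_build and crashes callbacks, and the injectable clock are all hypothetical stand-ins. How the real task tells an unfinished search apart from a finished one is not covered here, so the sketch simply resumes after whatever revision was last recorded and leaves it to the caller to stop calling once a result is returned.

```python
import time
from enum import Enum
from typing import Callable, Dict, List


class Outcome(Enum):
  """Hypothetical result type for the sketch."""
  FOUND_GOOD = "found_good"  # Good build that does not crash: bisect from here.
  REGRESSED_AT_START = "regressed_at_start"  # Earliest good build crashes.
  TIMED_OUT = "timed_out"  # Deadline reached: resume in a later task.
  NO_GOOD_BUILD = "no_good_build"  # Every revision had a bad build.


def find_earliest_good_revision(
    revisions: List[int],
    metadata: Dict[str, int],
    is_bad_build: Callable[[int], bool],
    crashes: Callable[[int], bool],
    deadline: float,
    clock: Callable[[], float] = time.monotonic,
) -> Outcome:
  """Scans revisions in order, recording progress so a later run can resume."""
  # Assumption: a previous, timed-out run stored the last revision it tested.
  last_tested = metadata.get("last_regression_min")
  start = 0 if last_tested is None else revisions.index(last_tested) + 1

  for revision in revisions[start:]:
    # Respect the task deadline before starting on another revision.
    if clock() >= deadline:
      return Outcome.TIMED_OUT

    bad = is_bad_build(revision)
    # Record progress whether or not the build was usable.
    metadata["last_regression_min"] = revision
    if bad:
      continue

    # First good build: either it already crashes or bisection starts here.
    if crashes(revision):
      return Outcome.REGRESSED_AT_START
    return Outcome.FOUND_GOOD

  return Outcome.NO_GOOD_BUILD
```

Injecting the clock keeps the deadline logic testable without sleeping, which is also roughly what a forward-progress test after a timeout would need: run once with a short deadline, check that last_regression_min advanced, then run again and check that the search continues from there.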

TODO: An integration test of some kind that checks that forward progress is made after timeouts.

letitz · 2024-04-09T15:05:00Z

@alhijazi PTAL while I work out how to write some kind of integration test.

letitz · 2024-04-12T12:48:19Z

Thanks! On to @jonathanmetzman, who may have suggestions on how and where to write an overarching test.

letitz added 10 commits April 9, 2024 13:41

Refactor found_regression_near_extreme_revisions.

bee4397

Document build_data_list behavior.

05afa49

Test that check_latest_revisions skips bad builds.

48ee4e9

Remove unused test helper.

1bdca44

Rename check_earliest_revisions to find_earliest_good_revision.

b2fed2b

Search through all revisions.

877a429

Set last_regression_max in check_latest_revisions.

b4eda63

Add deadline to find_earliest_good_revision().

677adba

Add mock deadline to tests.

8634386

Add tests for find_earliest_good_revision.

dc74698

letitz requested a review from alhijazi April 9, 2024 15:03

letitz mentioned this pull request Apr 12, 2024

Refactor regression_task.found_regression_near_extreme_revisions. #3927

Open

alhijazi approved these changes Apr 12, 2024

letitz requested a review from jonathanmetzman April 12, 2024 12:48

jonathanmetzman force-pushed the master branch 2 times, most recently from e7e91a0 to 22e1108 Compare May 28, 2024 07:18
