Alerts verbosity fix by MohamedBilelBesbes · Pull Request #9362 · mozilla/treeherder

MohamedBilelBesbes · 2026-04-01T17:21:19Z

Upon deploying the voting system into the testing environment, it gave so many alerts. Particularly, it gave too many alerts related to the priority voting system and few ones related to the equal voting system. Upon investigating why this is happening, it boiled down to these factors:

The hyper parameters used in the methods in the voting system are the default hyper parameters and not the ones that yields the best performance, so it makes sense to have so many alerts because such hyper parameters configuration is too sensitive.
Here, the alert generation function is being called twice, the first time for the priority voting and the sdecond time for the equal voting. So chronologically, priority voting check is occurring and producing the (verbose) alerts. Once the equal voting occurs, it gets limited visiblity on the time series to analyze for change detection because it is constrained to use measurements of revisions pushed after the creation of the latest alert (the ones that were created ealier by the priority voting) because of this part of the code.

In order to fix these issues, we do the following:

Methods hyper parameter fixing: The methods were first tuned individually by testing different hyperparameters, such as the sizes of backward and forward windows, to achieve the best performance for each one. However, upon incorporating those into the voting system, we need to select a set of hyper-parameter configuration that has a fixed forward and back windows across all methods. This is because having different windows values for different methods makes them fundamentally not evaluate the same set of data, which is inconsistent. To address this, the backward and forward window values from the Student T Test were fixed and applied to all methods, while only the confidence and magnitude-related hyperparameters were further tuned. Upon incorporating such setup, Levene method has been deteriorating the quality of the voting system by throwing false alerts. Indeed, initial investigation showed that it is the worse performing method on both precision (minimizing false alerts) and recall (minimizing missed alerts). So we ditch it from the voting system for now. However, we keep its implementation in case further tuning is done on it, maybe it is just constrained by the spoace of the hyper parameters we used in our experimentation and that is why it is giving bad perofrmance.
Adding the detection methods as part of the PerformanceAlertTesting lookup: This will allow multiple voting systems to detect alerts at the same time without affecting each other's workflow.

gmierz

Just a couple minor fixes :)

treeherder/perf/alerts.py

MohamedBilelBesbes requested review from beatrice-acasandrei and esanuandra as code owners April 1, 2026 17:21

gmierz requested changes Apr 2, 2026

View reviewed changes

treeherder/perf/alerts.py Outdated Show resolved Hide resolved

treeherder/perf/alerts.py Outdated Show resolved Hide resolved

Alerts verbosity fix

a927329

MohamedBilelBesbes force-pushed the VotingFix branch from 411dc8c to a927329 Compare April 2, 2026 19:34

MohamedBilelBesbes requested a review from gmierz April 2, 2026 19:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Alerts verbosity fix#9362

Alerts verbosity fix#9362
MohamedBilelBesbes wants to merge 1 commit intomozilla:masterfrom
MohamedBilelBesbes:VotingFix

MohamedBilelBesbes commented Apr 1, 2026 •

edited

Loading

Uh oh!

gmierz left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

MohamedBilelBesbes commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gmierz left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

MohamedBilelBesbes commented Apr 1, 2026 •

edited

Loading