You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
ksy36
changed the title
Gather stats and experiment with lowering confidence threshold for bugbug prediction
Analyze ml prediction result and possibly lower confidence threshold for bugbug prediction
Jun 6, 2022
ksy36
changed the title
Analyze ml prediction result and possibly lower confidence threshold for bugbug prediction
Analyze ml prediction results and possibly lower confidence threshold for bugbug prediction
Jun 6, 2022
To analyze whether it makes sense to lower the confidence from 97% to 95%, I've searched for issues that would be considered false positives with a confidence of 95%, but lower than 97%. That would be issues that predicted as invalid with confidence from 0.95-0.97, but ended up being valid (had a needsdiagnosis or moved milestone).
There are 13 issues that fall into this criteria since we've started recording stats 3 weeks ago:
Total number of anonymous issues that was filed within this time frame is 906. Also around 450 of those were classified as invalid and closed (with the confidence threshold is > 97%).
It likely doesn't make sense to lower the confidence threshold at the moment, despite the fact that the number of false positives is low, relative to the total number of anonymous reports. Every such report is valuable and 13 missed reports within 3 weeks would be undesirable for us (in addition to possible false positives at a current confidence threshold, see mozilla/webcompat-team-okrs#256 (comment)).
Current confidence threshold is 97%:
with this confidence the model is able to find 53% of the issues and wrong 1% of the time (precision 99).
Let's analyze stats that were added in #3701 and make a decision whether we would benefit from lowering the confidence threshold to 95%.
With 95% confidence the model would find higher percent of issues (69%) but will have a bit lower precision (2% error instead of 1%).
The text was updated successfully, but these errors were encountered: