You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The ale plot should not be anchored at the midpoint of the bucket. This is because the ALE value represents the average change in response from the bottom of the bucket to the top.
As a simple example:
If each bucket, the average local effect in each bucket, and the observation weight (usually number of observations) are as follows:
bucket
average_local_effect
weight
[0, 1]
22
15
(1, 2]
36
25
(2, 3]
-10
35
(3, 4]
-41
25
Then the (non-centered) ALE will be:
bucket_edge
ALE
0
0
1
22
2
58
3
48
4
7
This is because 22 is the change in prediction between 0 and 1, 36 is the change between 1 and 2, and so on.
We then center with the constant -1.45 (from average_local_effect @ weight / sum(weight)) to get the final ALE:
Does my updated implementation match the original implementation in this regard? It would be great to get a second opinion on this before I invest the time to prepare a pull request (many of my changes are probably not very relevant in general).
The ale plot should not be anchored at the midpoint of the bucket. This is because the ALE value represents the average change in response from the bottom of the bucket to the top.
As a simple example:
If each bucket, the average local effect in each bucket, and the observation weight (usually number of observations) are as follows:
Then the (non-centered) ALE will be:
This is because 22 is the change in prediction between 0 and 1, 36 is the change between 1 and 2, and so on.
We then center with the constant -1.45 (from
average_local_effect @ weight / sum(weight)
) to get the final ALE:See the original implementation here.
The text was updated successfully, but these errors were encountered: