-
-
Notifications
You must be signed in to change notification settings - Fork 8.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Correctness of visualization #10829
Comments
Hi, I still need to look into the source code of either project.
Please help elaborate on what this means. Is there a histogram when plotting a tree? And this histogram has some data in it, and this data can be split by a (node split) feature in the root node? If you want to compare against xgboost's own plot tree function, you can dump the tree in the |
Ah, that's much clearer! Thank you for sharing. I will look into it after sorting out some of the on going work here |
Hello,
I'm developing a library for decision tree visualization https://github.com/mljar/supertree and would appreciate feedback on whether my visualization approach for XGBoost is correct. I've compared my library with dtreeviz, and in dtreeviz, the data in the histogram appears to be split at each node according to the feature from the root node (based on my observations). In contrast, my implementation splits the data according to the feature extracted from the respective node in booster.get_dump().
I would greatly appreciate it if you could provide guidance on the correct visualization approach for your library.
Code from my compare notebook:
The text was updated successfully, but these errors were encountered: