-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How is the 'variable.importance' calculated in the rpart package? #43
Labels
help wanted
Extra attention is needed
Comments
In the vignette
An overall measure of variable importance is the sum of the goodness of split measures for each split for which it was the primary variable, plus goodness * (adjusted agreement) for all splits in which it was a surrogate. In the printout these are scaled to sum to 100 and the rounded values are shown, omitting any variable whose proportion is less than 1%. Imagine two variables which were essentially duplicates of each other; if we did not count surrogates they would split the importance with neither showing up as strongly as it should.
…________________________________
From: WANG Zhiwei ***@***.***>
Sent: Tuesday, July 5, 2022 2:37 AM
To: bethatkinson/rpart ***@***.***>
Cc: Subscribed ***@***.***>
Subject: [EXTERNAL] [bethatkinson/rpart] How is the 'variable.importance' calculated in the rpart package? (Issue #43)
In xgboost, there are several importance types, including weight’, ‘gain’, ‘cover’, ‘total_gain’, and ‘total_cover’. I wonder how rpart calculates importance score.
—
Reply to this email directly, view it on GitHub<#43>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ACWQG53AD2YK5AMEHH2Y6VLVSPQ2ZANCNFSM52VG2DZA>.
You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>
|
Thank you very much! |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
In xgboost, there are several importance types, including weight’, ‘gain’, ‘cover’, ‘total_gain’, and ‘total_cover’. I wonder how rpart calculates importance score.
The text was updated successfully, but these errors were encountered: