-
-
Notifications
You must be signed in to change notification settings - Fork 315
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OHE: allow encoding of specific, user desired categories #303
Comments
Hi @solegalli, This looks interesting! Are you envisioning that OHE allows the following functionality?
|
@solegalli, should this task be closed? It seems like task #403 resolved this issue |
#403 is in essence asking for the same functionality. That's probably why I closed it. I flagged it as duped now. Still open. |
@solegalli, resurrecting this issue ;) When someone selects this functionality, do we want to limit the user to one variable? I imagine that the user will select values that are specific to one variable. It seems odd for multiple categorical variables to have the same values. |
I think the most straight forward would be to add a new parameter, or perhaps even better, extend top_categories to take a dictionary with the variable as key and the categories to encode as values. Then, for each variable, the transformer will create dummies only for the categories indicated by the user. Will you pick this one up? |
I like the idea of using a dictionary. However, I'm unsure if the dictionary should be accepted by Would it be cleaner to have a separate param called |
Sounds good to me! |
As per this thread, the user may want to encode certain categories, that may not be the most frequent.
The text was updated successfully, but these errors were encountered: