-
Notifications
You must be signed in to change notification settings - Fork 45
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Parallelize ApplyByCols and ApplyToRows #40
Labels
Comments
Also, would you mind separating this into two distinct issues? |
Hi! I would like to make a new issue for the No.2 and work on it if its okay :) |
Yeah, definitely! Go fo it and let me know how it goes! :) |
shaypal5
changed the title
Parallel and timed apply
Parallelized ApplyByCols and ApplyToRows
Nov 15, 2021
shaypal5
changed the title
Parallelized ApplyByCols and ApplyToRows
Parallelize ApplyByCols and ApplyToRows
Nov 15, 2021
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
First of all, thanks for this amazing library, really useful. Now going to the issue / request:
It seems to me that ApplyByCols and ApplyToRows could have a parameter to make them run in parallel. Has this ever been considered as a feature? I think it could speed up pipelines quite a lot, esp. useful for big dataframes. WDYT?
---- Edit by @shaypal5 : added in v0.0.67 -----
Also, I see that pipeline.fit and pipeline.transform have the
timed
bool. Would it be possible to add the same for apply? I know I can do pipeline.transform(...., time=True), but don't see a reason why apply cannot have itThe text was updated successfully, but these errors were encountered: