Integrate genomics libraries in Spark support #26

Open
heuermh opened this issue Apr 4, 2022 · 1 comment

heuermh commented Apr 4, 2022

Hello, congrats on the recent blog post on the AWS HPC Blog!

I intend to look more closely at your Spark support to see how it works. While I imagine it is straightforward to use Spark-based tools via command-line calls, it might be interesting to integrate these libraries and tools into the redun APIs directly.

@mattrasmus
Collaborator

Thanks @heuermh for the ideas and links. It's very helpful.

Hopefully, many of these tools can be used within a redun task as-is. One design goal of redun was to make very few assumptions about what goes on inside a task function ("no dirty tricks") in order to be as compatible as possible with other data science libraries.

But I'll keep an eye out for opportunities for a more meaningful integration between redun and other Spark libs.
