Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use tf.data API to build flexible and high performance data pipeline #829

Open
2 tasks done
luozhouyang opened this issue Jun 6, 2020 · 1 comment
Open
2 tasks done

Comments

@luozhouyang
Copy link

  • I checked to make sure that this is not a duplicate issue
  • I'm submitting the request to the correct repository (for model requests, see here)

Is your feature request related to a problem? Please describe.

We need a more flexible and powerful data pipeline when training on very large corpus.

Describe the solution you'd like

Use tf.data API to build the high performance and flexible data pipeline.

@faneshion
Copy link
Member

Do you mean the tf.data can not handle large-scale dataset? Did you try the MatchZoo-py version?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants