Use `tf.data` API to build flexible and high performance data pipeline #829

luozhouyang · 2020-06-06T12:36:44Z

I checked to make sure that this is not a duplicate issue
I'm submitting the request to the correct repository (for model requests, see here)

We need a more flexible and powerful data pipeline when training on very large corpus.

Use tf.data API to build the high performance and flexible data pipeline.

The text was updated successfully, but these errors were encountered:

faneshion · 2020-09-20T07:04:01Z

Do you mean the tf.data can not handle large-scale dataset? Did you try the MatchZoo-py version?

luozhouyang added the enhancement label Jun 6, 2020

Provide feedback