stage 2 training problem #180

ruiyeNLP · 2023-01-10T17:42:13Z

In stage 2, four datasets are used for training, as described in the paper. This may be a basic question: are the four datasets trained on all together, or one by one? If they are trained on together, can I simply combine them with the 'cat' command?
Looking forward to your reply.

gotutiyan · 2023-01-15T14:56:12Z

Typically, the four datasets are used together (whether for GECToR or for seq2seq models).
If the datasets were used one by one, it would amount to four-stage training. However, since the GECToR paper contains no such description, it is natural to assume they are used together.

skurzhanskyi · 2023-01-17T10:10:55Z

@gotutiyan is right. We used all four datasets together by mixing them (concatenating and shuffling).
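
For readers who want to reproduce the mixing step, here is a minimal sketch of "concatenating and shuffling" for parallel data. It is not the authors' script. It assumes each dataset is stored as a pair of line-aligned source/target text files, and the helper names (`read_parallel`, `mix_datasets`, `write_parallel`) and any file names are hypothetical. The point it illustrates is that plain `cat` handles the concatenation, but the shuffle has to move each source line together with its target line, so the pairs are shuffled as units.

```python
import random
from pathlib import Path


def read_parallel(src_path, tgt_path):
    """Read a line-aligned source/target file pair into a list of (src, tgt) tuples."""
    src = Path(src_path).read_text(encoding="utf-8").splitlines()
    tgt = Path(tgt_path).read_text(encoding="utf-8").splitlines()
    if len(src) != len(tgt):
        raise ValueError(f"{src_path} and {tgt_path} have different line counts")
    return list(zip(src, tgt))


def mix_datasets(datasets, seed=0):
    """Concatenate several lists of (src, tgt) pairs and shuffle them as units."""
    mixed = [pair for dataset in datasets for pair in dataset]
    # A dedicated Random instance keeps the shuffle reproducible for a given seed.
    random.Random(seed).shuffle(mixed)
    return mixed


def write_parallel(pairs, src_out, tgt_out):
    """Write the pairs back out as two line-aligned files."""
    with open(src_out, "w", encoding="utf-8") as fs, \
         open(tgt_out, "w", encoding="utf-8") as ft:
        for src, tgt in pairs:
            fs.write(src + "\n")
            ft.write(tgt + "\n")
```

Usage would look like calling `read_parallel` once per dataset, passing the resulting lists to `mix_datasets`, and writing the result with `write_parallel` before running the usual preprocessing. Shuffling the source and target files independently (for example with two separate shell commands) would break the alignment.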
