
How to directly use word embedding from the pre-trained LM during training and inference? #7

Open
Fishersponge opened this issue Jul 15, 2020 · 4 comments

Comments

@Fishersponge commented Jul 15, 2020

If I have a 'train.mdb', how can I use the fastText pre-trained model cc.en.300.bin? I see nothing about fastText in your ./models, trainers.py, or main.py. Looking forward to your answer, thanks!

@Pay20Y (Owner) commented Jul 26, 2020

Hi, please refer to create_all_synth_lmdb.py and modify the dataloader accordingly.
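For the dataloader route, a minimal sketch might look like the following, assuming the official `fasttext` Python bindings and an existing LMDB dataset whose `__getitem__` returns an (image, label) pair; the wrapper class name here is hypothetical and the repository's actual loader will differ:

```python
# Minimal sketch: attach a fastText word embedding to each sample on the fly.
# `base_dataset` is assumed to be the repo's existing LMDB dataset whose
# __getitem__ returns an (image, label) pair; the wrapper name is hypothetical.
import fasttext
import numpy as np
import torch
from torch.utils.data import Dataset

class EmbeddingLmdbDataset(Dataset):
    def __init__(self, base_dataset, fasttext_path="cc.en.300.bin"):
        self.base = base_dataset
        self.ft = fasttext.load_model(fasttext_path)  # 300-d pre-trained vectors

    def __len__(self):
        return len(self.base)

    def __getitem__(self, index):
        image, label = self.base[index]
        # fastText builds vectors from subword n-grams, so OOV labels still work.
        vec = self.ft.get_word_vector(label.lower()).astype(np.float32)
        return image, label, torch.from_numpy(vec)
```

The embedding can then serve as the supervision target for the semantic module, with the base LMDB left untouched.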

@Ma01180724

Hello, I have the same question. We have the NIPS 2014 and CVPR 2016 datasets (LMDB); how can we use the fastText pre-trained model? Can you help? Thanks.

@Ma01180724

@Pay20Y, could you share the datasets that you have prepared?

@Pay20Y (Owner) commented Aug 2, 2020

@Ma01180724 Hi, I'm sorry, but I can't share the training datasets directly because of their large size. There are two ways to prepare them yourself. First, you can modify create_all_synth_lmdb.py so that it loads the labels from MJ and ST and then generates new LMDB datasets with embedding labels. Second, as mentioned above, you can modify the dataloader to generate the corresponding word embeddings from the recognition labels during training.
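To make the first option concrete, here is a hedged sketch of what a modified create_all_synth_lmdb.py might do. The paths are placeholders, the `embed-%09d` output key is an assumption, and the `num-samples`/`label-%09d` keys follow the usual scene-text LMDB convention and should be checked against the actual script:

```python
# Sketch of the first option: read labels from an MJ/ST LMDB and write a new
# LMDB that also stores each label's fastText embedding. The key layout
# ('num-samples', 'label-%09d') follows the common scene-text LMDB convention;
# the 'embed-%09d' key is hypothetical, and image records are omitted for brevity.
import fasttext
import lmdb
import numpy as np

ft = fasttext.load_model("cc.en.300.bin")

src = lmdb.open("path/to/mj_or_st_lmdb", readonly=True, lock=False)
dst = lmdb.open("path/to/output_lmdb", map_size=1 << 40)

with src.begin() as rtxn, dst.begin(write=True) as wtxn:
    num_samples = int(rtxn.get(b"num-samples"))
    for i in range(1, num_samples + 1):
        label = rtxn.get(b"label-%09d" % i).decode("utf-8")
        vec = ft.get_word_vector(label.lower()).astype(np.float32)
        wtxn.put(b"label-%09d" % i, label.encode("utf-8"))
        wtxn.put(b"embed-%09d" % i, vec.tobytes())  # raw 300-float payload
    wtxn.put(b"num-samples", str(num_samples).encode("utf-8"))
```

Pre-computing the embeddings trades disk space for faster training, since the fastText model no longer needs to be loaded in every dataloader worker.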
