
What pretrained language models are clearly better than BERT and RoBERTa? #85

Open
guotong1988 opened this issue Jan 20, 2021 · 0 comments


guotong1988 commented Jan 20, 2021

BERT is described in the paper "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding".
RoBERTa is described in the paper "RoBERTa: A Robustly Optimized BERT Pretraining Approach".
Three years have now passed. Is there any pretrained language model that surpasses them on most tasks, under the same or comparable resources?
A speedup without a decrease in accuracy also counts as an improvement.
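
For concreteness on the "speedup without accuracy decreasing" criterion, here is a minimal sketch of how one could compare parameter count and raw forward-pass latency between BERT and a candidate model. It assumes the Hugging Face transformers library (not mentioned above), and DistilBERT is used purely as an illustrative candidate; any model checkpoint could be swapped in. Accuracy on downstream tasks would still need a separate fine-tuning comparison.

```python
import time

import torch
from transformers import AutoModel, AutoTokenizer

# Illustrative comparison: BERT-base vs. DistilBERT (hypothetical choice of
# candidate model; substitute any checkpoint under test).
for name in ["bert-base-uncased", "distilbert-base-uncased"]:
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModel.from_pretrained(name)
    model.eval()

    inputs = tokenizer("A sample sentence for timing.", return_tensors="pt")

    # Time repeated forward passes without gradient tracking.
    with torch.no_grad():
        start = time.perf_counter()
        for _ in range(100):
            model(**inputs)
        elapsed = time.perf_counter() - start

    n_params = sum(p.numel() for p in model.parameters())
    print(f"{name}: {n_params / 1e6:.0f}M params, "
          f"{elapsed / 100 * 1000:.1f} ms per forward pass")
```

A lower per-pass latency and smaller parameter count at matched task accuracy would count as "better" in the sense asked above.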
