This project uses datasets from the following projects:
- 日英中基本文データ (Japanese-English-Chinese Basic Sentence Data)
- BPersona-chat
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation
- FLORES-200
- JParaCrawl
- Japanese-English Bilingual Corpus of Wikipedia's Kyoto Articles
- Japanese-English Legal Parallel Corpus
- ParaNatCom
- No Language Left Behind
- 日英対訳文対応付けデータ (Japanese-English Translation Alignment Data)
- OpenSubtitles
- Alignment of Reuters Corpora
- Tatoeba
- TED2020
- Japanese WordNet

Citations and license information are available in the app's help modal, which is generated from sources.tsx. If you would like any of this data taken down, please contact me by email or via GitHub.