Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update the udpipe model #121

Open
mayeulk opened this issue Oct 21, 2023 · 4 comments
Open

Update the udpipe model #121

mayeulk opened this issue Oct 21, 2023 · 4 comments

Comments

@mayeulk
Copy link

mayeulk commented Oct 21, 2023

The most recent english-ewt-ud updpipe model accessible from the R package is 2.5:
english-ewt-ud-2.5-191206.udpipe
Is it due to incompatibility of 2.6 and later versions with the R package? (models>2.5 are for Udpipe 2).
Can English versions 2.6 or later be used?
I found issues in 2.5 that are corrected in later versions:
For instance, the lemma for token "whistle" is "whisle" (without "t").
Checking with http://lindat.mff.cuni.cz/services/udpipe/ , v. 2.6 correctly returns the lemma "whistle".

@jwijffels
Copy link
Contributor

jwijffels commented Oct 21, 2023

You can train your own models on more recent data from universal dependencies with this R package. These models are 'udpipe 1' models and you can train them on any version of data of universal dependencies.
Documentation of how to do that is put at

@jwijffels
Copy link
Contributor

You can train your own models on more recent data from universal dependencies with this R package.
Documentation of how to do that is put at

@mayeulk
Copy link
Author

mayeulk commented Oct 30, 2023

Thank you for this!
(I was hoping to be able to use more recent pre-trained model out of the box).

@locusclassicus
Copy link

Hi, many thanks for your package. I have the same question concerning the Latin models. Training a model on one's own requires rather advanced NLP-skills, so it would really be useful to have newer (2.13 or 2.14) models pretrained, if possible.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants