Unable to fine-tune previously trained transformer based spaCy NER. #13709

Open
jlustgarten opened this issue Dec 6, 2024 · 1 comment

@jlustgarten

How to reproduce the behaviour

Use spaCy to fine-tune a base model with a transformer from Hugging Face:
python -m spacy train config.cfg --output ./output --paths.train ./train.spacy --paths.dev ./dev.spacy

Collect newly tagged entries into new train and dev sets, and in a new config set the model location to output/model-last from the first run:
python -m spacy train fine_tune_config.cfg --output ./fine_tune_output --paths.train ./newtrain.spacy --paths.dev ./newdev.spacy

You will get an error about a missing config.json. Even after adding that file, training fails with an error about a missing tokenizer.
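
Since both errors are about files that cannot be found, it may help to list what the saved pipeline directory actually contains. The following is a minimal diagnostic sketch using only the Python standard library. It is not part of the original reproduction, the helper names `list_model_files` and `find_named` are hypothetical, and it assumes the first training run wrote its output to ./output/model-last as in the command above.

```python
from pathlib import Path


def list_model_files(model_dir):
    """Return the relative paths of all files under model_dir, sorted."""
    root = Path(model_dir)
    return sorted(
        p.relative_to(root).as_posix() for p in root.rglob("*") if p.is_file()
    )


def find_named(model_dir, filename):
    """Return the relative paths of every file called `filename` under model_dir."""
    return [f for f in list_model_files(model_dir) if Path(f).name == filename]


if __name__ == "__main__":
    # Assumed location: the --output directory of the first training command.
    model_dir = "./output/model-last"
    if Path(model_dir).is_dir():
        for rel in list_model_files(model_dir):
            print(rel)
        # The first error mentions config.json, so check whether one exists anywhere.
        print("config.json found at:", find_named(model_dir, "config.json"))
    else:
        print(f"{model_dir} does not exist")
```

Including this listing in the report would show whether the directory has the layout that the loader named in the error expects.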

Your Environment

  • Operating System: Windows 11
  • spaCy version: 3.7.2
  • Platform: Linux-5.15.167.4-microsoft-standard-WSL2-x86_64-with-glibc2.35
  • Python version: 3.10.13
@jlustgarten

Apologies, I didn't mean to open it twice!
