Mistaken tagging in Portuguese #2862
Labels
lang / pt
Portuguese language data and models
models
Issues related to the statistical models
perf / accuracy
Performance: accuracy
Hi,
Try a simple Portuguese sentence such as "Os alunos amam os livros" (The students love the books). The token.pos_ values for its five tokens are 'DET', 'SYM', 'VERB', 'DET', 'SYM', where SYM stands for symbol, so both nouns ("alunos" and "livros") are tagged as symbols. I checked the tag_map.py file for Portuguese and found some 17 entries where SYM is assigned incorrectly; all of them should be NOUN.
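One way to spot such entries is to scan the tag map for noun-like tags that resolve to SYM. The snippet below is only a minimal sketch: `SAMPLE_TAG_MAP`, its keys, the `find_suspect_entries` and `fix_entries` helpers, and the assumption that noun tags start with an `N` field are all hypothetical and not copied from tag_map.py, which uses spaCy symbols rather than plain strings.

```python
# Hypothetical excerpt in the style of a tag map: fine-grained tag -> coarse POS.
# Keys and values are illustrative only; they are NOT taken from tag_map.py.
SAMPLE_TAG_MAP = {
    "ART|M|P|@>N": {"pos": "DET"},
    "N|M|P|@SUBJ>": {"pos": "SYM"},   # assumed mistake: noun tag mapped to SYM
    "N|M|P|@<ACC": {"pos": "SYM"},    # assumed mistake: noun tag mapped to SYM
    "V|PR|3P|IND|@FMV": {"pos": "VERB"},
    "N|F|S|@<ACC": {"pos": "NOUN"},
}


def find_suspect_entries(tag_map, prefix="N", wrong_pos="SYM"):
    """Return, sorted, the tags whose first '|'-separated field equals
    `prefix` but whose coarse POS is `wrong_pos`."""
    return sorted(
        tag
        for tag, attrs in tag_map.items()
        if tag.split("|")[0] == prefix and attrs.get("pos") == wrong_pos
    )


def fix_entries(tag_map, suspects, right_pos="NOUN"):
    """Return a copy of `tag_map` with the suspect tags remapped to `right_pos`.
    The input mapping is left unchanged."""
    fixed = {tag: dict(attrs) for tag, attrs in tag_map.items()}
    for tag in suspects:
        fixed[tag]["pos"] = right_pos
    return fixed


suspects = find_suspect_entries(SAMPLE_TAG_MAP)
fixed = fix_entries(SAMPLE_TAG_MAP, suspects)
print(suspects)
```

Run against the real tag map (with the comparison adapted to its actual key format and symbols), a check like this should list the entries that need to change from SYM to NOUN.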
Best,
Ricardo