Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Downcasing of derived proper nouns broken by hyperminimisation ( #91

Open
albbas opened this issue Jan 11, 2016 · 3 comments
Open

Downcasing of derived proper nouns broken by hyperminimisation ( #91

albbas opened this issue Jan 11, 2016 · 3 comments
Assignees
Labels
bug Something isn't working low priority

Comments

@albbas
Copy link
Contributor

albbas commented Jan 11, 2016

This issue was created automatically with bugzilla2github

Bugzilla Bug 2145

Date: 2016-01-11T14:22:44+01:00
From: Sjur Nørstebø Moshagen <<sjur.n.moshagen>>
To: Sjur Nørstebø Moshagen <<sjur.n.moshagen>>
CC: elena.j.paulsen, sjur.n.moshagen, thomas.omma, trond.trosterud

Last updated: 2018-05-09T12:52:55+02:00

@albbas
Copy link
Contributor Author

albbas commented Jan 11, 2016

Comment 11052

Date: 2016-01-11 14:22:44 +0100
From: Sjur Nørstebø Moshagen <<sjur.n.moshagen>>

./configure --with-hfst --enable-hyperminimisation

gives the following:

[ 1/18][FAIL] Narvijkka+N+Prop+Sem/Plc+Sg+Gen+Der/k+N+Sg+Nom => Missing results: narvijkak
[ 1/18][FAIL] Narvijkka+N+Prop+Sem/Plc+Sg+Gen+Der/k+N+Sg+Nom => Unexpected results: Narvijkak

This is because hyperminimisation inserts an extra symbol at the very beginning of the net: @P.LEXNAME.Root@. This symbol breaks the context requirements of the downcasing regex.

Hyperminimisation is not used very much, but can be turned on for speller optimisations without people being aware of or remembering this bug. It should thus be fixed.

@albbas
Copy link
Contributor Author

albbas commented Jan 12, 2016

Comment 11053

Date: 2016-01-12 08:54:24 +0100
From: Thomas Omma <<thomas.omma>>

hyperminimisation! :O

@albbas
Copy link
Contributor Author

albbas commented May 9, 2018

Comment 12811

Date: 2018-05-09 12:52:55 +0200
From: Sjur Nørstebø Moshagen <<sjur.n.moshagen>>

This is now finally fixed - mostly. There are still a few regressions for the spellers for words with CmpN/*Left tags, so will keep this open until it is completely fixed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working low priority
Projects
None yet
Development

No branches or pull requests

2 participants