Marks particle as error instead of the preceding Err/Orth of the same mwe #45
In both cases I need the original text to be able to reproduce and debug. The paragraph containing the problem should be enough, maybe even just the sentence. |
Jos dal vel Sámis leat sullasaš dilit go davviriikkain muđuid, de fuobmá árvvoštallamiin goit ovtta erenoamáš ášši mii earuha sámi árvvoštallamiid omd. dáža árvvoštallamiin. Girječálli birra, ja su ođđa girji ovddeš bargguiguin veardádallon, gávnnat hárve sámi árvvoštallamiin. Čiekŋaleabbo dieđu go ahte gos čálli lea riegádan ja gos ássá, gávnnat hárve. Oalle dábálaš lea dákkár diehtu lohkkái: «Mus eai leat obanassii sánitge rámidit nn čehppodaga, dajan dušše ahte áŋgirit ja čeahpit gultturbargi ii gávnna ohcaminge.» (Samefolket 1/89, s. 92). Fuobmá maiddái dán čállosa ovdamearkka vuosttas siiddus: «- rohkkes Láhpoluobbala gollenieida …» Čállái báhcá goit rápmi, jos dal ii čiekŋalit ággaduvvon. |
Jos ohcala siva dasa ahte čálli birra leat uhcán dieđut, de vástádus dáidá leat nu álki go ahte nie šaddá lunddolaččat servodagasgos [116] buohkat dovddadit. Árvvoštalli duhtá dasto daid dábálaš dieđuide mat juo buohkain leat čálli birra. Dás han maiddái lea sáhka dušše ođđa girjjiin, ja otnáš čálliin. Ii árvvoštalli arvva lebbet dieđuid čálli eallimis omd., jos dal vel oaivvildeš ahte leat leamaš váikkuheaddji áššit čálli loahpalaš bargui, go dát sáhttet leat oalle persovnnalaččat ja dan sivas eai gula almmolašvuhtii. Sáhttá gal maiddái árvvoštalli leat oaivvildeamen ahte eat dárbbaš dárkilieabbo dieđuid go mat mis juo buohkain leat. Dán dili bahá ja buriid beliid garvván dán oktavuođas. Muhto dattege, jos vuos dieavaslaččat áigut árvvoštallat sámi girjjiid, de fertet árvvoštaladettiin maiddái ohcalit ja čállit biográfalaš dieđuid, muhto dieđusge dakkár dieđuid mat leat relevánta. Geaid luhtte čálli lea ijastallan maŋemuš jagiid, ja galle luovosmáná sus leat, eai leat eanemus relevánta dieđut. Dattege sáhttet leat čálli birrasis dakkár váikkuheaddji elementtat maid birra sáhtášii leat dehálaš diehtit. Ahte čálli eallimis sáhttet leat váikkuheaddji olbmot, dáhpáhusat ja fearánat mat leat váikkuhan su go girjji čálii, leat girjjálašvuođadiehttagis dohkkehan dutkanveara áššin. Historjjábiográfalaš árvvoštallama vuogis dát lea guovddáš ášši, earret dieđusge teaksta dahje girji maid čálli almmuha. |
the same in Googledocs |
The first case, |
The second example, the |
"<girjjálašvuođadiehtta>" |
This is what's going on:
$ echo 'girjjálašvuođadiehttagis' | modes/trace-smegramrelease3-cg.mode
"<girjjálašvuođadiehttagis>"
"gis" Pcle <W:0.0> "<gis>" <LastCohort> <firstCohort>
"diehtit" V <EX-Nom-Ani> TV Ind Prs Sg3 Err/Orth <W:0.0> <LastCohort> <firstCohort> SUBSTITUTE:4876
"girji" Ex/N Sem/Txt Der/lasj Ex/A Der/vuota N Cmp/SgGen Cmp <W:0.0> "<girjjálašvuođadiehtta>" <LastCohort> <firstCohort>
"gis" Pcle <W:0.0> "<gis>" <LastCohort> <firstCohort>
"diehtit" V <EX-Nom-Ani> TV Ind Prs Sg3 Err/Orth <W:0.0> <LastCohort> <firstCohort> SUBSTITUTE:4876
"girji" Ex/N Sem/Txt Der/lasj Ex/A Ex/Attr Der/vuota N Cmp/SgGen Cmp <W:0.0> "<girjjálašvuođadiehtta>" <LastCohort> <firstCohort>
"gis" Pcle <W:0.0> "<gis>" <LastCohort> <firstCohort>
"diehtit" V <EX-Nom-Ani> TV Ind Prs Sg3 Err/Orth <W:0.0> <LastCohort> <firstCohort> SUBSTITUTE:4876
"girjjálaš" Ex/A Der/vuota N Cmp/SgGen Cmp <W:0.0> "<girjjálašvuođadiehtta>" <LastCohort> <firstCohort>
"gis" Pcle <W:0.0> "<gis>" <LastCohort> <firstCohort>
"diehtit" V <EX-Nom-Ani> TV Ind Prs Sg3 Err/Orth <W:0.0> <LastCohort> <firstCohort> SUBSTITUTE:4876
"girjjálašvuohta" N Sem/Txt Cmp/SgGen Cmp <W:0.0> "<girjjálašvuođadiehtta>" <LastCohort> <firstCohort>
$ echo 'girjjálašvuođadiehttagis' | modes/trace-smegramrelease4-mwe-split.mode
"<girjjálašvuođadiehtta>"
"girji" Ex/N Sem/Txt Der/lasj Ex/A Der/vuota N Cmp/SgGen Cmp <W:0.0> <LastCohort> <firstCohort>
"girji" Ex/N Sem/Txt Der/lasj Ex/A Ex/Attr Der/vuota N Cmp/SgGen Cmp <W:0.0> <LastCohort> <firstCohort>
"girjjálaš" Ex/A Der/vuota N Cmp/SgGen Cmp <W:0.0> <LastCohort> <firstCohort>
"girjjálašvuohta" N Sem/Txt Cmp/SgGen Cmp <W:0.0> <LastCohort> <firstCohort>
"<gis>"
"gis" Pcle <W:0.0> <LastCohort> <firstCohort>
"diehtit" V <EX-Nom-Ani> TV Ind Prs Sg3 Err/Orth <W:0.0> <LastCohort> <firstCohort> SUBSTITUTE:4876
$ echo 'girjjálašvuođadiehttagis' | modes/trace-smegramrelease.mode
"<girjjálašvuođadiehtta>"
"girji" Ex/N Sem/Txt Der/lasj Ex/A Der/vuota N Cmp/SgGen Cmp <W:0.0> <LastCohort> <firstCohort>
"girji" Ex/N Sem/Txt Der/lasj Ex/A Ex/Attr Der/vuota N Cmp/SgGen Cmp <W:0.0> <LastCohort> <firstCohort>
"girjjálaš" Ex/A Der/vuota N Cmp/SgGen Cmp <W:0.0> <LastCohort> <firstCohort>
"girjjálašvuohta" N Sem/Txt Cmp/SgGen Cmp <W:0.0> <LastCohort> <firstCohort>
"<gis>"
"gis" Pcle <W:0.0> <LastCohort> <firstCohort> @PCLE MAP:22090:r16 &typo ADD:10126:Err/Orth-any
"diehtit" V <EX-Nom-Ani> TV Ind Prs Sg3 Err/Orth <W:0.0> <LastCohort> <firstCohort> SUBSTITUTE:4876
typo
"gis" Pcle <W:0.0> <LastCohort> <firstCohort> @PCLE MAP:22090:r16 &typo &SUGGEST ADD:10126:Err/Orth-any COPY:10135:Err/Orth-any
"diehtit" V <EX-Nom-Ani> Ind Prs Sg3 <W:0.0> <LastCohort> <firstCohort> SUBSTITUTE:4876
diehtit+V+Ind+Prs+Sg3#gis+Pcle ?
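To make it easier to see which cohort ends up carrying the error tag in traces like the above, here is a minimal sketch that scans CG stream output and collects the `&`-prefixed tags per wordform. It assumes the usual stream layout (a `"<wordform>"` line followed by its reading lines); the function name `error_tags_by_wordform` and the abbreviated `SAMPLE` are made up for illustration and are not part of the divvun tooling.

```python
import re

# A cohort header is a line consisting only of "<wordform>".
WORDFORM = re.compile(r'^"<(.+)>"$')
# Error tags are whitespace-delimited tokens starting with '&', e.g. &typo.
ERROR_TAG = re.compile(r'(?<!\S)&\S+')


def error_tags_by_wordform(stream: str) -> dict[str, set[str]]:
    """Map each wordform in a CG stream to the &-tags found on its readings."""
    result: dict[str, set[str]] = {}
    current = None
    for raw in stream.splitlines():
        line = raw.strip()
        match = WORDFORM.match(line)
        if match:
            # New cohort: remember its wordform.
            current = match.group(1)
            result.setdefault(current, set())
        elif current is not None and line.startswith('"'):
            # Reading or sub-reading of the current cohort.
            result[current].update(ERROR_TAG.findall(line))
    return result


# Abbreviated from the trace-smegramrelease.mode output above (assumption:
# only the tags relevant to this issue are kept).
SAMPLE = '''\
"<girjjálašvuođadiehtta>"
    "girjjálašvuohta" N Sem/Txt Cmp/SgGen Cmp <W:0.0>
"<gis>"
    "gis" Pcle <W:0.0> @PCLE &typo
        "diehtit" V <EX-Nom-Ani> TV Ind Prs Sg3 Err/Orth <W:0.0>
'''

if __name__ == "__main__":
    for wordform, tags in error_tags_by_wordform(SAMPLE).items():
        print(wordform, sorted(tags))
```

On the abbreviated sample, `&typo` is attached to `gis` while `girjjálašvuođadiehtta` gets no error tag, which is the behaviour this issue is about.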
Much simplified, we have the following from the analyser:
which
Now the generator gets sent
which doesn't give any results. If we could send just
or in the original example:
which
At least that's one possibility – I have no idea how hard that would be to do on the lexicon side. |
(↑ is divvun/divvun-gramcheck-web#18, ↓ is this issue)