-
Notifications
You must be signed in to change notification settings - Fork 1
/
smj.diff
5734 lines (5734 loc) · 322 KB
/
smj.diff
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
380c380
< Removing reference to remove-use_mt-strings.regex in makefile, as this file is moved to giella-shared. (See commit d7741401cd731e5c26c6637448dad743c378155c) 2022-02-23T10:36:48+01:00
---
> Removing reference to remove-use_mt-strings.regex in makefile, as this file is moved to giella-shared. (See commit 6aac46b0b81b5a47e3c1230faeec376ad4a4c082) 2022-02-23T10:36:48+01:00
1691,1692c1691,1692
< had forgotten some tags. continued the _retting av tagger i derivasjoner_ (a66d19a) and (46e7b30) 2020-10-28T14:24:28+01:00
< continued the _retting av tagger i derivasjoner_ that Lene started with in check in (a66d19a). Lene fixed tags for lexicon LEXICON MARJA-plc, I continued and fixed it for other lexicons too. 2020-10-28T11:51:32+01:00
---
> had forgotten some tags. continued the _retting av tagger i derivasjoner_ (17d1b0b) and (2c152c3) 2020-10-28T14:24:28+01:00
> continued the _retting av tagger i derivasjoner_ that Lene started with in check in (17d1b0b). Lene fixed tags for lexicon LEXICON MARJA-plc, I continued and fixed it for other lexicons too. 2020-10-28T11:51:32+01:00
1701c1701
< added Gram/r tag where Lene removed Gram/NomAg, checkin 2f34654 2020-10-27T09:52:48+01:00
---
> added Gram/r tag where Lene removed Gram/NomAg, checkin 27fd8f0 2020-10-27T09:52:48+01:00
2010c2010
< Updated yamls because of changes i checkin (e8ec911): removed +Der/A+A 2020-05-26T14:58:34+02:00
---
> Updated yamls because of changes i checkin (46268dc): removed +Der/A+A 2020-05-26T14:58:34+02:00
2179d2178
< svnignores 2020-04-16T07:38:30+00:00
2320d2318
< Dokumentasjon av GG-vedtak 2020-02-28T13:01:56+00:00
2429d2426
< Updated ignore patterns. 2019-10-23T18:40:46+00:00
2465,2468d2461
< ignore *.fomabin. 2019-10-08T06:35:05+00:00
< ign 2019-10-07T21:32:11+00:00
< ign 2019-10-07T21:15:15+00:00
< ign 2019-10-07T21:00:45+00:00
2470,2471d2462
< ign 2019-10-07T20:59:02+00:00
< Force unix line endings, to make sure it works ok also on the Windows subsystem for Linux. 2019-10-07T17:16:53+00:00
2543d2533
< Retta mime-typen slik at filene blir viste rett i nettlesarar. 2019-08-26T09:02:58+00:00
2578d2567
< Updating svn ignores for tools/analysers/. 2019-06-14T06:38:51+00:00
2634,2635d2622
< Updating svn ignores. 2019-05-24T09:55:04+00:00
< Updating svn ignores. 2019-05-24T09:44:55+00:00
2739d2725
< Updated svn ignores. 2019-02-27T10:18:02+00:00
2814,2815d2799
< Ignore compiled cg3 files in tools/tokenisers/. 2019-01-08T07:08:34+00:00
< Ignore more files, including files that are automatically added to svn when populating a new language. This is done to avoid them showing up as noise for external languages, in which case these files might not be in our svn (but in the external svn repo instead). 2019-01-08T06:55:51+00:00
2925,2926d2908
< ignore for bin 2018-10-14T13:31:01+00:00
< added korp.cg3 to svn ignore. 2018-10-14T12:56:20+00:00
2967d2948
< svn ignore update 2018-09-20T08:44:05+00:00
2969d2949
< updated svn ignore. 2018-09-20T08:28:11+00:00
2975d2954
< More general ignore pattern for tools/mt/apertium/tagsets/. 2018-09-10T11:16:40+00:00
2978d2956
< Updated svn ignore patterns. 2018-09-08T05:26:27+00:00
2995d2972
< Updated svn ignores. 2018-08-30T16:00:09+00:00
2998d2974
< Updated svn ignores. 2018-08-29T05:25:34+00:00
3001d2976
< Updating svn ignores. 2018-08-28T10:47:06+00:00
3084d3058
< More things to ignore. 2018-05-14T10:33:30+00:00
3228,3231d3201
< Added ignore pattern for in.txt 2018-03-01T07:09:50+00:00
< More ignores 2018-03-01T06:52:33+00:00
< More svn ignores. 2018-03-01T06:25:59+00:00
< Added svnignore pattern for sigma.txt. 2018-02-21T09:49:57+00:00
3241d3210
< Two more files to ignore. 2018-02-06T09:44:18+00:00
3263d3231
< Updated svn ignores. 2018-01-31T12:13:59+00:00
3369d3336
< Updated svn ignores. 2017-12-11T12:55:46+00:00
3402,3403d3368
< Updated svn ignores for tokenisers and grammar checkers + subdirs. 2017-10-11T11:47:18+00:00
< Updated svn ignores for tokenisers and grammar checkers + subdirs. 2017-10-11T11:22:45+00:00
3419d3383
< Updating svn ignores. 2017-08-25T10:22:58+00:00
3433,3434d3396
< Updated svn ignores. 2017-06-28T23:37:25+00:00
< Updated svn ignores. 2017-06-28T23:08:42+00:00
3449d3410
< ign 2017-03-21T19:49:19+00:00
3468d3428
< Updated svn ignores. 2017-03-01T12:02:48+00:00
3473a3434
> Moving dictionary work by Joshua from gt/smj/ to words/dicts/, as part of cleaning out the gt/ dir. 2017-02-16T07:15:19+00:00
3488d3448
< Updated svn ignores. 2017-01-30T10:04:48+00:00
3505a3466,3467
> Now that the releases are copied over to /Users/smo036/langtech/big/prooftesting/smj/, they can be removed from /Users/smo036/langtech/main/gt/smj/. 2017-01-24T06:46:06+00:00
> Removed empty or obsolete dirs in old infra. 2017-01-23T11:33:44+00:00
3862d3823
< Updated svn ignores. 2016-06-09T20:11:13+00:00
3959d3919
< Setting svn ignore patterns on tools/spellcheckers/filters/. 2016-05-10T01:00:11+00:00
4023d3982
< Ignore more preprocessor files = fst’s. 2016-04-14T16:01:04+00:00
4065d4023
< Updated svn ignores. 2016-03-15T19:54:49+00:00
4088d4045
< Use a more general svn ignore pattern in src/morphology/. 2016-03-07T17:10:12+00:00
4145d4101
< Updated the svn ignore property for recent changes in the infrastructure. 2016-02-16T22:36:51+00:00
4169d4124
< Updating svn:ignore’s. 2016-02-02T15:34:45+00:00
4174,4175d4128
< Updated svn:ignore’s. 2016-02-02T10:33:44+00:00
< Updated svn:ignore’s. 2016-02-02T10:16:28+00:00
4179d4131
< Updated svn ignores. 2016-01-25T08:11:45+00:00
4222d4173
< Set proper mime types, for better rendering in browsers. 2015-12-08T22:08:49+00:00
4256d4206
< Updated svn:ignore’s. 2015-11-18T23:05:40+00:00
4336d4285
< Updated svn ignores. 2015-10-20T07:50:38+00:00
4561d4509
< Ignore temporary files generated by the speller suggestion test script. 2015-09-03T04:23:51+00:00
4913d4860
< Ignore txt files in speller dirs. 2015-04-09T11:55:38+00:00
5014d4960
< Updated svn ignores. 2015-03-14T10:46:39+00:00
5024d4969
< Updated svn ignores. 2015-03-12T08:28:02+00:00
5045d4989
< Updated svn ignores. 2015-03-09T10:41:02+00:00
5049d4992
< Updated svn ignores. 2015-03-06T15:56:54+00:00
5058d5000
< Updated svn ignores. 2015-03-06T09:24:00+00:00
5098d5039
< Update svn ignores. 2015-02-27T12:55:53+00:00
5203a5145
> Script for turning sme to smj (candidates). Usage: compile to fst with xfst, then pipe sme __lemma__ through it. 2015-02-04T07:22:05+00:00
5207a5150
> version geared towards lexeme conversion 2015-02-02T21:42:16+00:00
5223a5167
> Improved the ea first syll rule, more to come. 2015-01-28T11:16:43+00:00
5240d5183
< Special svn:ignore on src/orthography/. 2015-01-26T10:30:36+00:00
5269d5211
< Setting svn:ignore for Bashkir, updating the other ignores. 2015-01-19T06:20:12+00:00
5287d5228
< Updated svn:ignore's. 2015-01-12T21:50:15+00:00
5289d5229
< svn:ignore also for *.service. 2015-01-12T15:58:08+00:00
5313d5252
< ignore apertium labels. 2015-01-05T07:54:56+00:00
5404d5342
< Update ignores for src/morphology/. 2014-10-23T08:25:38+00:00
5504d5441
< Updated svn:ignore's. 2014-09-08T21:51:28+00:00
5572d5508
< ignore 2014-08-16T15:52:40+00:00
5764d5699
< Ignore cg3 files in the tools/mt/apertium/ dir. 2014-06-02T14:02:42+00:00
5805d5739
< Ignore the generated pkg-config files. 2014-05-12T20:27:01+00:00
5861d5794
< Extra ignores on tools/mt/apertium/filters/. 2014-04-16T05:45:32+00:00
5866d5798
< Updated svn ignores. 2014-04-15T13:30:11+00:00
5870d5801
< Updated svn ignore. 2014-04-10T15:16:59+00:00
5896d5826
< Updated svn ignores. 2014-04-03T10:10:45+00:00
5899d5828
< Updated svn ignore. 2014-04-02T10:58:00+00:00
5913d5841
< Update svn ignores on test/src/phonology/. 2014-03-20T07:32:24+00:00
5929d5856
< Updated ignore patterns for test/tools/spellcheckers/. 2014-03-14T10:26:24+00:00
5944d5870
< Updated svn ignores. 2014-02-28T06:46:07+00:00
6090d6015
< Ignore mk-files in the preprocess/ dir. 2014-01-14T14:20:40+00:00
6104d6028
< Ignore generated easteregg files. 2014-01-12T11:44:07+00:00
6107d6030
< Ignore functions.cg3 and dependency.cg3 if they are copied from a shared location (these files will not be ignored if already in svn). 2014-01-10T10:24:56+00:00
6121d6043
< Updated the ignore pattern for morphology/stems/. 2014-01-09T13:45:44+00:00
6141,6142d6062
< Ignoring the smi-smj file as well. 2014-01-01T10:17:42+00:00
< Ignoring the smi-propernouns.lexc file. 2014-01-01T10:16:44+00:00
6181d6100
< Ignore generated regex files in the src/filters/ dir. 2013-12-03T11:56:43+00:00
6191d6109
< Extra ignores in the filters/ dir. 2013-11-24T15:25:54+00:00
6193d6110
< igno 2013-11-24T15:13:39+00:00
6198d6114
< Ignore the generated, concatenated lexc file. 2013-11-22T13:37:46+00:00
6224d6139
< Ignore generated documents in tools/shellscripts/. 2013-10-18T11:47:12+00:00
6236d6150
< One more ignore on the grammar checker dir. 2013-10-10T09:06:26+00:00
6238d6151
< Update svn ignores. 2013-10-09T11:01:58+00:00
6240d6152
< Updated svn ignores. 2013-10-09T07:04:36+00:00
6279d6190
< Ignore zipped files (built for apertium). 2013-08-19T07:13:39+00:00
6282d6192
< Ignore *.att files in src/ (produced by the apertium targets). 2013-08-17T12:30:28+00:00
6299d6208
< Ignore extra generated files in the tagsets dir. 2013-06-13T14:40:56+00:00
6308d6216
< Updated the svn ignores. 2013-06-06T08:10:31+00:00
6365d6272
< [ignore] Also ignore generated foma transducers. 2013-04-11T16:57:19+00:00
6379d6285
< Ignore the generated voikko-3 dir. 2013-03-18T09:43:22+00:00
6388d6293
< More extensive ignores on src/morphology/. 2013-03-12T08:07:27+00:00
6396d6300
< The old hunspell dir had been left in smj. Now deleted. 2013-03-06T06:11:15+00:00
6401d6304
< Ignore top-level build dir. 2013-02-27T07:18:08+00:00
6403d6305
< Update svn ignores. 2013-02-27T00:08:32+00:00
6407d6308
< Ignore the build/ dir within doc/. 2013-02-26T17:02:56+00:00
6479,6480d6379
< The ignore pattern was wrong - now correct. 2013-01-23T22:34:12+00:00
< Ignore generated yaml tests. 2013-01-23T22:28:44+00:00
6488a6388
> Added smj synlink 2013-01-23T15:32:12+00:00
6496d6395
< Updated svn:ignore with the latest modifications to the ignore patterns. 2013-01-14T19:34:16+00:00
6575a6475,6476
> Saami 2012-11-22T13:05:15+00:00
> Saami 2012-11-22T12:56:38+00:00
6819a6721,6829
> corrected typos in test 2012-09-25T13:29:20+00:00
> new lexicon to the word amen 2012-09-25T13:28:26+00:00
> Added testing of compounds requiring hyphens. SMJ nouns failing this test is now down to just two. 2012-09-25T09:50:57+00:00
> Added space between R and ; where missing, to be consistent and provide an easier and more robust grep pattern for lemma testing. 2012-09-25T09:37:43+00:00
> added a yaml test 2012-09-25T09:31:34+00:00
> Removed useless empty comments. 2012-09-25T09:19:54+00:00
> Updated proper noun ref to the new file after SMI names have been included. 2012-09-25T09:14:03+00:00
> Added proper processing of common nouns from SMI. Now SMJ contains the SMI propers again. 2012-09-25T09:07:38+00:00
> added a yaml test 2012-09-25T09:02:50+00:00
> Renamed to make the content clearer, to differentiate between the common and the language-specific proper nouns. 2012-09-25T09:02:44+00:00
> Added warning about missing YAML testing, with short instructions on how to enable them. Template update. 2012-09-25T06:19:32+00:00
> First merge of the SMI proper nouns. Not yet included in the build system, but now at least part of the src tree, and should be automatically updated from the SMI template for every template run (thus, semi-automatically). 2012-09-25T05:15:37+00:00
> Timestamp files to trigger merges from the smi template. 2012-09-25T05:05:00+00:00
> Copied the smi-smj conversion script from the old infra to the new. First step in enabling the shared propers in the new infra for SMJ. 2012-09-25T04:48:00+00:00
> added new sentences in ref-korpus 2012-09-24T10:02:25+00:00
> removde the double sjnj/snj entries 2012-09-24T09:56:17+00:00
> removde the double sjnj/snj entries 2012-09-24T09:53:07+00:00
> added a twolc rule for sjnj>snj 2012-09-24T09:33:48+00:00
> added a twolc for sjnj>snj 2012-09-24T09:31:11+00:00
> added sentences to ref-korpus 2012-09-24T08:59:57+00:00
> maked a propernoun with Use/-Spell 2012-09-24T08:39:51+00:00
> maked some propernouns with Use/-Spell 2012-09-24T08:38:14+00:00
> corrected entries 2012-09-24T08:34:46+00:00
> corrected the twolc rule for m-bbm, guomoj-guobbmu 2012-09-24T08:26:45+00:00
> new lexicon VÁJES 2012-09-24T08:24:59+00:00
> corrected entries 2012-09-24T08:24:20+00:00
> The top-level syntax include AM file had not been changed to reflect the rle->cg3 suffix change. Merge from template. The previous merge was incomplete due to a bug in the merge script. 2012-09-21T14:16:04+00:00
> corrected entries 2012-09-21T13:10:52+00:00
> changed the sjnj words to sjn in nouns.lexc 2012-09-21T12:03:00+00:00
> Fixed compound testing for nouns. 2012-09-21T11:28:42+00:00
> corrected entries 2012-09-21T11:24:19+00:00
> removed the LEXICON IDENTITET, it's not used anymore 2012-09-21T10:32:46+00:00
> gave lemmas the EHTTA lexicon and removed variations of the same lemma 2012-09-21T10:22:12+00:00
> changed the name of lexicon EHTTA-ÆHTTA to EHTTA, and started changing the tags of the lemmas 2012-09-21T09:15:06+00:00
> changed the name of lexicon EHTTA-ÆHTTA to EHTTA, and started changing the tags of the lemmas 2012-09-21T09:14:54+00:00
> Me and Sjur have edited the morpheme border in MUORRA and SÁMEGIEL, it made a dobble morpheme border which destroied the inflection in this class. Added a EHTTA-ÆHTTA lexicon. 2012-09-21T08:32:24+00:00
> reorganized ÁLLA, ÆLLA and SJONÁLLA lexicons, added a ÁLLA-ÆLLA lexicon 2012-09-20T12:27:25+00:00
> corrected entries,common lemma 2012-09-20T11:27:48+00:00
> corrected entries,common lemma 2012-09-20T10:05:22+00:00
> Corrected a bug in the default generate-noun-lemmas.sh test script. Made file references more robust. Update from template. This also clears any warnings left over from the cg3 file renaming, such that we get a clean merge in the next template update. 2012-09-20T09:26:56+00:00
> The VislCG3 team has lately switched to a *.cg3 suffix. Now we do the same in the new infra - the new suffix is definitely more transparent. 2012-09-19T18:26:12+00:00
> gave lexeme variations common lemma with +v1 and +v2 tags 2012-09-19T13:35:56+00:00
> added ref-korpus 2012-09-19T11:05:24+00:00
> corrected entries 2012-09-19T08:59:05+00:00
> corrected entries 2012-09-19T08:05:40+00:00
> Another tentative fix - Inga, please check. 2012-09-19T07:53:36+00:00
> Updated all scripts to use AC processing, for less relying on environmental variables = better portability. A lot of fine-tuning of the adj. testing. 2012-09-18T18:11:29+00:00
> Correcting entries. 2012-09-18T17:44:01+00:00
> made a new lexicon 2012-09-18T13:03:45+00:00
> removed abbreviated nouns that act as attribute 2012-09-18T12:20:46+00:00
> corrected adjectives entries 2012-09-18T10:52:48+00:00
> t 2012-09-18T09:56:35+00:00
> Corrected testing of TJAVGGÁMUS and PrfPrc_At words. 2012-09-18T09:51:40+00:00
> Corrected testing of VINJO- and At words. 2012-09-18T09:41:31+00:00
> corrected some entries 2012-09-18T08:29:56+00:00
> Variables=cleaner code. Update from the template. 2012-09-18T06:56:51+00:00
> Updated the yaml test runner to properly report the exit value of the yaml tests, and also to give directions for how to see the details of each test if it failed. Update from the template. 2012-09-17T14:10:42+00:00
> Corrected typo in shell scripts. 2012-09-17T11:03:08+00:00
> Several testing shell script updates: correct exit value when data files are not found, proper use of Autoconf-made variables (will free the test scripts from relying on the user setting up environment variables), and better checks on the availability of test data for the lemma and replaced all hard-coded file refs with variables in the noun generation test. 2012-09-17T09:30:31+00:00
> Added check for the Xerox lookup tool, which also defines the LOOKUP variable. Update from template. 2012-09-17T08:14:29+00:00
> Reorganised AC processing of shell scripts to be more future-proof and avoid annoying (and useless) warning from chmod. Added AC variable to the AC-processed shell script to make casual by-lookers aware of the fact that the resulting shell script file is generated by AC. 2012-09-17T07:37:40+00:00
> Corrected error in previous commit. Finally things are working as they should. It might be necessary to run ./autogen.sh and ./configure before compilation is running smoothly again. 2012-09-15T12:52:14+00:00
> Forgot to update configure.ac. 2012-09-15T10:43:23+00:00
> Refined the yaml test runner: more informative banner, ignore extra analyses (= removes false alarms). Merge from template. 2012-09-15T08:17:53+00:00
> Added basic setup for running YAML tests in the test/src/morphology/ dir. The default setup will run all *.yaml files found in this dir, but this can be modified in the shell (*.sh.in) script. If there are yaml files in that dir, they will be automatically run by 'make check'. 2012-09-14T19:28:03+00:00
> gave lemmas new ÁLLA, ÆLLA and SJONÁLLA lexicon 2012-09-14T13:19:32+00:00
> SÁMEGIEL lexicon 2012-09-14T12:53:44+00:00
> Enable yaml tests by magic. Merge from the template. 2012-09-14T11:54:49+00:00
> MiddleNouns lexicon 2012-09-14T11:41:22+00:00
> MiddleNouns lexicon 2012-09-14T11:41:03+00:00
> Added conditional support for running python-based tests in test/src/morphology. 2012-09-14T11:27:52+00:00
> new ÁLLA, ÆLLA and SJONÁLLA lexicon 2012-09-14T11:05:12+00:00
> new ÁLLA, ÆLLA and SJONÁLLA lexicon 2012-09-14T11:04:27+00:00
> Added checks for Python 3.1+ and py-yaml, and defined CAN_YAML_TEST. The idea is that we will run the python-based tests only if the prerequisites are available to us, and skip them if not. 2012-09-14T10:54:10+00:00
> new PrfPrc_At lexicon 2012-09-14T10:40:05+00:00
> Common lemma for sub-maked words 2012-09-14T10:33:47+00:00
> Common lemma for sub-maked words 2012-09-13T13:34:52+00:00
> Removed # on the left/upper/lemma side of the entries. This is the last file. 2012-09-13T10:48:54+00:00
> Corrected two instances of syntax errors introduced when removing # marks in the lemmas. 2012-09-13T10:48:07+00:00
> Changed to common lemma for all variations of each leksem. 2012-09-13T09:42:04+00:00
> Removed # on the left/upper/lemma side of the entries. 2012-09-13T09:40:20+00:00
> Removed # on the left/upper/lemma side of the entries. 2012-09-13T09:16:50+00:00
> Removed # on the left/upper/lemma side of the entry. 2012-09-13T09:03:48+00:00
> Removed # on the left/upper/lemma side of the entry. 2012-09-13T08:57:04+00:00
> Sorted files, and changed a lot of whitespace in some of them. 2012-09-13T08:12:45+00:00
> sorted-main-lex 2012-09-13T06:31:15+00:00
> Whitespace changes only. 2012-09-12T15:37:15+00:00
> Sprted (no ŋ's), and removed morfem boundary marker - conjunctions don't inflect in SMJ, thus there is no need for this marker. 2012-09-12T15:34:28+00:00
> Whitespace changes, and removed morfem boundary marker - conjunctions don't inflect in SMJ, thus there is no need for this marker. 2012-09-12T15:28:56+00:00
> Sorted (no ŋ's). 2012-09-12T15:25:34+00:00
> Sorted (no ŋ's). 2012-09-12T15:22:03+00:00
> Removed empty line. 2012-09-12T15:19:46+00:00
> Undid the sorting, as the sorting destroyed the ŋ. We need to use another sorting routine. 2012-09-12T09:41:16+00:00
> Changed lexicon names and generation patterns to suit SMJ. It seems to work as it should, at least all words I checked are now such that they have lemma errors in the lexicon. 2012-09-12T09:20:13+00:00
> Sometimes TABS are used between stems and continuation lexicons. 2012-09-12T08:58:20+00:00
> Added the lemma-generation tests used in SMA to SMJ. Verbs are almost perfect (one error), propers are good, and adjectives and nouns clearly have an error in the test bench. 2012-09-11T16:58:34+00:00
> Added support for transcribing transducers, ie transducers that change the input from one orthographical representation to another, e.g. date and time expressions as strings or digits to the opposite form. 2012-09-10T10:37:35+00:00
> Renamed the default error model file, to follow the naming scheme used in the zhfst guidelines. 2012-09-10T09:30:47+00:00
> Renamed the default error model file, to follow the naming scheme used in the zhfst guidelines. This makes compilation much easier, and should cause the present makefile to actually build spellers. Tommi already did this for FIN. 2012-09-10T09:25:01+00:00
> Don't remove the *.tmp files - that destroys the dependency relationships for (auto)make, which forces a full recompilation of all target fst's, and a lot of extra waiting time. 2012-09-10T09:00:36+00:00
> Add missing src to hfst spellchecker automaton path. Merge from template. 2012-09-08T13:00:22+00:00
> Added missing reference to dialect tag filter. Update from the template. 2012-09-08T08:42:58+00:00
> Updated my simplistic noun generation script to be aware of its new location. 2012-09-07T14:48:50+00:00
> Reorganised the test dir, in anticipation of a larger set of tools and source types in need of testing. Merge from the template. 2012-09-07T13:49:17+00:00
> sorted lines i most of the stems files 2012-09-07T13:29:44+00:00
> 2012-09-07T07:48:22+00:00
> added typos 2012-09-07T07:35:24+00:00
> Aligned the header with that of the other languages (example, better explanation of format). These two files were moved from their old sources, and thus had old headers. 2012-09-07T06:38:53+00:00
> Moved smj-typos.txt to the new infra. 2012-09-07T06:30:13+00:00
6820a6831,6847
> Added test/data/typos.txt to hold a list of collected typos. The list is used both for testing spellers, and as part of the preprocessor used with the Xerox lookup tool. 2012-09-07T06:27:36+00:00
> Major template update of all languages (except those already updated by Jack): * proper tag deletion of tags only used for transducer manipulation, not for analysis (manipulations mostly not yet implemented) * making optional some tag sets for the generators * updated README with correct and working instructions for first time installers, also for svn users * added hooks for easily adding language-specific operations on transducers * silenced the und.timestamp message unless you are a GTMAINTAINER (thanks to Tommi) => more synchronized template merges, less noise for regular users * a number of other small fixes 2012-09-06T19:19:58+00:00
> added a word 2012-09-06T07:19:30+00:00
> added a word 2012-09-06T07:04:42+00:00
> added a word and changed a lexicon tag 2012-09-05T13:28:52+00:00
> added a word 2012-09-05T12:56:17+00:00
> -rasset 2012-09-05T11:01:15+00:00
> corrected words with typos 2012-09-05T09:25:18+00:00
> added words 2012-09-05T08:57:08+00:00
> added words 2012-09-05T08:29:38+00:00
> to avoid compounds on -asstet 2012-09-05T08:22:01+00:00
> added a word 2012-09-05T07:26:30+00:00
> marked a word as hybrid 2012-09-03T13:09:13+00:00
> 2012-09-03T13:03:48+00:00
> added a sub-marked a word 2012-09-03T12:54:45+00:00
> sub-marked a word 2012-08-30T08:46:23+00:00
> Autoconf updates from the template. Intended goal: better hfst testing before enabling it. 2012-08-30T05:39:16+00:00
6821a6849,6850
> Simple shell script to get back the SMA and SMJ code for easy reference, building reference analyzers, etc. This is the old-vs-new infra safety net. 2012-08-29T18:22:49+00:00
> SMA and SMJ is now moved to the new infra. Not everything is moved yet, but this commit makes the break. Don't commit anything in here anymore! 2012-08-29T16:01:51+00:00
6822a6852,6869
> Added border removal to the basic analyser and generator, such that they become useful. Also changed the order of the dir processing in src/, to ensure that the filters are built before they are needed. 2012-08-29T03:12:39+00:00
> Added border removal to the basic analyser and generator, such that they become useful. Also changed the order of the dir processing in src/, to ensure that the filters are built before they are needed. 2012-08-29T03:06:20+00:00
> Corrected syntax error. 2012-08-29T01:46:18+00:00
> Made the first test script more robust: it bails out if no transducer is found, and gives basic feedback to whether it is testing Xerox or Hfst. The test data files are not deleted after the test run, so that they can be easily inspected if needed, even after a successful test run. 2012-08-29T01:34:20+00:00
> Replaced TAB with space. 2012-08-29T00:31:54+00:00
> Updated the file ref and the tag string to fit SMJ. 2012-08-29T00:23:39+00:00
> Added the first test script: it tests whether noun lemmas do generate. The script does contain some language-specific bits, and must thus be adapted to the requirements of each language. 2012-08-29T00:17:53+00:00
> Corrected reference to inituppercase.?fst. Template update. 2012-08-28T19:34:40+00:00
> Updated the timestamp of three languages - they didn't merge for some reason. 2012-08-28T19:02:44+00:00
> Corrected compilation of hyphenation rules. Template update. 2012-08-28T18:44:51+00:00
> Corrected compilation of phonetic/orth2ipa rules. Merge from the template. 2012-08-28T17:41:58+00:00
> Added basic structure for hyphenation and conversion to IPA. Merge from the template. 2012-08-28T05:45:14+00:00
> Now all inflectional morphology for the main POS's is split in separate files. The SMJ transducer should now be compilable again (with an exception for the common proper nouns). 2012-08-27T19:58:59+00:00
> Added Hunspell dir. Merge from templates. 2012-08-27T16:44:43+00:00
> Hunspell phon-rules file moved from the gt/smj/hunspell dir. 2012-08-27T16:21:19+00:00
> * Corrected configure.ac * renamed one cg3 file to follow filename convention. 2012-08-27T15:53:44+00:00
> Moved SMJ to the new infrastructure. It probably doesn't yet compile - there is some duplicate code in the inflectional files that need to be cleaned up. 2012-08-27T15:11:00+00:00
> compound-tags 2012-08-22T09:18:17+00:00
6824a6872,6873
> changed name/function of speller tags 2012-08-21T07:35:19+00:00
> tuning plc-org lexicons 2012-05-31T11:06:39+00:00
6826a6876
> plural outcommented for Plc and Org 2012-05-31T09:37:15+00:00
6827a6878,6879
> reshuffled DUBAI-lexicons 2012-05-30T13:09:26+00:00
> Added back Der/NomAct in its second incarnation, but outocmmented. I want to keep it for documentation purposes. 2012-05-28T12:03:26+00:00
6829a6882
> Removed doublet multichar symbols 2012-05-27T07:35:22+00:00
6830a6884,6885
> The Der/NomAct fix 2012-05-09T07:37:58+00:00
> unsubbed entries that were subbed because of speller 2012-05-08T14:12:46+00:00
6832a6888
> lulesáme actio-fix, GerII is noun Actio+Ess 2012-05-08T14:05:18+00:00
6833a6890,6891
> Bug1254- innføre svak bøjing av dat etc + subst, time for lulesábme 2012-05-08T09:07:49+00:00
> forgot some Sg-tags 2012-05-08T08:34:22+00:00
6835a6894
> Bug1254- innføre svak bøjing av dat etc + subst, time for lulesábme 2012-05-08T08:12:49+00:00
6836a6896,6897
> smj as it was did not compile, two lexica were missing. One was commented out, I commented it in, the other did not exist, I added it, but without further possible contlexes. 2012-05-05T10:38:21+00:00
> Remove the number of reps, wordlist2hunspell computes it 2012-04-27T00:00:27+00:00
6838a6900
> MARJA-I-obj 2012-04-20T07:12:29+00:00
6839a6902,6903
> missing lexicons from sme dump-part added 2012-04-18T07:26:34+00:00
> compound-marked the ab-nodots Forst since they are easily mixed with case-endings 2012-04-12T07:31:50+00:00
6841a6906
> We will have two more lines, the easter egg 2012-04-07T01:18:30+00:00
6842a6908,6909
> # fix, removed to the left. 2012-03-13T22:01:59+00:00
> added to credits 2012-03-12T18:14:54+00:00
6844a6912
> Renamed version number to reflect the correct version number of the lexicon files. 2012-03-09T17:12:06+00:00
6845a6914,6915
> Added the original 1.0 release PLX speller lexicons to the release dirs. Part of extending the speller test bench. 2012-03-09T17:04:28+00:00
> M4 work on SMJ. All M4 code removed, and the same escaped symbols and morphological boundary markers have been checked and added, to ensure consistency across languages. Also the punctuation file has been updated to follow SME, and some compounding corrections done to numerals. 2012-03-07T11:34:47+00:00
6847a6918
> removed ^ from upper 2012-03-07T11:15:36+00:00
6848a6920,6921
> removed ^ from upper 2012-03-07T08:55:22+00:00
> removed ^ from upper 2012-03-07T08:55:09+00:00
6850a6924
> added some id tags to divs 2012-03-07T00:47:09+00:00
6851a6926,6927
> added new veriable for top margin offset; added other translations to result set; removed entry number from result set 2012-03-07T00:27:15+00:00
> added new variable for top margin offset 2012-03-07T00:26:15+00:00
6853a6930,6931
> added new variable for top margin offset 2012-03-07T00:25:57+00:00
> added new variable for top margin offset 2012-03-07T00:25:34+00:00
6856a6935,6936
> Sort-uniq-ed all Der1 derivations. Most of these were not declared as multichar symbols (update with proper declarations forthcoming). 2012-03-06T17:08:06+00:00
> Commented in a couple of lexicons required by recent additions(?) in the lexicon. 2012-03-06T16:23:43+00:00
6858a6939,6940
> Removed upper-side ^-marks - they showed up in the analysis after the M4 changes. Still many left, Thomas will look at the rest tomorrow. 2012-03-06T16:23:03+00:00
> Whitespace changes, added pipe as CLB to bring the list of CLB symbols in line with SME. 2012-03-06T08:37:03+00:00
6860a6943
> changed name of table in sql DB 2012-03-03T19:22:13+00:00
6861a6945,6946
> minor changes 2012-03-03T17:52:06+00:00
> updated, improved short list 2012-03-03T17:51:22+00:00
6863a6949
> added links, text 2012-03-03T17:50:44+00:00
6864a6951,6952
> modified to adjust page to browser window size 2012-03-02T18:04:07+00:00
> fixed bug 2012-03-02T15:39:08+00:00
6866a6955
> added ability to allow regular expressions 2012-03-02T05:27:01+00:00
6867a6957,6958
> minor addition 2012-03-02T05:26:29+00:00
> modifications to allow for regular expression search interface option 2012-03-02T05:26:10+00:00
6869a6961
> minor modifications 2012-03-02T05:25:20+00:00
6870a6963,6964
> added modification to search interface 2012-03-02T05:24:36+00:00
> added pop-up for list of regular expressions 2012-03-02T05:23:47+00:00
6872a6967
> added part of regular expression search interface 2012-03-02T05:23:06+00:00
6873a6969,6970
> changes to advanced search interface 2012-03-02T05:22:11+00:00
> added addition search interface with regular expressions 2012-03-02T05:21:23+00:00
6875a6973
> added title pop-up text hints to search fields 2012-02-28T16:05:17+00:00
6876a6975,6976
> added title pop-up text hints to search fields 2012-02-28T16:04:51+00:00
> added file for basic search of mávsulasj data 2012-02-28T15:37:07+00:00
6878a6979,6980
> added file for advanced search of mávsulasj data 2012-02-28T15:36:40+00:00
> added folder incl. individual part files for mávsulasj searchable website 2012-02-28T15:35:28+00:00
6881a6984,6985
> added xslt to compare categories in mávsulasj excel data 2012-02-24T15:25:10+00:00
> formatting changes 2012-02-09T08:31:02+00:00
6883a6988,6989
> formatting changes, excuse the colors 2012-02-09T00:23:59+00:00
> added mouse-over effects 2012-02-08T23:28:45+00:00
6885a6992
> slight modification to a table 2012-02-08T22:52:07+00:00
6886a6994,6995
> major changes, added java script to display details on right upon clicking an entry 2012-02-08T22:25:16+00:00
> slight modifications 2012-02-08T15:58:46+00:00
6888a6998
> cleaned up 2012-02-08T15:14:50+00:00
6889a7000,7001
> fixed bug in output 2012-02-08T15:11:30+00:00
> added unique record number to SQL-producing script so this number is in SQL table, too 2012-02-08T15:06:15+00:00
6891a7004
> numbered SMJ alphabet file for creating unique numbers for each record 2012-02-08T14:36:22+00:00
6892a7006,7007
> added calculation and category for unique entry number for each entry based on alphabet and row number 2012-02-08T14:35:31+00:00
> changes to php output and added new <div> for specific results 2012-02-08T11:54:38+00:00
6894a7010
> added folder, php-file for searching mysql DB for mávsulasj data 2012-02-08T10:11:11+00:00
6895a7012,7013
> added folder, file to generate SQL databases for mávsulasj data 2012-02-08T08:47:21+00:00
> added folder, files to generate HTML versions of mávsulasj data 2012-02-08T08:46:40+00:00
6897a7016
> added folder, files for excel to xml script(s) 2012-02-08T08:45:23+00:00
6898a7018,7019
> added folder, files for consistency check scripts 2012-02-08T08:44:45+00:00
> Added the 1.1 release of the speller/hyphenator lexicons for SME and SMJ. These are both stored for reference, and to be able to do proper comparison testing of the different Divvun releases. 2012-02-07T22:25:08+00:00
6900a7022
> changed variable; now double-quote not ok anywhere in incoming EXCEL data 2012-02-06T15:00:23+00:00
6901a7024,7025
> deleted junk 2012-02-06T12:04:10+00:00
> added script to create mysql database for mávsulasj data 2012-02-06T12:03:21+00:00
6903a7028
> added test results for testing responsiveness of live websearch of SMJ G data 2012-01-31T14:03:33+00:00
6904a7030,7031
> updated version with more info for search results 2012-01-24T09:59:54+00:00
> fixed spelling mistake 2012-01-24T09:55:15+00:00
6906a7034
> changed spelling mistake 2012-01-24T09:54:34+00:00
6907a7036,7037
> changed spelling mistake 2012-01-24T09:54:15+00:00
> name correction 2012-01-24T09:44:57+00:00
6909a7040
> update name 2012-01-24T09:36:14+00:00
6910a7042,7043
> lots of data for search data set 2012-01-23T20:51:52+00:00
> added file for testing lots of data for search 2012-01-23T20:51:23+00:00
6912a7046
> added file for testing lots of data for search 2012-01-23T20:51:09+00:00
6913a7048,7049
> minor changes to text displayed 2012-01-23T20:49:23+00:00
> added title 2012-01-23T15:45:57+00:00
6915a7052,7053
> changed file name misspelling from liveserachHTML 2012-01-23T15:41:44+00:00
> added more output values to help refine search results 2012-01-23T15:40:06+00:00
6917a7056,7057
> added more elements to output 2012-01-23T15:39:12+00:00
> new files for testing SMJ data online search 2012-01-23T14:30:12+00:00
6919a7060
> files and folders for testing websearch options 2012-01-23T13:37:22+00:00
6920a7062,7063
> javascript needed for the table in the output file 'Index_SMJentries.html' 2012-01-19T15:54:21+00:00
> slight change to first line of output 2012-01-19T15:42:17+00:00
6922a7066
> updated with more categories in table 2012-01-19T15:37:43+00:00
6923a7068,7069
> added more categories to output xml files 2012-01-19T15:37:22+00:00
> added Index generator for smj layout entries - very initial version 2012-01-19T15:15:45+00:00
6925a7072
> minor changes 2012-01-19T15:15:00+00:00
6926a7074,7075
> fixed amateur mistake 2012-01-19T14:40:14+00:00
> fixed amateur mistake 2012-01-19T14:38:13+00:00
6928a7078
> added xml output files to be able to generate an index 2012-01-19T14:37:13+00:00
6929a7080,7081
> added xslt transformer to get initial results for web presentation - very initial version 2012-01-19T14:36:00+00:00
> added xslt transformer to get initial results for web presentation - very initial version 2012-01-19T14:29:28+00:00
6931a7084
> added dir and files for testing html-layout options for Mávsulasj data 2012-01-19T12:25:38+00:00
6932a7086,7087
> removed nummar91 2011-12-21T15:17:14+00:00
> small change 2011-12-19T16:10:32+00:00
6934a7090
> added better path for testing 2011-12-19T16:00:41+00:00
6935a7092,7093
> added results folder and subfolder consistency for final results 2011-12-19T15:52:22+00:00
> updated with .smallText style 2011-12-19T15:34:21+00:00
6937a7096
> changed location of sentence 2011-12-19T15:33:53+00:00
6938a7098,7099
> javascript to sort HTML tables 2011-12-19T15:28:46+00:00
> added line of text to indicate sorting table function 2011-12-19T15:28:08+00:00
6940a7102
> added javascript command to sort table by data in a column upon clicking header 2011-12-19T15:27:43+00:00
6941a7104,7105
> added comment 2011-12-19T15:08:37+00:00
> added javascript command to sort table by data in a column upon clicking header 2011-12-19T15:07:03+00:00
6943a7108
> added total valid (non-empty) cells out of total cells 2011-12-19T14:34:35+00:00
6944a7110,7111
> Added compound tagging to entries. 2011-12-19T14:19:43+00:00
> added empty and total cell counts 2011-12-19T14:00:29+00:00
6946a7114
> added empty and total cell counts 2011-12-19T13:59:33+00:00
6947a7116,7117
> changes, cleaning up for use with generateIndex.xsl 2011-12-19T12:38:22+00:00
> major overhaul to create Index from generated pattern files 2011-12-19T12:37:47+00:00
6949a7120
> fixed copy_paste error 2011-12-19T11:03:30+00:00
6950a7122,7123
> infos from the xml files 2011-12-19T10:59:22+00:00
> further restructuring 2011-12-19T10:50:59+00:00
6952a7126
> deleted empty lines 2011-12-19T10:46:25+00:00
6953a7128,7129
> restructuring 2011-12-19T10:42:46+00:00
> renamed file 2011-12-19T10:37:48+00:00
6955a7132
> include XML output 2011-12-19T10:32:49+00:00
6956a7134,7135
> deleted the output of the main site 2011-12-19T10:24:44+00:00
> renamed 2011-12-19T10:14:04+00:00
6958a7138
> small change 2011-12-19T10:12:22+00:00
6959a7140,7141
> for testing multiple changes 2011-12-19T09:58:06+00:00
> removed example, input form is compound as well 2011-12-19T09:36:12+00:00
6961a7144
> added .css for consistency checker output 2011-12-18T20:12:12+00:00
6962a7146,7147
> stylistic changes and added css-references for html-output 2011-12-18T20:11:26+00:00
> added total patterns, other info to html 2011-12-18T17:38:41+00:00
6964a7150
> added HTML index with href to each individual category's result page 2011-12-18T17:02:29+00:00
6965a7152,7153
> corrected typo 2011-12-18T12:58:32+00:00
> added initial HTML output documents 2011-12-18T12:57:49+00:00
6967a7156
> merged plain-text output into this script, now two outputDirs are created, one for XML, one for TXT 2011-12-16T22:57:36+00:00
6968a7158,7159
> added consistency checker version with plain-text print out, better for Bruce to look at, and i'm still just practicing 2011-12-16T22:23:14+00:00
> changed pattern count from element to attribute 2011-12-16T20:35:02+00:00
6970a7162
> added pattern counter and a comment 2011-12-16T15:56:39+00:00
6971a7164,7165
> added variables and steps for reordering output based on frequency; renamed some nodes; deleted junk 2011-12-16T15:44:55+00:00
> added variables and steps for reordering output based on frequency; renamed some nodes; deleted junk 2011-12-16T15:38:53+00:00
6973a7168
> cleaning up 2011-12-16T11:25:55+00:00
6974a7170,7171
> grouping the cells based on their content and counting the group elements 2011-12-16T11:17:25+00:00
> baustellenbeschilderung 2011-12-16T11:05:03+00:00
6976a7174
> the most important functionality for the program is now in place 2011-12-16T10:57:35+00:00
6977a7176,7177
> depth optimization 2011-12-16T10:29:54+00:00
> path correction 2011-12-16T10:28:17+00:00
6979a7180
> path correction 2011-12-16T10:28:04+00:00
6980a7182,7183
> further changes 2011-12-16T10:22:44+00:00
> output based on cell content 2011-12-16T10:18:36+00:00
6982a7186
> further change 2011-12-16T10:01:39+00:00
6983a7188,7189
> further change 2011-12-16T09:56:29+00:00
> further change 2011-12-16T09:51:41+00:00
6985a7192
> further change 2011-12-16T09:47:10+00:00
6986a7194,7195
> first important change 2011-12-16T09:40:45+00:00
> only formatting changes the script a bit more readable 2011-12-16T08:57:58+00:00
6988a7198
> added change to output a file name referencing category 2011-12-16T08:43:40+00:00
6989a7200,7201
> script now outputs all patterns of the set category, like a grep function, but needs refining. see comments in the script 2011-12-15T22:17:36+00:00
> trying to remove this folder, not necessary for svn 2011-12-15T21:43:52+00:00
6991a7204
> added folder for script and data checking consistency of Mávsulasj data; first script finished, needs refining 2011-12-15T21:39:44+00:00
6992a7206,7207
> renamed the script to reflect its use 2011-12-13T10:53:16+00:00
> moved the main script to a more appropriate place 2011-12-13T10:52:07+00:00
6995a7211
> Changed smj clitic tags. 2011-12-08T15:34:23+00:00
6996a7213,7214
> added tansformed Excel->xml file for testing consistency of data - transformed using the Excel-killer 2011-12-07T16:05:39+00:00
> added big file of original data in original Excel->xml format for testing 2011-12-07T16:00:51+00:00
6998a7217
> added output directory and file to processing output 2011-12-07T15:58:01+00:00
6999a7219,7220
> changed the names of some elements and attributes 2011-12-07T14:12:03+00:00
> merged mutator into processRow 2011-12-07T13:58:34+00:00
7001a7223
> Added hyphenated nums and increased to include 1 000 000 numerals. 2011-12-07T13:53:16+00:00
7002a7225,7226
> final save before testing of merging mutator into processRow 2011-12-07T11:15:08+00:00
> positive testcase 2011-12-07T10:53:58+00:00
7004a7229
> added automatic output of xsl-file-name 2011-12-07T10:53:28+00:00
7005a7231,7232
> added automatic output of xsl-file-name 2011-12-07T09:20:15+00:00
> copy of original working solution (excel killer) just as backup 2011-12-07T08:26:25+00:00
7007a7235
> comments 2011-12-06T15:55:42+00:00
7008a7237,7238
> simplified the conditions 2011-12-06T15:53:27+00:00
> problem solved 2011-12-06T15:36:50+00:00
7010a7241
> now, pass on the global variable with the labels down to each row in order to be able to use it locally 2011-12-06T14:22:49+00:00
7011a7243,7244
> deleted useless stuff 2011-12-06T14:10:09+00:00
> better debug info in the shell; constructed the global variable to match against each row 2011-12-06T14:08:44+00:00
7013a7247,7248
> tabula rasa: just ovewritten the code here with that from xls-2004xml2simple-xml_JKW_02 2011-12-06T13:29:44+00:00
> changed for-each line for testing 2011-12-06T12:48:38+00:00
7015a7251,7252
> new version of xls-2004xml2simple-xml_JKW_02.xsl for testing cip's recommendation and with a name that's easier to enter 2011-12-06T12:40:06+00:00
> corrected typo 2011-12-06T10:11:02+00:00
7017a7255
> added a possibly workable solution for the empty rows' problem as pseudo-code: Josh's task is to implement it: lykke til 2011-12-06T10:08:43+00:00
7018a7257,7258
> output if the script is not a well-formed xml: corrected 2011-12-06T09:33:57+00:00
> changes to excel-killer to include cells even for cells that were originally missing (and not just empty) in the excel export; not an elegant xsl, but works at least as long as no more than four cells were missing in the original excel export. 2011-12-05T15:44:15+00:00
7020a7261
> change for consistency in attributes for empty elements 2011-11-30T17:48:01+00:00
7021a7263,7264
> small changes to add cataloging info to output file 2011-11-30T17:40:41+00:00
> revised version of the EXCEL killer 2011-11-30T17:15:19+00:00
7023a7267
> revised version of the EXCEL killer 2011-11-30T17:06:27+00:00
7024a7269,7270
> visualizing some info selection for Josh 2011-11-30T14:59:46+00:00
> once again, with descriptors for all cells, latest version 2011-11-30T14:48:02+00:00
7026a7273
> once again, with descriptors for all cells 2011-11-30T14:47:26+00:00
7027a7275,7276
> once again, with descriptors for all cells 2011-11-30T14:38:57+00:00
> now fine-tuned to include original column category as attribute 2011-11-30T14:29:16+00:00
7029a7279
> broke the EXCEL code, it works, just need lots of fine tuning 2011-11-30T14:26:34+00:00
7030a7281,7282
> broke the EXCEL code, it works, just need lots of fine tuning 2011-11-30T14:24:23+00:00
> filter the elements before and after 2011-11-29T15:16:00+00:00
7032a7285
> still debugging 2011-11-29T13:04:34+00:00
7033a7287,7288
> debugging version 2011-11-29T07:56:16+00:00
> working on the problem: partly solved 2011-11-28T21:23:54+00:00
7035a7291
> deleted outcommented stuff; first debugging steps 2011-11-28T20:28:23+00:00
7036a7293,7294
> count all cells, also the empty ones 2011-11-28T20:09:04+00:00
> added another row with different patterns to account for 2011-11-28T20:06:22+00:00
7038a7297
> changed paths 2011-11-28T20:01:43+00:00
7039a7299,7300
> an even smaller file for the first debug operation 2011-11-28T19:53:34+00:00
> debugging Josh' problem 2011-11-28T19:44:52+00:00
7041a7303
> positive testcases 2011-11-24T13:52:29+00:00
7042a7305,7306
> positive testcases 2011-11-24T13:33:40+00:00
> redirected 2011-11-24T13:33:23+00:00
7044a7309
> Changed hyphen final arabic numeral not to compound in the middle part when compounding with nouns and adjs. 2011-11-24T11:04:42+00:00
7045a7311,7312
> positive testcases 2011-11-24T10:21:27+00:00
> Changed arabic numeral compounding with nouns. 2011-11-23T16:32:38+00:00
7047a7315
> Added generated arabic numerals 1-10000 and 0-0000. 2011-11-23T14:34:12+00:00
7048a7317,7318
> Piped the file through uniq. 2011-11-23T14:23:54+00:00
> Added arabic numeral plx file. This includes inflected and hyphen final numbers 0-9 2011-11-23T14:15:07+00:00
7050a7321,7322
> pos testcases 785 2011-11-21T10:01:51+00:00
> Fjerna signatur. 2011-11-20T22:20:42+00:00
7051a7324,7325
> Aligned and updated the test words for all languages. Now we are either using exactly the same word, or words belonging to the same category. This should help us make sure that we don't destroy for each other across languages. 2011-11-15T14:31:05+00:00
> Svn copy of the sme test data file for proper noun PLX conversion testing. 2011-11-15T11:07:40+00:00
7053a7328
> Added sloppy punctuation, to keep parallel with sme and sma. 2011-11-14T09:42:11+00:00
7054a7330,7331
> revisions; works nicely now 2011-11-11T20:12:00+00:00
> new versions of everything 2011-11-11T13:30:41+00:00
7056a7334
> added new comment 2011-11-10T23:45:49+00:00
7057a7336,7337
> updated notes, files and discussion 2011-11-10T23:27:19+00:00
> added for testing 2011-11-10T23:19:14+00:00
7059a7340
> added for testing 2011-11-10T23:18:52+00:00
7060a7342,7343
> additions for testing 2011-11-10T23:18:24+00:00
> adding examples for testing 2011-11-10T15:59:33+00:00
7063a7347
> added source data example from FilemakerPro11 export, used as a source for mutotor_00.xsl 2011-11-10T12:47:32+00:00
7064a7349,7350
> added indenter script for svn 2011-11-10T12:45:20+00:00
> added dir and example for xsl(t) stuff worth saving 2011-11-10T12:43:39+00:00
7066a7353
> Changed +Cmpnd to +RCmpnd. 2011-11-09T16:07:52+00:00
7067a7355,7356
> tuning 2011-11-03T17:18:04+00:00
> Added smj test cases. 2011-11-03T13:16:51+00:00
7069a7359
> added some comments to script 2011-11-01T17:04:56+00:00
7070a7361,7362
> script to play around with, including splicing indivdual characters in a string 2011-11-01T15:48:44+00:00
> practice xml file 2011-11-01T15:46:20+00:00
7072a7365,7366
> changes to test files 2011-10-31T15:42:47+00:00
> test excel file 2011-10-31T15:18:33+00:00
7076a7371,7372
> Whitespace changes to make it easier to compare smj and sme, and easier to debug and change tags. Moved comments around. 2011-10-28T07:45:34+00:00
> Structured the normative compounding tags according to SMA, which has a much clearer list of them. Added the +Der tag, even though it doesn't seem to be needed for SMJ. It is still there for cross-language tag compatibility and potential future use. 2011-10-27T12:23:47+00:00
7078a7375,7376
> Cleaned up and synchronised the acro inflection and compounding of SMA, SME and SMJ. Whitespace changes to make the code more readable. 2011-10-27T12:11:35+00:00
> Corrected one misplaced colon. 2011-10-24T08:45:54+00:00
7080a7379
> More tag cleanup: shuffling things around, whitespace changes, comments. 2011-10-21T11:51:50+00:00
7081a7381,7382
> Completed the separation of the derivation filter and the focus filtering. Now focus filtering is specified in a separate, language-specific file, whereas derivation filtering (ie filtering out impossible combinations) is done by one language-independent filter. This is safe, since the der-filter is only concerned with order, and only looks at the +Der1..5 tags (and +Der/Pass[LS] - to filter out double passive derivations). 2011-10-21T11:02:59+00:00
> +Der/at and +Der/vuohta is +Der5 in SMJ. 2011-10-21T10:57:20+00:00
7083a7385
> DUBAIs back, because some of us get error-message 2011-10-19T13:34:05+00:00
7084a7387,7388
> We don't trace these keywords anymore. 2011-10-18T05:31:21+00:00
> Major cleanup of the multichar tags. Now they are listed in the same order as in SME, and doublets are removed. Also replaced +Cmpnd with +RCmpnd. The CG files are updated at the same time. 2011-10-17T13:31:51+00:00
7086a7391
> these words are already converted 2011-10-17T12:49:21+00:00
7087a7393,7394
> R changed to RHyph 2011-10-17T08:28:59+00:00
> tagged alhpabet +Use/-Spell 2011-10-11T17:16:53+00:00
7089a7397
> This is a real regex now. 2011-10-10T14:52:17+00:00
7090a7399,7400
> tuning compound-tags, thrown away the Sg,Gen's 2011-10-10T07:06:56+00:00
> removed all uppercase+number multichars from here as well 2011-10-06T09:55:35+00:00
7092a7403
> muhto replaced by jali etc 2011-10-05T12:08:33+00:00
7093a7405,7406
> long awaited and much requested transitivity hinto lexicon-name mark-up, done with extraordinary precision and accuracy 2011-09-30T10:28:59+00:00
> subbed away adj-Px's 2011-09-29T12:47:53+00:00
7095a7409
> Removed unused files, svn-copy of the SMA error model file, to be used as a starting point for developing a useful correction mechanism. 2011-09-28T09:13:49+00:00
7096a7411,7412
> The SMJ error model updated with a proper subset of the speller symbols (some chars escaped for bash and make compatibility): 2011-09-27T11:29:45+00:00
> Svn-copy of the SME error model to SMJ and SMA. The alphabet used should be ok for all three languages. 2011-09-27T09:24:35+00:00
7098a7415
> rearranged Refl after sme pattern (cf. bug 567). The thing is that there is no number distinction for possessed. 2011-09-26T13:18:13+00:00
7099a7417,7418
> Number facultative in Refl 2011-09-26T13:17:31+00:00
> commented out not understandable word 2011-09-23T08:46:09+00:00
7101a7421
> forbettered regr-file, better testcases, some testcases out etc 2011-09-23T08:45:34+00:00
7102a7423,7424
> alhpabet with nodot as well 2011-09-22T10:55:53+00:00
> correction 2011-09-21T09:48:33+00:00
7104a7427
> correction 2011-09-21T09:15:44+00:00
7105a7429,7430
> test-cases Bug1144 and 1145 2011-09-21T09:03:41+00:00
> Fixed clitics. 2011-09-20T10:06:45+00:00
7107a7433
> Version 2.1 -> 2.2 2011-09-16T14:17:42+00:00
7108a7435,7436
> case-tags to all abbr:s 2011-09-16T09:39:15+00:00
> case-tags to all abbr:s 2011-09-16T08:30:03+00:00
7110a7439
> Same acro change as in sme. 2011-09-15T11:48:09+00:00
7111a7441,7442
> small letter alphabet 2011-09-14T09:24:14+00:00
> testcase Bug1122 2011-09-12T10:35:51+00:00
7113a7445
> removed the vinjo-examples :D 2011-09-09T10:25:04+00:00
7115a7448,7449
> Added support dirs and file for libvoikko+hfst for SMJ. 2011-09-08T10:23:24+00:00
> Changed all refs from SME to SMJ (the text is really provisional, and full of errors, but serves its purpose for now). 2011-09-07T10:59:32+00:00
7118a7453
> Copied folder and files from SME, to add basic HFST support for SMJ. 2011-09-07T10:49:20+00:00
7119a7455,7456
> better testcases, corrections etc 2011-09-06T11:32:53+00:00
> Removed ^ and 0 from lemma forms. 2011-09-04T06:52:00+00:00
7121a7459
> some actios and an actor 2011-08-24T07:30:14+00:00
7122a7461,7462
> corrections, additions 2011-08-23T13:39:20+00:00
> test-cases, Bug634smj/polderland/regression.txt 2011-08-09T10:00:47+00:00
7124a7465,7466
> removed @U.Cap.Opt@ -flags to get optional lowe/high initial case to placename derivativessme/src/propernoun-sme-morph.txt 2011-08-09T09:13:52+00:00
> symlink 2011-08-08T22:27:43+00:00
7125a7468,7469
> test-cases, Bug837smj/polderland/regression.txt 2011-08-08T07:57:13+00:00
> better test-cases, Bug641sme/polderland/regression.txt 2011-08-05T08:36:10+00:00
7126a7471,7472
> Northern -> North, Southern -> South. 2011-08-04T23:09:26+00:00
> corrected typo, Òdáðahraun to Ódáðahraun. 2011-08-04T09:07:48+00:00
7128a7475,7476
> Replaced two tabs with space 2011-08-04T09:03:38+00:00
> flags for initial uppercasing 2011-08-03T11:36:53+00:00
7130a7479,7480
> flags for initial uppercasing 2011-08-03T11:36:40+00:00
> updates 2011-06-29T19:38:22+00:00
7131a7482,7483
> Fixed the 00:30 bug for sme and smj as well: The hour after 24:00 was marked as 12, and not as 1, as it should. Luckily most people are asleep at that time. 2011-06-21T21:47:05+00:00
> Reverted last change and added additional tags. 2011-06-20T14:32:54+00:00
7133a7486
> Removed 0 and ^ from the left-hand lemma form. Sorted, removed doublets. 2011-06-10T19:11:02+00:00
7134a7488,7489
> missingssme/src/adj-sme-lex.txt 2011-06-09T11:19:47+00:00
> missingssme/src/adj-sme-lex.txt 2011-06-08T12:18:45+00:00
7136a7492
> missings and correction to smj-regression-filesme/src/adj-sme-lex.txt 2011-06-06T09:10:47+00:00
7137a7494,7495
> Bug #855, only one generated form of smj relative pronoun. 2011-06-04T13:08:49+00:00
> missingssmj/src/typos.txt 2011-06-03T11:29:10+00:00
7139a7498
> missingssmj/src/typos.txt 2011-06-03T10:30:31+00:00
7140a7500,7501
> missingssmj/src/typos.txt 2011-06-03T08:37:25+00:00
> missingssmj/src/typos.txt 2011-06-02T08:58:00+00:00
7142a7504
> missingssmj/src/typos.txt 2011-06-01T11:43:41+00:00
7143a7506,7507
> The iesj bug #567. 2011-05-31T20:51:07+00:00
> all kindof hyphens to ARABICLOOPsrc/numeral-sma-lex.txt 2011-05-30T12:11:18+00:00
7145a7510
> corrections 2011-05-30T11:06:26+00:00
7146a7512,7513
> As in sme, now full Refll paradigm 2011-05-26T15:13:43+00:00
> oj - compilation errorsmj/src/noun-smj-lex.txt 2011-05-26T12:03:01+00:00
7148a7516
> compound-tags and more words moved to HyphNounssme/src/noun-sme-lex.txt 2011-05-26T12:01:36+00:00
7149a7518,7519
> more words moved to HyphNouns 2011-05-26T11:31:15+00:00
> moved a line, now it all works againsrc/smj-lex.txt 2011-05-25T11:01:49+00:00
7151a7522
> new lexiconstructure with lexicons renamed and moved to noun-filesrc/noun-smj-lex.txt 2011-05-25T10:43:38+00:00
7152a7524,7525
> new twstpairs Bug607, better reflecting todays structuresme/polderland/regression.txt 2011-05-25T07:55:29+00:00
> testcases Bug913sme/polderland/regression.txt 2011-05-23T18:16:32+00:00
7154a7528
> Added +CmpN/Pref as Multichar_Symbol. 2011-05-23T15:59:37+00:00
7155a7530,7531
> prefix tagssma/src/noun-sma-lex.txt 2011-05-23T14:37:44+00:00
> Changed clitic tag '+' to end tag 'E'. 2011-05-23T13:29:35+00:00
7157a7534
> removed irrelevant test-casessmj/polderland/regression.txt 2011-05-23T12:17:56+00:00
7158a7536,7537
> Ávvir-wordssme/src/propernoun-sme-lex.txt 2011-05-20T09:20:56+00:00
> Ávvir-wordssme/src/propernoun-sme-lex.txt 2011-05-19T10:52:10+00:00
7160a7540
> Ávvir-wordssme/src/propernoun-sme-lex.txt 2011-05-18T11:22:10+00:00
7161a7542,7543
> Missing list from free corpus. 2011-05-05T12:46:37+00:00
> Missing list from bound corpus. 2011-05-05T12:43:18+00:00
7163a7546
> Change to accommodate the new version of HfstTester.py. 2011-05-04T10:23:46+00:00
7164a7548,7549
> compounding copied from sma, functions goodsmj/src/smj-lex.txt 2011-05-02T11:44:00+00:00
> more acro+noun compounds move to quasiacrosrc/acro-sma-lex.txt 2011-05-02T10:36:53+00:00
7166a7552
> LEXICON quasiacro with initial capital nounssrc/acro-sma-lex.txt 2011-05-02T10:31:37+00:00
7167a7554,7555
> the missing tagssme/src/pronoun-sme-morph.txt 2011-05-02T09:18:16+00:00
> DUBAI lečes 2011-04-29T11:11:00+00:00
7169a7558
> removed ^C and ^G, flags to both sides 2011-04-27T11:23:29+00:00
7170a7560,7561
> lexicons because of the sme dump 2011-04-27T08:50:26+00:00
> A small change to make the make file work with the new twol compilation. 2011-04-12T19:22:43+00:00
7172a7564
> Whitespace change only, to make the two contexts easier to compare. 2011-04-12T19:21:10+00:00
7173a7566,7567
> GATA-plc lexicon 2011-04-12T11:52:56+00:00
> Added one test with Px inflection, clitics. This paradigm is probably bigger than anything found in real life, but it ensures we test all possible forms produced by our fst. 2011-04-07T14:46:19+00:00
7176a7571
> Corrected test set ID and comment. 2011-04-04T12:55:40+00:00
7177a7573,7574
> Replaced the dummy sma example with a real smj one, this one also with possessives. No real errors found, the errors reported are related to a bug in the test tool. 2011-04-03T20:31:06+00:00
> Added initial test files for sme and smj for transducer testing, by svn copy of the sma file. Contains one sma test case as an illustration of the format. 2011-04-03T20:04:32+00:00
7179a7577
> more changes #>ENDLEX 2011-03-22T09:38:36+00:00
7180a7579,7580
> test-cases Bug 971 2011-03-21T12:54:35+00:00
> K>#sme/src/acro-sme-lex.txt 2011-03-21T12:17:14+00:00
7182a7583
> more test-cases Bug963 2011-03-15T09:04:47+00:00
7183a7585,7586
> Changed to generate only one clitic; Foc/ge. 2011-03-15T08:25:13+00:00
> Clitics tagging was changed... 2011-03-14T14:21:38+00:00
7185a7589
> transitivity redirectionssme/src/verb-sme-lex.txt 2011-03-14T07:32:53+00:00
7186a7591,7592
> +N to all firstcomponent entriessmj/src/noun-smj-lex.txt 2011-03-11T13:58:17+00:00
> Lexicons for better Pl+Ill formsme/src/propernoun-sme-morph.txt 2011-03-11T13:25:18+00:00
7188a7595
> testcases Bug963 2011-03-09T11:37:24+00:00
7189a7597,7598
> rearranged some to avoid passive actors according to sme-sma samkøyring meeting findingssmj/src/smj-lex.txt 2011-03-08T19:40:32+00:00
> outcommented passive actors according to sme-sma samkøyring meeting findings, have obiously not found everyone yetsme/src/sme-lex.txt 2011-03-08T13:28:29+00:00
7191a7601
> chockingly I found a julevsá me word to add 2011-03-03T09:56:26+00:00
7192a7603,7604
> common 2011-02-27T22:38:12+00:00
> Fixed the » and > bug, changed to »7 and >7, these are used as suffix boundaries. 2011-02-27T19:57:56+00:00
7194a7607
> transitivity-worksma/src/verb-sma-lex.txt 2011-02-24T09:08:36+00:00
7195a7609,7610
> Updated version. 2011-02-23T15:24:26+00:00
> transitivity worksmj/src/verb-smj-lex.txt 2011-02-14T12:56:51+00:00
7197a7613
> transitivity worksmj/src/verb-smj-lex.txt 2011-02-11T13:19:00+00:00
7198a7615,7616
> transitivity worksmj/src/verb-smj-lex.txt 2011-02-11T10:39:17+00:00
> transitivity worksmj/src/verb-smj-lex.txt 2011-02-10T13:06:00+00:00
7200a7619
> transitivity worksmj/src/verb-smj-lex.txt 2011-02-10T10:23:01+00:00
7201a7621,7622
> transitivity worksmj/src/verb-smj-lex.txt 2011-02-10T07:18:54+00:00