-
Notifications
You must be signed in to change notification settings - Fork 65
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Out of date SWORD module OSHB dated 2013-10-11 #47
Comments
SWORD module OSMHB version 2.0.1 is dated 2015-02-27 and is in the CrossWire Attic repository! |
Actually the OSHB obsoletes the OSMHB. I made an attempt to update the OSHB, but ran into a snag. I submitted a question to the SWORD devel mailing list, but received no response. |
The dates seem the wrong way round for that. |
Can you recall what the snag was? Was it technical or simply a lack of response from the "modules team"? I think we should try and remedy the situation. It may have fallen through the cracks after Chris Little left CrossWire. |
OK - I found the relevant thread in the Nabble mirror of sword-devel (which I find easier to search than the pipermail archives). I responded, but nobody else did, so essentially your questions remained unaddressed back in January. |
Here's the text of your message dated 29 January 2017:
|
I think this may be a major programming task on at least two fronts:
I can possibly provide some assistance with the second process. |
NB. Strong's numbers for Hebrew words should retain the H prefix. |
Rather than writing a Python or Perl script to convert the existing OSIS XML file to a format more suitable as input for osis2mod, my inclination would be to develop a bespoke TextPipe filter for this conversion task. I've been using TextPipe Standard for multiple tasks since 2001. I woudn't be without it. |
One question at the forefront of my mind is exactly how morph segmentation should be handled by front-ends? The existing .conf file for module OSHB includes the line:
For OSIS texts having morphological segmentation elements. NB. Module OSMHB does not have this line, yet the source text does have seg elements. I think one thing is clear at least. Correct me if I'm wrong. |
Aside: The SWORD utility diatheke does not yet have a Command Line option switch to handle OSISMorphSegmentation. cf. It has a switch -m for modules having either ThMLMorph or OSISMorph. AFAIK, this has never been pointed out to sword-devel before. |
If SWORD were to be enhanced to support the toggling the display of the solidus in suitably marked modules, would this obviate the need to use seg elements for this function? NB. You'd still be OK having them with a type attribute, as per this counted list of such:
Even so, if the existing method of showing how word segments are encoded works OK, I don't see a pressing need to implement a method that works on something in the text rather than in the XML markup. |
Thanks for all your research. I have just made a working module. I simplified the lemmas, to use just the Strong numbers, since that is all SWORD will look at for now anyway. I have also resolved the seg element issue. The question about the morphology attributes still remains, but the main requirement, at this point, is how to get the Hebrew Morphology Codes into a format SWORD can use. Then the codes could be interpreted, similar to Robinson codes in other modules. |
For modules that have
although Xiphos 4.0.6a has a corresponding module option switch, it seems to have no visible effect on the displayed text. What would you expect/require a font-end app to do with this feature? Do we even understand how SWORD is programmed for it? |
But to return to the main question, this is covered in the wiki. See Marking morphology. There's no section in this page about marking morpheme segmentation. We ought to insert a section and pick the brains of the developers. |
And while we're at it, we should propose a requirements specification for how SWORD should handle your n attributes for the disjunctive accents, or something equivalent should the developers wish to deprecate that use for the n attribute. See also issue #46 |
The Marking morphology topic refers to marking up morph attributes, based on an existing lexicon module. It says:
I would need to know the format for such a module, to construct one for the Hebrew Morphology Codes. |
Just added an incomplete subsection for marking morpheme segmentation to the OSIS Bibles page. Just added a section to the OSIS Bibles talk page. Just added a section to the diatheke talk page. |
The source text provenance isn't evident, but there's a Hebrew module hboWLCeb in the eBible.org repo that already includes some morphology markup. The attached text file is a counted list of morph attribute values extracted from the mod2imp output. hboWLCeb.raw.imp.hbo.morph.count.txt Here's an example:
We'd need to obtain more detailed information from our friend Michael Johnson to find out how he came by the source text. |
IMHO, it's unsuitable for the lemma for Strongs to include a space because (in effect) the markup is split into several unconnected attributes. The prefix letters and augment codes are not independent properties, but rather they are extensions to the one property called Strongs. Practically, the resulting "stretchy spaces" also give rise to serious misalignments when Strongs are displayed in (e.g.) Xiphos. It might be preferable to use a punctuation mark (such as a period), and to have only one attribute value rather than several. How this might work in practice while still giving a meaningful display is unclear. Further discussion with the developers should be sought. |
|
DavidTroidl, I made a module like Robinson for the Westminster text at one point. I can dig out the files. Basically you need a line for each logical possibility in the parsing scheme. |
Essentially I think you create a dictionary module in the imp format. That is the simplest way. Here is an example entry:
The $$$ introduces the dictionary key. The second line is the content of the entry. But most SWORD frontends display the dictionary entry key as well, so there is no need to repeat it in the entry. I hope that makes sense. |
Actually, let me see what I can come up with for the morph module, since I have done this before. |
While David & Daniel are pondering morphology markup, I'm still concerned about morpheme segmentation. It seems to me that the use of the seg element overlooks a very important point. An XML element only determines what happens to the text and elements within it. It's therefore impossible by using seg elements alone to achieve a functionality in (e.g.) SWORD whereby a morpheme segment separator is displayed or hidden. A few years ago, I came across a book with the title, "Space Between Words: The Origins of Silent Reading". Suppose we wanted SWORD users to experience what it's like to see Scriptio Continua? How would this be implemented (assuming SWORD was enhanced to support it) ? My best guess would be by replacing each space in the module text by an OSIS milestone element.
Then the SWORD engine could toggle the display of the marker string (here a single space). Front-end UI would need a module option to select Scriptio Continua. There are features like this already. Think of the Pilcrow sign in the KJV module when you flip from Verse Per Line to Paragraphs. In the same manner, morpheme segmentation could be implemented like this
where "mss" denotes Morpheme Segment Separator. At least, this is how I would do it. Anyone got any better idea? |
Is it the case that STEP Bible is the only front-end that supports morpheme segments?
Here's a question I asked in 2014 to sword-devel:
|
I had also written this to sword-devel in the same thread:
|
On 01/07/2015, Karl Kleinpaste committed a change in the source code for diatheke to add -o M for morpheme segmentation. NB. This is now documented in the wiki page for diatheke. But has it ever been tested? What exactly is the SWORD engine looking for in the OSIS XML whereby the filter will change what SWORD outputs? |
Daniel: Thank you. I appreciate the help. David: There are five or six different topics running in this thread at the same time. It is almost impossible to communicate coherently about all of them at the same time. Morpheme segmentation is something that needs to be resolved with SWORD, before it can be addressed here. Please, let us stick to the main point: getting a workable module out in the near future. On that score the interpretation of morphology is the primary issue. |
I appreciate that point. I'm trying to get my head round why SWORD has a feature that nobody seems to know exactly how it works. cf. The Morpheme segmentation filter is also specified in the WLC module. |
Are the Hebrew morphology codes found in module hboWLCeb substantially the same as used in morphhb ? Albeit perhaps a smaller set, due to when and how the source text was made. Might it even be the case that the source text for the module was simply a earlier snapshot from morphhb and used without proper attribution? |
FYI. The KJV module maintained by CrossWire contains morphology markup for the NT.
The OSIS XML header contains these work elements:
|
CrossWire has just released OSHB module version 1.2 |
I have reported to the "modules team" at CrossWire a number of issues with the updated module. I think these are all in their court. |
The reported issues have not yet been fixed by CrossWire! |
As I see that there have been a signifcant number of commits to the repo since 2013-10-11, it would be certainly useful to SWORD users for the OSHB module to be updated.
Or maybe OSHB should be replaced by one with a slightly different module name, being careful to use the Obsoletes key in the .conf file if that route is taken.
If you can find time to do this, it would be much appreciated.
The text was updated successfully, but these errors were encountered: