segment-line: annotate polygon or clipped image #127

bertsky · 2020-05-12T18:41:27Z

Currently all we get is bounding boxes, which for historic print often overlap heavily.

Tesseract internally of course "knows" (already decided) which component belongs to which line, but how do we get that information via API? There are 2 general paths:

polygon coordinates via baseline; either via existing/old API or via new API we have to get into Tesseract, cf. Add RowAttributes getter to PageIterator tesseract-ocr/tesseract#2971 (comment)
retrieving a clipped line image for each line individually, perhaps via GetTextlines or GetComponentImages.

@wrznr what do you think?

The text was updated successfully, but these errors were encountered:

bertsky · 2021-02-08T15:08:31Z

Although we now have shrink_polygons (#162) as alternative solution (on all hierarchy levels), but GetImage may still be useful in some circumstances:

if the hull polygon still overlaps neighbours (because it should be more concave)
if the precision, which still is the bboxes of contained glyphs, is not enough (images transport the exact glyph polygon)

Here's an example of glyph images extracted by

ocrd-tesserocr-segment as it is (with BoundingBox), combined with ocrd-segment-extract-glyphs:
ocrd-tesserocr-segment modified by GetImage(RIL.SYMBOL, 0, None):

bertsky · 2021-02-08T15:14:15Z

So how about the following parameters for an opt-in (each having the segment images annotated as derived images):

ocrd-tesserocr-segment and ocrd-tesserocr-recognize: array parameter add_alternativeimages with values region, line, word and/or glyph
ocrd-tesserocr-segment-region, ocrd-tesserocr-segment-line and ocrd-tesserocr-segment-word: boolean parameter add_alternativeimages

bertsky · 2021-02-09T11:12:29Z

2. modified by GetImage(RIL.SYMBOL, 0, None):

Unfortunately, this only works with None as 3rd arg, which is equivalent to GetBinaryImage(RIL.SYMBOL). One can pass the raw image there, but Tesseract will only apply the polygon mask above the glyph level in that case. So there is no way to see raw images clipped to white around the polygon.

bertsky added the enhancement New feature or request label May 12, 2020

bertsky mentioned this issue May 12, 2020

segment-region: crop_polygons creates invalid coordinates #98

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

segment-line: annotate polygon or clipped image #127

segment-line: annotate polygon or clipped image #127

bertsky commented May 12, 2020

bertsky commented Feb 8, 2021

bertsky commented Feb 8, 2021

bertsky commented Feb 9, 2021

segment-line: annotate polygon or clipped image #127

segment-line: annotate polygon or clipped image #127

Comments

bertsky commented May 12, 2020

bertsky commented Feb 8, 2021

bertsky commented Feb 8, 2021

bertsky commented Feb 9, 2021