TrOCR in ONNX format does not read the full text #1858

feff2 · 2024-05-16T03:31:57Z

platform: Windows 10
optimum version 1.19.2
transformers version 4.40.2
onnx version 1.16.0
onnxruntime version 1.17.3

I fine-tuned the small-printed TrOCR model on my custom dataset of multiline images. The trained model reads the full text, but after converting it to ONNX, the model detects only the first line, or only part of the first line. For inference I used this script: [https://github.com/huggingface/transformers/issues/19811#issuecomment-1303072202](https://gist.github.com/mht-sharma/f38c670930ac7df413c07327e692ee39)
To convert the model to ONNX I used this command: `optimum-cli export onnx -m {model_checkpoints} --task vision2seq-lm onnx/ --atol 1e-3`

It is unclear why the ONNX model recognizes only the first line of text (with almost no loss of quality).
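
One way to narrow this down is to run the same image through the PyTorch checkpoint and the exported ONNX model with identical generation settings, then find the first token where the two outputs differ. The sketch below is only an illustration under some assumptions: it uses optimum's `ORTModelForVision2Seq` with `generate` instead of the manual decoding loop from the linked script, it assumes the processor files are saved next to the checkpoint, and `first_divergence`, `compare_generation`, and the `max_new_tokens=256` default are hypothetical names and values chosen for this example.

```python
from typing import Optional, Sequence


def first_divergence(a: Sequence[int], b: Sequence[int]) -> Optional[int]:
    """Return the first index where two token-id sequences differ, or None if identical."""
    for i, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return i
    # One sequence is a strict prefix of the other: they diverge where the shorter one ends.
    if len(a) != len(b):
        return min(len(a), len(b))
    return None


def compare_generation(checkpoint_dir: str, onnx_dir: str, image_path: str,
                       max_new_tokens: int = 256) -> dict:
    """Hypothetical helper: decode one image with both models and report where they diverge."""
    # Heavy imports are local so first_divergence can be used without them installed.
    from PIL import Image
    from transformers import TrOCRProcessor, VisionEncoderDecoderModel
    from optimum.onnxruntime import ORTModelForVision2Seq

    # Assumption: the processor was saved in the same directory as the checkpoint.
    processor = TrOCRProcessor.from_pretrained(checkpoint_dir)
    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(images=image, return_tensors="pt").pixel_values

    pt_model = VisionEncoderDecoderModel.from_pretrained(checkpoint_dir)
    ort_model = ORTModelForVision2Seq.from_pretrained(onnx_dir)

    # Use the same generation budget for both models so the comparison is fair.
    pt_ids = pt_model.generate(pixel_values, max_new_tokens=max_new_tokens)[0].tolist()
    ort_ids = ort_model.generate(pixel_values, max_new_tokens=max_new_tokens)[0].tolist()

    return {
        "pytorch_text": processor.decode(pt_ids, skip_special_tokens=True),
        "onnx_text": processor.decode(ort_ids, skip_special_tokens=True),
        "first_divergence": first_divergence(pt_ids, ort_ids),
    }
```

If the ONNX token sequence is simply a shorter prefix of the PyTorch one, `first_divergence` returns the length of the ONNX output, which would point at the place where decoding stopped rather than at a recognition error.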


amyeroberts · 2024-05-16T09:20:15Z

Hi @feff2, thanks for raising an issue!

I'm transferring this issue to the optimum repo, as it seems this is more related to that library.

feff2 · 2024-05-16T14:47:50Z

amyeroberts transferred this issue from huggingface/transformers May 16, 2024

feff2 closed this as completed May 16, 2024

feff2 reopened this May 17, 2024

feff2 closed this as completed Jun 10, 2024
