Skip to main content
Question

How can I get the TesseractCaller to output to the Line port rather than the Text or Reject ports?

  • October 9, 2018
  • 2 replies
  • 59 views

Forum|alt.badge.img

I have downloaded the tesseractcaller and tesseract but when I run a workspace I can not get any output on the Line port. When I use the output type of Text the output is returned in the Text port. But when I choose hOCR a "Tesseract didn't find any text on this raster" error is returned on the Rejected port. I get this result on the TesseractTest.fmwt example as well as the other samples provided by Safe Soft. Let me know if I have missed a setting somewhere.

This post is closed to further activity.
It may be an old question, an answered question, an implemented idea, or a notification-only post.
Please check post dates before relying on any information in a question or answer.
For follow-up or related questions, please post a new question or idea.
If there is a genuine update to be made, please contact us and request that the post is reopened.

2 replies

jovitaatsafe
Safer
Forum|alt.badge.img+11
  • Safer
  • 635 replies
  • October 9, 2018

Hi @fdharris13,

Sorry to hear that you're having some trouble with the TesseractCaller. We have a known issue that is in the works at the moment. The workaround that a colleague of mine found, is to go into the TesseractCaller custom transformer tab, and locate AttributeCreator4 (you can do this quickly in the navigator under Transformers). Change the _extension parameter under Attribute Value of Else to hocr. Then try running the workspace. This should do the trick for the example workspaces.

The second thing to check is what version of Tesseract you have. I have been able to test successfully with the older stable version (3.05.02). Consider installing the current stable version as opposed to the beta if you have edited the parameter and the transformer still isn't working for you. You can install that here.

The last thing to check is to make sure that the file path in the TesseractCaller points to your tesseract.exe file.

 


Forum|alt.badge.img
  • Author
  • 4 replies
  • October 10, 2018

Hi @fdharris13,

Sorry to hear that you're having some trouble with the TesseractCaller. We have a known issue that is in the works at the moment. The workaround that a colleague of mine found, is to go into the TesseractCaller custom transformer tab, and locate AttributeCreator4 (you can do this quickly in the navigator under Transformers). Change the _extension parameter under Attribute Value of Else to hocr. Then try running the workspace. This should do the trick for the example workspaces.

The second thing to check is what version of Tesseract you have. I have been able to test successfully with the older stable version (3.05.02). Consider installing the current stable version as opposed to the beta if you have edited the parameter and the transformer still isn't working for you. You can install that here.

The last thing to check is to make sure that the file path in the TesseractCaller points to your tesseract.exe file.

 

Changing the attribute value solved my problem. Thanks for your help!!