Skip to content

True_inference_with_LayoutLMv2ForTokenClassification, add support for custom OCR#128

Open
amtam0 wants to merge 1 commit into
NielsRogge:masterfrom
amtam0:layoutlmv2-custom-ocr
Open

True_inference_with_LayoutLMv2ForTokenClassification, add support for custom OCR#128
amtam0 wants to merge 1 commit into
NielsRogge:masterfrom
amtam0:layoutlmv2-custom-ocr

Conversation

@amtam0
Copy link
Copy Markdown

@amtam0 amtam0 commented Jun 19, 2022

Hi @NielsRogge,
Thanks for these great tutorials.
This PR to add support for custom OCR alternatives to the built-in one used in LayoutLMv2Processor.
In real world use-cases, Image quality is not good and Tesseract is not the best tool to use, using other alternatives that have better text detectors can improve inference performance (and can be used for fine-tuning). I added modularity to be able to use Tesseract for the recognition part if needed.
Popular OCR tools alternatives added in this Notebook under Inference Chapter:

Let me know if there are some changes to make
Thanks

@review-notebook-app
Copy link
Copy Markdown

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant