Abstract:
This work aims to find an efficient method for measuring Optical Character
Recognition (OCR) accuracy in the absence of the ground truth text. To that end,
we first tried several efficient supervised accuracy-measuring techniques, which
require the ground truth text. We then tried
unsupervised techniques (which work in the absence of the ground truth text), the
main focus of our project, and compared their performance with that of the
supervised techniques. Our final goal is to provide an efficient unsupervised
accuracy-measuring technique that can help automate the document analysis
process.