Please use this identifier to cite or link to this item: http://hdl.handle.net/10263/7153
Full metadata record
DC FieldValueLanguage
dc.contributor.authorGiri, Anjan-
dc.date.accessioned2021-05-13T08:00:27Z-
dc.date.available2021-05-13T08:00:27Z-
dc.date.issued2020-09-
dc.identifier.citation31p.en_US
dc.identifier.urihttp://hdl.handle.net/10263/7153-
dc.descriptionDissertation under the supervision of Dr. UJJWAL BHATTACHARYAen_US
dc.description.abstractState-of-the-art scene text recognition systems perform satisfactorily on samples of benchmark datasets as long as the quality of the text in an image sample is not a ected signi cantly by certain distortions such as blurring etc. However, their performance may drop sharply whenever the input text appears well outside the focus of the image capturing device or it is su ered by motion blur etc. In this study, we considered incidental scene texts which usually exhibit much more diversity, variability and complexity together with the common challenges of scene text recognition compared to their counterparts which are captured by properly positioning the camera and making possible adjustments of various image capturing parameters. In this work, we introduce a trainable deep network that implements a super-resolution technique as the preprocessing module on low quality scene images to boost text recog- nition accuracy of the existing models. There are various super resolution techniques for image available in the literature which mainly focus on reconstructing the detailed texture of image but fails to improve the quality of texts appearing in the image and thus the results of their recognition does not get improved. Here, we propose a novel text-content aware super-resolution network to improve the quality of texts appearing in natural scene image leading to their more accurate recog- nition by automatic methods. Simulation results of the proposed model on the ICDAR 2015 Incidental Scene Text dataset demonstrate its e ectiveness as an e cient preprocessing model. Code developed as a part of this dissertation is available at: https://github.com/ AnjanGiri/Thesis. 3en_US
dc.language.isoenen_US
dc.publisherIndian Statistical Institute, Kolkataen_US
dc.relation.ispartofseriesDissertation;;2020-5-
dc.subjectScene text regognitionen_US
dc.subjectPreprocessing moduleen_US
dc.subjectASTERen_US
dc.titleAn Efficient Preprocessing Module For Incidental Scene Text Recognitionen_US
dc.typeOtheren_US
Appears in Collections:Dissertations - M Tech (CS)

Files in This Item:
File Description SizeFormat 
Anjan_Giri_CS1811_MTCthesis2020.pdf3.7 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.