An HMM-based OCR framework for Telugu using a transfer learning approach - ECU Libraries Catalog

Back to Search Clear Search

An HMM-based OCR framework for Telugu using a transfer learning approach / by Jennifer Andriot.

Author/creator	Andriot, Jennifer author.
Other author/creator	Gudivada, Venkat N. degree supervisor.
Other author/creator	East Carolina University. Department of Computer Science.
Format	Theses and dissertations, Electronic, and Book
Publication Info	[Greenville, N.C.] : [East Carolina University], 2021.
Description	1 online resource (48 pages) : illustrations (chiefly color)
Supplemental Content	Access via ScholarShip
Subject(s)	Optical character recognition. Transfer learning (Machine learning)

Click here for more information about this title

Summary	Optical character recognition (OCR) for complex scripts such as Telugu has gained much attention over the past decade due to the significant advancements made in this area of research. The Telugu OCR framework in this work proposes a Hidden Markov model based approach using transfer learning to estimate the emission probability parameter of the model. This approach incorporates knowledge of the Telugu language into the framework via the hidden Markov model, while the pre-trained convolutional neural network, VGG-16, aids in estimating the emission parameter. A comparative analysis of two estimation techniques for estimating the emission parameter is also provided. One method utilizes Gaussian mixture models clustering using feature vectors obtained from VGG-16 and the second method utilizes the softmax outputs from VGG-16 to obtain emission probability estimates. The results from this framework show that using a pre-trained CNN for parameter estimation instead of as a classifier significantly reduces the resources required for developing an OCR framework for Telugu compared to implementing a CNN framework from scratch.
General note	Presented to the faculty of the Department of Computer Science
General note	Advisor: Venkat Gudivada
General note	Title from PDF t.p. (viewed July 6, 2022).
Dissertation note	M.S. East Carolina University 2021
Bibliography note	Includes bibliographical references.
Technical details	System requirements: Adobe Reader.
Technical details	Mode of access: World Wide Web.
Genre/form	Academic theses.
Genre/form	Academic theses.
Genre/form	Thèses et écrits académiques.

Available Items

Library	Location	Call Number	Status	Item Actions
	Electronic Resources	Access Content Online	✔ Available