IAD Index of Academic Documents
  • Home Page
  • About
    • About Izmir Academy Association
    • About IAD Index
    • IAD Team
    • IAD Logos and Links
    • Policies
  • Submit Paper
    • Submit Article
    • Submit Conference Paper
    • Submit Book
    • Submit Book Chapter
    • Submit Patent Document
    • Submit A Thesis
    • Submit A Technical Report
    • Submit Other Type of Document
  • Publisher/Editor Panel
    • Sign In/Sign Up
  • Open Access Documents Journal
  • Optical Character Recognition for Image Files Containing Bisaya Texts

Optical Character Recognition for Image Files Containing Bisaya Texts

Authors:Eirol Jan Coronado, Tristan Montaner
Pages:1-5
View:20
Download:10
Favorite:2
Abstract:Optical character recognition (OCR) is the mechanical or electronic translation of images of hand-written or printed text into machine-editable text [4]. It is performed by optical character readers which are automated electronic systems. OCR may be defined as the process of converting images of machine printed or handwritten numerals, letters, and symbols into a computer- processable format. This study Show the accuracy of the OCR. PyTesseract is the chosen program to assess the accuracy of the Optical character recognition. We used images from different books in order for us to extract text from images. We have also conducted alpha and beta testing to know if we were able to identify if the results will differ if the program was utilized by us or other person. An inconsistent result had been observed while testing the Pytesseract program. Although this program is very easy to use and most efficient, this study is an evidence that OCR is not always 100.
Keywords:Optical character recognition, text extraction, artificial intelligence, information extraction

ORIGINAL ARTICLE URL PDF URL

All Rights Reserved. İzmir Akademi Derneği
CopyRight © 2023