IDEAS home Printed from https://ideas.repec.org/a/pop/journl/v7y2023i1p57-64.html
Using RPA for data generation using OCR platforms in Mediterranean University of Albania

Author

Listed:
  • Gerild Qordja

    (Mediterranean University of Albania, Faculty of Informatics, Department of Information Technology, Tirana, Albania)

Abstract

The growth in the volume of data today has led to the use of computer applications to manage processes precisely. Robotic process automation (RPA), also known as software robotics, uses automation technologies to mimic the back-office tasks of human workers, such as extracting data, filling in forms, and moving files. Optical character recognition (OCR), sometimes referred to as text recognition, extracts and repurposes data from scanned documents, camera images, and image-only PDFs. OCR systems use a combination of hardware and software to convert physical, printed documents into machine-readable text: hardware such as an optical scanner or specialized circuit board copies or reads the text, and software then typically handles the advanced processing. Process Automation in Azure Automation allows you to automate frequent, time-consuming, and error-prone management tasks, which helps you focus on work that adds business value. In this paper, I will use the above-mentioned technologies to automate the data generation process for the construction of an online library. In addition, the level of data accuracy will be studied when automating data generation from PDF files into MySQL. The application will be built with an HTML front end, a PHP back end, and a MySQL database. The tests will be carried out by inserting more than 17,000 books in PDF format.
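
The abstract does not say how data accuracy is measured. One common approach, shown here only as a minimal sketch under that assumption, is character-level accuracy: compare each OCR-extracted field against a hand-verified reference value using edit distance. The sketch is in Python rather than the PHP stack named in the abstract, and the function names `levenshtein` and `character_accuracy` are hypothetical, not taken from the paper.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum insertions, deletions, and substitutions turning a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop to save memory
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def character_accuracy(reference: str, extracted: str) -> float:
    """Share of the reference text recovered correctly, clamped to the range [0, 1]."""
    if not reference:
        return 1.0 if not extracted else 0.0
    distance = levenshtein(reference, extracted)
    return max(0.0, 1.0 - distance / len(reference))
```

Averaging `character_accuracy` over a hand-checked sample of extracted fields (for example, book titles) would give one possible accuracy figure for the PDF-to-MySQL pipeline; word-level or field-level exact-match rates are equally plausible choices.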

Suggested Citation

  • Gerild Qordja, 2023. "Using RPA for data generation using OCR platforms in Mediterranean University of Albania," Smart Cities and Regional Development (SCRD) Journal, Smart-EDU Hub, vol. 7(1), pages 57-64, March.
  • Handle: RePEc:pop:journl:v:7:y:2023:i:1:p:57-64

    Download full text from publisher

    File URL: https://scrd.eu/index.php/scrd/article/view/176/139
    Download Restriction: no

    File URL: https://scrd.eu/index.php/scrd/article/view/176
    Download Restriction: no

    More about this item

    Keywords

    Microsoft Azure; Robotic Process Automation (RPA); Optical Character Recognition (OCR); MySql; Html;

    JEL classification:

    • O35 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - Social Innovation
