
Towards Building an Arabic Plagiarism Detection System: Plagiarism Detection in Arabic

Authors

  • Imtiaz Hussain Khan (King Abdulaziz University, Jeddah, Saudi Arabia)
  • Muazzam Ahmed Siddiqui (King Abdulaziz University, Jeddah, Saudi Arabia)
  • Kamal M. Jambi (King Abdulaziz University, Jeddah, Saudi Arabia)

Abstract

This article describes a plagiarism detection system for Arabic that combines several similarity measures to uncover plagiarism in Arabic documents. The system has two main components: document retrieval and detailed similarity analysis. The document-retrieval component generates queries from a given suspicious document and uses the Google search API to retrieve candidate source documents from the Web. The similarity-analysis component then takes each candidate source document in turn and attempts to identify the plagiarized passages in the suspicious document. The system is evaluated using an indigenous corpus. At the document-retrieval level it achieves an F-score above 75%, and at the detailed similarity-computation level an F-score above 70%.
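
The abstract does not say how queries are built or which similarity measures are combined, so the following Python sketch is only an illustration of the general pipeline, not the authors' method. It assumes fixed-length word chunks as queries and Jaccard overlap of word trigrams as a single similarity measure; all function names and parameters (`generate_queries`, `words_per_query`, `jaccard_similarity`, `f_score`) are hypothetical. The Web search step is deliberately left out: each query would be passed to whatever search client is available.

```python
import re
from typing import List, Set, Tuple


def tokenize(text: str) -> List[str]:
    """Naive word tokenizer. In Python 3, \\w matches Unicode letters,
    including Arabic ones. Diacritics are not normalized here; a real
    Arabic system would need normalization and stemming first."""
    return re.findall(r"\w+", text)


def generate_queries(document: str, words_per_query: int = 8) -> List[str]:
    """Assumed query strategy: cut the suspicious document into
    consecutive chunks of `words_per_query` words."""
    tokens = tokenize(document)
    return [
        " ".join(tokens[i:i + words_per_query])
        for i in range(0, len(tokens), words_per_query)
    ]


def word_ngrams(text: str, n: int = 3) -> Set[Tuple[str, ...]]:
    """Return the set of word n-grams in `text`."""
    tokens = tokenize(text)
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def jaccard_similarity(suspicious: str, source: str, n: int = 3) -> float:
    """Assumed similarity measure: shared n-grams divided by all n-grams."""
    a, b = word_ngrams(suspicious, n), word_ngrams(source, n)
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def f_score(true_pos: int, false_pos: int, false_neg: int) -> float:
    """Balanced F-score (F1): harmonic mean of precision and recall."""
    if true_pos == 0:
        return 0.0
    precision = true_pos / (true_pos + false_pos)
    recall = true_pos / (true_pos + false_neg)
    return 2 * precision * recall / (precision + recall)
```

For example, if a retrieval run finds 3 of 4 true source documents and also returns 1 unrelated document, `f_score(3, 1, 1)` gives 0.75, which is how a figure such as "F-score above 75%" would be computed.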

Suggested Citation

  • Imtiaz Hussain Khan & Muazzam Ahmed Siddiqui & Kamal M. Jambi, 2019. "Towards Building an Arabic Plagiarism Detection System: Plagiarism Detection in Arabic," International Journal of Information Retrieval Research (IJIRR), IGI Global, vol. 9(3), pages 12-22, July.
  • Handle: RePEc:igg:jirr00:v:9:y:2019:i:3:p:12-22

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJIRR.2019070102
    Download Restriction: no

