IDEAS home Printed from https://ideas.repec.org/a/igg/jiit00/v14y2018i1p17-27.html
   My bibliography  Save this article

Word Sense Based Hindi-Tamil Statistical Machine Translation

Author

Listed:
  • Vimal Kumar K.

    (Jaypee Institute of Information Technology, Department of Computer Science and Engineering and Information Technology, Noida, India)

  • Divakar Yadav

    (Jaypee Institute of Information Technology, Department of Computer Science and Engineering and Information Technology, Noida, India)

Abstract

Corpus based natural language processing has emerged with great success in recent years. It is not only used for languages like English, French, Spanish, and Hindi but also is widely used for languages like Tamil, Telugu etc. This paper focuses to increase the accuracy of machine translation from Hindi to Tamil by considering the word's sense as well as its part-of-speech. This system works on word by word translation from Hindi to Tamil language which makes use of additional information such as the preceding words, the current word's part of speech and the word's sense itself. For such a translation system, the frequency of words occurring in the corpus, the tagging of the input words and the probability of the preceding word of the tagged words are required. Wordnet is used to identify various synonym for the words specified in the source language. Among these words, the one which is more relevant to the word specified in source language is considered for the translation to target language. The introduction of the additional information such as part-of-speech tag, preceding word information and semantic analysis has greatly improved the accuracy of the system.

Suggested Citation

  • Vimal Kumar K. & Divakar Yadav, 2018. "Word Sense Based Hindi-Tamil Statistical Machine Translation," International Journal of Intelligent Information Technologies (IJIIT), IGI Global, vol. 14(1), pages 17-27, January.
  • Handle: RePEc:igg:jiit00:v:14:y:2018:i:1:p:17-27
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJIIT.2018010102
    Download Restriction: no
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jiit00:v:14:y:2018:i:1:p:17-27. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.