IDEAS home Printed from https://ideas.repec.org/a/igg/jkss00/v5y2014i3p36-45.html
   My bibliography  Save this article

Syntactic Pattern Based Word Alignment for Statistical Machine Translation

Author

Listed:
  • Quang-Hung LE

    (Faculty of Information Technology, Quynhon University, Vietnam & University of Engineering and Technology, Vietnam National University, Hanoi, Vietnam)

  • Anh-Cuong LE

    (University of Engineering and Technology, Vietnam National University, Hanoi, Vietnam)

Abstract

Word alignment is the task of aligning bilingual words in a corpus of parallel sentences, and determining the probabilities for these aligned bilingual word pairs. It is the most important factor affecting the quality of any Statistical Machine Translation (SMT) systems. The IBM word alignment models are most well-known in the SMT research community. These models are pure statistical models and therefore they are not good for some language pairs which have differences in linguistic aspects (e.g. grammatical structures). This paper aims to improve the IBM models by using syntactic information. The authors first propose a new type of constraint based on bilingual syntactic patterns, and then integrate it into the IBM models. Finally, they show how to estimate the models' parameters using this new type of constraint. The experiments are conducted on the English-Vietnamese language pair for evaluation.

Suggested Citation

  • Quang-Hung LE & Anh-Cuong LE, 2014. "Syntactic Pattern Based Word Alignment for Statistical Machine Translation," International Journal of Knowledge and Systems Science (IJKSS), IGI Global, vol. 5(3), pages 36-45, July.
  • Handle: RePEc:igg:jkss00:v:5:y:2014:i:3:p:36-45
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/ijkss.2014070103
    Download Restriction: no
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jkss00:v:5:y:2014:i:3:p:36-45. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.