IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0339762.html

An entropy-based study of Simplification in ChatGPT translations compared to neural machine translation and human translation across genres

Author

Listed:
  • Guangyuan Yao
  • Lingxi Fan

Abstract

This study investigates the phenomenon of simplification in Chinese-to-English translation across Human Translation (HT), neural machine translation (NMT), and large language model (LLM)-based translation, ChatGPT as an example. Employing entropy-based metrics (unigram entropy and Part-of-Speech (POS) entropy) to assess lexical and syntactic complexity, the research analyzes translations across three genres: political texts, fiction, and academic. Findings reveal that political and academic texts exhibit lexical simplification, and texts of all genres show a syntactic simplification trend, with the simplified degree varying across translation modes. While genre exerts minimal influence on lexical complexity, it significantly impacts syntactic complexity, with academic texts showing the lowest and fiction the highest complexity levels. Notably, ChatGPT’s translations consistently exhibit greater lexical complexity, as evidenced by higher unigram entropy scores compared to those of Neural Machine Translation. These results challenge the notion of simplification as a universal feature of translation, instead highlighting its probabilistic nature influenced by translation mode and genre. The study underscores the efficacy of entropy-based measures in capturing nuanced differences in translation complexity and advocates for a modal approach to translation studies that accounts for the unique characteristics of various translation methods.

Suggested Citation

  • Guangyuan Yao & Lingxi Fan, 2025. "An entropy-based study of Simplification in ChatGPT translations compared to neural machine translation and human translation across genres," PLOS ONE, Public Library of Science, vol. 20(12), pages 1-16, December.
  • Handle: RePEc:plo:pone00:0339762
    DOI: 10.1371/journal.pone.0339762
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0339762
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0339762&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0339762?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0339762. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.