Author
Listed:
- Sugyeong Eo
(Department of Software, Yonsei University Mirae Campus, Wonju 26493, Republic of Korea)
- Chanjun Park
(School of Software, Soongsil University, Seoul 06978, Republic of Korea)
Abstract
With the advent of large language models (LLMs), significant progress has been made in improving the fluency of machine translation (MT). However, hallucination remains a persistent challenge to translation accuracy, making Critical Error Detection (CED) increasingly important. In this paper, we introduce a simple yet effective approach, termed external knowledge-guided tuning, for the CED task. We focus on sentence-level CED, formulated as a binary classification task that determines whether an MT output contains critical errors. Although the task is binary, the data consist of diverse error cases, including issues related to toxicity, safety, named entities, sentiment, and numerical information, which may manifest as hallucination, mistranslation, or deletion. Our approach restructures model inputs in a cloze-style format and incorporates auxiliary descriptions, casting CED within a masked language modeling framework. By integrating additional contextual signals, including demonstration examples and outputs from commercial systems, our method guides the model to acquire task-specific knowledge and compare alternative MT outputs. Experimental results demonstrate the effectiveness of our approach, achieving state-of-the-art (SOTA) performance on the English–Czech language pair and a second-place ranking on English–German. We further provide a comprehensive analysis of the aggregated effects of external knowledge and examine the contribution of each component within the proposed framework. Our proposed method enables the model to internalize task-relevant knowledge through parameter updates within a prompt-based formulation, providing a principled way to incorporate external knowledge into CED and enhancing the model’s ability to identify critical errors in practice.
Suggested Citation
Sugyeong Eo & Chanjun Park, 2026.
"External Knowledge-Guided Tuning for Critical Error Detection in Machine Translation,"
Mathematics, MDPI, vol. 14(9), pages 1-14, April.
Handle:
RePEc:gam:jmathe:v:14:y:2026:i:9:p:1484-:d:1930706
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:14:y:2026:i:9:p:1484-:d:1930706. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.