IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v9y2021i7p726-d525479.html
   My bibliography  Save this article

A Metamorphic Testing Approach for Assessing Question Answering Systems

Author

Listed:
  • Kaiyi Tu

    (School of Information Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China)

  • Mingyue Jiang

    (School of Information Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China)

  • Zuohua Ding

    (School of Information Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China)

Abstract

Question Answering (QA) enables the machine to understand and answer questions posed in natural language, which has emerged as a powerful tool in various domains. However, QA is a challenging task and there is an increasing concern about its quality. In this paper, we propose to apply the technique of metamorphic testing (MT) to evaluate QA systems from the users’ perspectives, in order to help the users to better understand the capabilities of these systems and then to select appropriate QA systems for their specific needs. Two typical categories of QA systems, namely, the textual QA (TQA) and visual QA (VQA), are studied, and a total number of 17 metamorphic relations (MRs) are identified for them. These MRs respectively focus on some characteristics of different aspects of QA. We further apply MT to four QA systems (including two APIs from the AllenNLP platform, one API from the Transformers platform, and one API from CloudCV) by using all of the MRs. Our experimental results demonstrate the capabilities of the four subject QA systems from various aspects, revealing their strengths and weaknesses. These results further suggest that MT can be an effective method for assessing QA systems.

Suggested Citation

  • Kaiyi Tu & Mingyue Jiang & Zuohua Ding, 2021. "A Metamorphic Testing Approach for Assessing Question Answering Systems," Mathematics, MDPI, vol. 9(7), pages 1-15, March.
  • Handle: RePEc:gam:jmathe:v:9:y:2021:i:7:p:726-:d:525479
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/9/7/726/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/9/7/726/
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lingzi Jin & Zuohua Ding & Huihui Zhou, 2022. "Evaluation of Chinese Natural Language Processing System Based on Metamorphic Testing," Mathematics, MDPI, vol. 10(8), pages 1-27, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:9:y:2021:i:7:p:726-:d:525479. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.