IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v5y2020i1p21-d329269.html
   My bibliography  Save this article

Processing on Structural Data Faultage in Data Fusion

Author

Listed:
  • Fan Chen

    (School of Computer Engineering and Science, Shanghai University, Shangda Road 99, Shanghai 200444, China)

  • Ruoqi Hu

    (XianDa College of Economics and Humanities, Shanghai International Studies University, East Tiyuhui Road 390, Shanghai 200083, China)

  • Jiaoxiong Xia

    (School of Computer Engineering and Science, Shanghai University, Shangda Road 99, Shanghai 200444, China
    XianDa College of Economics and Humanities, Shanghai International Studies University, East Tiyuhui Road 390, Shanghai 200083, China
    Information Centre, Shanghai Municipal Education Commission, Dagu Road 100, Shanghai 200003, China)

  • Jie Tao

    (XianDa College of Economics and Humanities, Shanghai International Studies University, East Tiyuhui Road 390, Shanghai 200083, China)

Abstract

With the rapid development of information technology, the development of information management system leads to the generation of heterogeneous data. The process of data fusion will inevitably lead to such problems as missing data, data conflict, data inconsistency and so on. We provide a new perspective that combines the theory in geology to conclude such kind of data errors as structural data faultage. Structural data faultages after data integration often lead to inconsistent data resources and inaccurate data information. In order to solve such problems, this article starts from the attributes of data. We come up with a new solution to process structural data faultages based on attribute similarity. We use the relation of similarity to define three new operations: Attribute cementation, Attribute addition, and Isomorphous homonuclear. Isomorphous homonuclear uses digraph to combine attributes. These three operations are mainly used to handle multiple data errors caused by data faultages, so that the redundancy of data can be reduced, and the consistency of data after integration can be ensured. Finally, it can eliminate the structural data faultage in data fusion. The experiment uses the data of doctoral dissertation in Shanghai University. Three types of dissertation data tables are fused. In addition, the structural data faultages after fusion are processed by the new method proposed by us. Through the statistical analysis of the experiment results and compare with the existing algorithm, we verify the validity and accuracy of this method to process structural data faultages.

Suggested Citation

  • Fan Chen & Ruoqi Hu & Jiaoxiong Xia & Jie Tao, 2020. "Processing on Structural Data Faultage in Data Fusion," Data, MDPI, vol. 5(1), pages 1-22, March.
  • Handle: RePEc:gam:jdataj:v:5:y:2020:i:1:p:21-:d:329269
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/5/1/21/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/5/1/21/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:5:y:2020:i:1:p:21-:d:329269. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.