Author
Listed:
- Ziqi Liu
(School of Mathematical Sciences, Beihang University, Beijing 100191, China
Key Laboratory of Mathematics, Informatics and Behavioral Semantics and State, Key Laboratory of Software Development Environment, Beihang University, Beijing 100191, China)
- Ziqiao Yin
(Key Laboratory of Mathematics, Informatics and Behavioral Semantics and State, Key Laboratory of Software Development Environment, Beihang University, Beijing 100191, China
Beijing Advanced Innovation Center for Future Blockchain and Privacy Computing, Institute of Artificial Intelligence, Beihang University, Beijing 100191, China
Zhongguancun Laboratory, Beijing 100094, China
Hangzhou International Innovation Institute of Beihang University, Hangzhou 311115, China)
- Zhilong Mi
(Key Laboratory of Mathematics, Informatics and Behavioral Semantics and State, Key Laboratory of Software Development Environment, Beihang University, Beijing 100191, China
Beijing Advanced Innovation Center for Future Blockchain and Privacy Computing, Institute of Artificial Intelligence, Beihang University, Beijing 100191, China)
- Binghui Guo
(Key Laboratory of Mathematics, Informatics and Behavioral Semantics and State, Key Laboratory of Software Development Environment, Beihang University, Beijing 100191, China
Beijing Advanced Innovation Center for Future Blockchain and Privacy Computing, Institute of Artificial Intelligence, Beihang University, Beijing 100191, China
Zhongguancun Laboratory, Beijing 100094, China)
- Zhiming Zheng
(Key Laboratory of Mathematics, Informatics and Behavioral Semantics and State, Key Laboratory of Software Development Environment, Beihang University, Beijing 100191, China
Beijing Advanced Innovation Center for Future Blockchain and Privacy Computing, Institute of Artificial Intelligence, Beihang University, Beijing 100191, China
Zhongguancun Laboratory, Beijing 100094, China)
Abstract
Multisource and multimodal data fusion plays a pivotal role in large-scale artificial intelligence applications involving big data. However, the choice of fusion strategies for different scenarios is often based on experimental comparisons, which leads to increased computational costs during model training and suboptimal performance during testing. In this paper, we present a theoretical analysis of early fusion, late fusion, and gradual fusion methods. We derive equivalence conditions between early and late fusions within the framework of generalized linear models. Moreover, we analyze the failure conditions of early fusion in the presence of nonlinear feature-label relationships. Furthermore, we propose an approximate equation for evaluating the accuracy of early and late fusion methods as a function of sample size, feature quantity, and modality number. We also propose a critical sample size threshold at which the performance dominance of early fusion and late fusion models undergoes a reversal. Finally, we introduce a fusion method selection paradigm for selecting the most appropriate fusion method prior to task execution and demonstrate its effectiveness through extensive numerical experiments. Our theoretical framework is expected to solve the problems of computational and resource costs in model construction, improving the scalability and efficiency of data fusion methods.
Suggested Citation
Ziqi Liu & Ziqiao Yin & Zhilong Mi & Binghui Guo & Zhiming Zheng, 2025.
"A Comparative Analysis of Three Data Fusion Methods and Construction of the Fusion Method Selection Paradigm,"
Mathematics, MDPI, vol. 13(8), pages 1-21, April.
Handle:
RePEc:gam:jmathe:v:13:y:2025:i:8:p:1218-:d:1630219
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:13:y:2025:i:8:p:1218-:d:1630219. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.