IDEAS home Printed from https://ideas.repec.org/a/hin/complx/5937274.html
   My bibliography  Save this article

Two-Phase Incremental Kernel PCA for Learning Massive or Online Datasets

Author

Listed:
  • Feng Zhao
  • Islem Rekik
  • Seong-Whan Lee
  • Jing Liu
  • Junying Zhang
  • Dinggang Shen

Abstract

As a powerful nonlinear feature extractor, kernel principal component analysis (KPCA) has been widely adopted in many machine learning applications. However, KPCA is usually performed in a batch mode, leading to some potential problems when handling massive or online datasets. To overcome this drawback of KPCA, in this paper, we propose a two-phase incremental KPCA (TP-IKPCA) algorithm which can incorporate data into KPCA in an incremental fashion. In the first phase, an incremental algorithm is developed to explicitly express the data in the kernel space. In the second phase, we extend an incremental principal component analysis (IPCA) to estimate the kernel principal components. Extensive experimental results on both synthesized and real datasets showed that the proposed TP-IKPCA produces similar principal components as conventional batch-based KPCA but is computationally faster than KPCA and its several incremental variants. Therefore, our algorithm can be applied to massive or online datasets where the batch method is not available.

Suggested Citation

  • Feng Zhao & Islem Rekik & Seong-Whan Lee & Jing Liu & Junying Zhang & Dinggang Shen, 2019. "Two-Phase Incremental Kernel PCA for Learning Massive or Online Datasets," Complexity, Hindawi, vol. 2019, pages 1-17, February.
  • Handle: RePEc:hin:complx:5937274
    DOI: 10.1155/2019/5937274
    as

    Download full text from publisher

    File URL: http://downloads.hindawi.com/journals/8503/2019/5937274.pdf
    Download Restriction: no

    File URL: http://downloads.hindawi.com/journals/8503/2019/5937274.xml
    Download Restriction: no

    File URL: https://libkey.io/10.1155/2019/5937274?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Nicole, Sandro, 2000. "Feedforward neural networks for principal components extraction," Computational Statistics & Data Analysis, Elsevier, vol. 33(4), pages 425-437, June.
    2. Woojin Soh & Heeyoung Kim & Bong-Jin Yum, 2018. "Application of kernel principal component analysis to multi-characteristic parameter design problems," Annals of Operations Research, Springer, vol. 263(1), pages 69-91, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Banguero, Edison & Correcher, Antonio & Pérez-Navarro, Ángel & García, Emilio & Aristizabal, Andrés, 2020. "Diagnosis of a battery energy storage system based on principal component analysis," Renewable Energy, Elsevier, vol. 146(C), pages 2438-2449.
    2. Ma, Mina & Li, Xiaoyu & Gao, Wei & Sun, Jinhua & Wang, Qingsong & Mi, Chris, 2022. "Multi-fault diagnosis for series-connected lithium-ion battery pack with reconstruction-based contribution based on parallel PCA-KPCA," Applied Energy, Elsevier, vol. 324(C).
    3. Guangqi Liang & Dongxiao Niu & Yi Liang, 2020. "Core Competitiveness Evaluation of Clean Energy Incubators Based on Matter-Element Extension Combined with TOPSIS and KPCA-NSGA-II-LSSVM," Sustainability, MDPI, vol. 12(22), pages 1-26, November.
    4. Gaudart, Jean & Giusiano, Bernard & Huiart, Laetitia, 2004. "Comparison of the performance of multi-layer perceptron and linear regression for epidemiological data," Computational Statistics & Data Analysis, Elsevier, vol. 44(4), pages 547-570, January.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hin:complx:5937274. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Mohamed Abdelhakeem (email available below). General contact details of provider: https://www.hindawi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.