
A Novel Multiview Topic Model to Compute Correlation of Heterogeneous Data

Author

Listed:
  • Jinsheng Shen

    (Fudan University)

  • Mingmin Chi

    (Fudan University)

Abstract

With the rapid development of Internet technologies and sensing techniques, it has become much easier to acquire data from different sources at different dates and times. However, computing the correlation of such heterogeneous data remains a major challenge for data mining and information retrieval. Here, the features of the data from one source are called a view, and the multiview features describe the same data point. In this paper, the hidden correlation of two-view features is used to construct a Heterogeneous (multiview) Topic Model (HTM). In particular, a probabilistic topic model is used for each view, because generative models usually provide much richer features when handling high-dimensional data such as text. Nevertheless, most existing probabilistic topic models, such as latent Dirichlet allocation, require the form of the probability distribution to be known. To avoid this limitation, the HTM is reduced to a non-negative matrix tri-factorization problem with certain constraints, so that the proposed approach can be used with an arbitrary model.
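
The abstract does not spell out the factorization or its constraints, so the following is only a minimal sketch of generic, unconstrained non-negative matrix tri-factorization, not the authors' HTM. It assumes a non-negative matrix X (hypothetically, a co-occurrence matrix linking the features of two views) is approximated as F S G^T with standard multiplicative updates for the squared Frobenius error. The function name nmtf, the ranks k1 and k2, and the synthetic data are illustrative assumptions.

```python
import numpy as np

def nmtf(X, k1, k2, n_iter=200, eps=1e-9, seed=0):
    """Approximate a non-negative X (m x n) as F @ S @ G.T with F, S, G >= 0.

    Generic multiplicative updates for ||X - F S G^T||_F^2; the extra
    constraints used by HTM are NOT modelled here (assumption).
    """
    rng = np.random.default_rng(seed)
    m, n = X.shape
    F = rng.random((m, k1)) + eps   # factor for the rows (first view)
    S = rng.random((k1, k2)) + eps  # small matrix coupling the two factors
    G = rng.random((n, k2)) + eps   # factor for the columns (second view)

    # Record the reconstruction error before any update, then after each pass.
    errors = [np.linalg.norm(X - F @ S @ G.T)]
    for _ in range(n_iter):
        F *= (X @ G @ S.T) / (F @ S @ G.T @ G @ S.T + eps)
        G *= (X.T @ F @ S) / (G @ S.T @ F.T @ F @ S + eps)
        S *= (F.T @ X @ G) / (F.T @ F @ S @ G.T @ G + eps)
        errors.append(np.linalg.norm(X - F @ S @ G.T))
    return F, S, G, errors

# Synthetic non-negative data with an exact tri-factor structure (illustrative).
data_rng = np.random.default_rng(1)
X = data_rng.random((30, 4)) @ data_rng.random((4, 3)) @ data_rng.random((3, 20))
F, S, G, errors = nmtf(X, k1=4, k2=3)
```

Because every update multiplies by a ratio of non-negative terms, the factors stay non-negative without explicit projection. In this sketch S plays the role of a coupling between the two sets of latent factors; how HTM constrains or interprets it is described only in the full paper.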

Suggested Citation

  • Jinsheng Shen & Mingmin Chi, 2018. "A Novel Multiview Topic Model to Compute Correlation of Heterogeneous Data," Annals of Data Science, Springer, vol. 5(1), pages 9-19, March.
  • Handle: RePEc:spr:aodasc:v:5:y:2018:i:1:d:10.1007_s40745-017-0135-y
    DOI: 10.1007/s40745-017-0135-y

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s40745-017-0135-y
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s40745-017-0135-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    References listed on IDEAS

    1. Daniel D. Lee & H. Sebastian Seung, 1999. "Learning the parts of objects by non-negative matrix factorization," Nature, Nature, vol. 401(6755), pages 788-791, October.
    2. Fama, Eugene F, 1970. "Efficient Capital Markets: A Review of Theory and Empirical Work," Journal of Finance, American Finance Association, vol. 25(2), pages 383-417, May.

    Citations

    Cited by:

    1. Chonghui Guo & Menglin Lu & Wei Wei, 2021. "An Improved LDA Topic Modeling Method Based on Partition for Medium and Long Texts," Annals of Data Science, Springer, vol. 8(2), pages 331-344, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. David M. Ritzwoller & Joseph P. Romano, 2019. "Uncertainty in the Hot Hand Fallacy: Detecting Streaky Alternatives to Random Bernoulli Sequences," Papers 1908.01406, arXiv.org, revised Apr 2021.
    2. Shazia Ghani, 2011. "A re-visit to Minsky after 2007 financial meltdown," Post-Print halshs-01027435, HAL.
    3. Steininger, Lea & Hesse, Casimir, 2024. "Buying into new ideas: The ECB’s evolving justification of unlimited liquidity," Department of Economics Working Paper Series 357, WU Vienna University of Economics and Business.
    4. Christiane Goodfellow & Dirk Schiereck & Steffen Wippler, 2013. "Are behavioural finance equity funds a superior investment? A note on fund performance and market efficiency," Journal of Asset Management, Palgrave Macmillan, vol. 14(2), pages 111-119, April.
    5. Cagli, Efe Caglar & Taskin, Dilvin & Evrim Mandaci, Pınar, 2019. "The short- and long-run efficiency of energy, precious metals, and base metals markets: Evidence from the exponential smooth transition autoregressive models," Energy Economics, Elsevier, vol. 84(C).
    6. Andrew Weinbach & Rodney J. Paul, 2009. "National television coverage and the behavioural bias of bettors: the American college football totals market," International Gambling Studies, Taylor & Francis Journals, vol. 9(1), pages 55-66, April.
    7. Plantinga, Andrew J. & Provencher, Bill, 2001. "Internal Consistency In Models Of Optimal Resource Use Under Uncertainty," 2001 Annual meeting, August 5-8, Chicago, IL 20712, American Agricultural Economics Association (New Name 2008: Agricultural and Applied Economics Association).
    8. Growitsch Christian & Nepal Rabindra & Stronzik Marcus, 2015. "Price Convergence and Information Efficiency in German Natural Gas Markets," German Economic Review, De Gruyter, vol. 16(1), pages 87-103, February.
    9. Oxelheim, Lars & Rafferty, Michael, 2005. "On the static efficiency of secondary bond markets," Journal of Multinational Financial Management, Elsevier, vol. 15(2), pages 117-135, April.
    10. Baoqiang Zhan & Shu Zhang & Helen S. Du & Xiaoguang Yang, 2022. "Exploring Statistical Arbitrage Opportunities Using Machine Learning Strategy," Computational Economics, Springer;Society for Computational Economics, vol. 60(3), pages 861-882, October.
    11. Shi, Huai-Long & Zhou, Wei-Xing, 2022. "Factor volatility spillover and its implications on factor premia," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 80(C).
    12. Gaio, Luiz Eduardo & Stefanelli, Nelson Oliveira & Pimenta, Tabajara & Bonacim, Carlos Alberto Grespan & Gatsios, Rafael Confetti, 2022. "The impact of the Russia-Ukraine conflict on market efficiency: Evidence for the developed stock market," Finance Research Letters, Elsevier, vol. 50(C).
    13. Anastasios Evgenidis & Stephanos Papadamou, 2021. "The impact of unconventional monetary policy in the euro area. Structural and scenario analysis from a Bayesian VAR," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 26(4), pages 5684-5703, October.
    14. Nuruddeen Usman & Kodili Nwanneka & Nduka, 2023. "Announcement Effect of COVID-19 on Cryptocurrencies," Asian Economics Letters, Asia-Pacific Applied Economics Association, vol. 3(3), pages 1-4.
    15. Del Corso, Gianna M. & Romani, Francesco, 2019. "Adaptive nonnegative matrix factorization and measure comparisons for recommender systems," Applied Mathematics and Computation, Elsevier, vol. 354(C), pages 164-179.
    16. Tihana Škrinjarić, 2019. "Time Varying Spillovers between the Online Search Volume and Stock Returns: Case of CESEE Markets," IJFS, MDPI, vol. 7(4), pages 1-30, October.
    17. Carol Alexander & Anca Dimitriu, 2003. "Equity Indexing: Cointegration and Stock Price Dispersion: A Regime Switching Approach to Market Efficiency," ICMA Centre Discussion Papers in Finance icma-dp2003-02, Henley Business School, University of Reading.
    18. P Fogel & C Geissler & P Cotte & G Luta, 2022. "Applying separative non-negative matrix factorization to extra-financial data," Working Papers hal-03689774, HAL.
    19. Robert C. Merton, 2006. "Paul Samuelson and Financial Economics," The American Economist, Sage Publications, vol. 50(2), pages 9-31, October.
    20. Xiao-Bai Li & Jialun Qin, 2017. "Anonymizing and Sharing Medical Text Records," Information Systems Research, INFORMS, vol. 28(2), pages 332-352, June.
