Printed from https://ideas.repec.org/a/eee/phsmap/v598y2022ics0378437122002801.html

Peeking strategy for online news diffusion prediction via machine learning

Author

Listed:
  • Zhang, Yaotian
  • Feng, Mingming
  • Shang, Ke-ke
  • Ran, Yijun
  • Wang, Cheng-Jun

Abstract

For computational social scientists, cascade size prediction and fake news detection are two primary problems in news diffusion or computational communication research. Previous studies predict news diffusion by peeking at the social process (temporal structure) data in the initial stage, an approach summarized as the Peeking strategy. However, the accuracy of the Peeking strategy for cascade size prediction still needs improvement, and its advantages and limitations for fake news detection have not been fully investigated. To predict cascade size and detect fake news, we adopt the Peeking strategy with well-known machine learning algorithms. Our results show that the Peeking strategy can effectively improve the accuracy of cascade size prediction. Moreover, we can peek into a smaller time window and still achieve higher performance in predicting cascade size than previous methods. Nevertheless, we find that the Peeking strategy with network structures fails to significantly improve the performance of fake news detection. Finally, we argue that cascade structure properties can aid in predicting cascade size, but not in detecting fake news.
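
As a rough illustration of what "peeking" at the initial stage of a cascade can mean in practice, the sketch below extracts a few early-window features from a list of reshare events. Everything here is an assumption made for illustration: the event format `(time, parent, child)`, the function name `peek_features`, and the three features are hypothetical and are not taken from the paper, which does not specify its feature set in this abstract. In a full pipeline, features like these would be passed to a machine learning model to predict the final cascade size.

```python
def peek_features(events, window):
    """Summarize the part of a cascade seen within `window` time units.

    `events` is a list of (time, parent, child) reshare records; the root
    post has parent None. This format is a hypothetical choice for the
    sketch, not the data format used in the paper.
    """
    if not events:
        return {"early_size": 0, "early_depth": 0, "mean_delay": 0.0}

    # Order events by time and keep only those inside the peeking window.
    events = sorted(events, key=lambda e: e[0])
    t0 = events[0][0]
    early = [e for e in events if e[0] - t0 <= window]

    # Depth of each node in the observed reshare tree (root has depth 0).
    # A parent not seen inside the window is treated as depth 0 (assumption).
    depth = {}
    for _, parent, child in early:
        depth[child] = 0 if parent is None else depth.get(parent, 0) + 1

    # Delay of each early reshare relative to the first event.
    delays = [t - t0 for t, _, _ in early[1:]]

    return {
        "early_size": len(early),
        "early_depth": max(depth.values()),
        "mean_delay": sum(delays) / len(delays) if delays else 0.0,
    }


# Toy cascade: times are in arbitrary units.
toy_events = [
    (0, None, "a"),
    (5, "a", "b"),
    (8, "b", "c"),
    (30, "a", "d"),
    (90, "c", "e"),
]
short_peek = peek_features(toy_events, window=10)
long_peek = peek_features(toy_events, window=60)
```

A smaller `window` sees fewer events, which is the trade-off the abstract refers to when it reports good cascade size prediction from a shorter peeking window.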

Suggested Citation

  • Zhang, Yaotian & Feng, Mingming & Shang, Ke-ke & Ran, Yijun & Wang, Cheng-Jun, 2022. "Peeking strategy for online news diffusion prediction via machine learning," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 598(C).
  • Handle: RePEc:eee:phsmap:v:598:y:2022:i:c:s0378437122002801
    DOI: 10.1016/j.physa.2022.127357

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437122002801
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on ScienceDirect for a fee of $3,000.

    File URL: https://libkey.io/10.1016/j.physa.2022.127357?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    1. Ke-Ke Shang & Wei-Sheng Yan & Xiao-Ke Xu, 2014. "Limitation of degree information for analyzing the interaction evolution in online social networks," International Journal of Modern Physics C (IJMPC), World Scientific Publishing Co. Pte. Ltd., vol. 25(10), pages 1-10.
    2. Giannone, Domenico & Reichlin, Lucrezia & Small, David, 2008. "Nowcasting: The real-time informational content of macroeconomic data," Journal of Monetary Economics, Elsevier, vol. 55(4), pages 665-676, May.
    3. Sharad Goel & Ashton Anderson & Jake Hofman & Duncan J. Watts, 2016. "The Structural Virality of Online Diffusion," Management Science, INFORMS, vol. 62(1), pages 180-196, January.
    4. Knut Are Aastveit & Karsten R. Gerdrup & Anne Sofie Jore & Leif Anders Thorsrud, 2014. "Nowcasting GDP in Real Time: A Density Combination Approach," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 32(1), pages 48-68, January.
    5. Domenico Giannone & Lucrezia Reichlin & David H. Small, 2005. "Nowcasting GDP and inflation: the real-time informational content of macroeconomic data releases," Finance and Economics Discussion Series 2005-42, Board of Governors of the Federal Reserve System (U.S.).
    6. Wang, Cheng-Jun & Wu, Lingfei, 2016. "The scaling of attention networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 448(C), pages 196-204.
    7. Hyndman, Rob J. & Koehler, Anne B., 2006. "Another look at measures of forecast accuracy," International Journal of Forecasting, Elsevier, vol. 22(4), pages 679-688.
    8. Goldsmith, Jeff & Scheipl, Fabian, 2014. "Estimator selection and combination in scalar-on-function regression," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 362-372.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Aastveit, Knut Are & Jore, Anne Sofie & Ravazzolo, Francesco, 2016. "Identification and real-time forecasting of Norwegian business cycles," International Journal of Forecasting, Elsevier, vol. 32(2), pages 283-292.
    2. Knotek, Edward S. & Zaman, Saeed, 2023. "Real-time density nowcasts of US inflation: A model combination approach," International Journal of Forecasting, Elsevier, vol. 39(4), pages 1736-1760.
    3. Knut Are Aastveit & Francesco Ravazzolo & Herman K. van Dijk, 2018. "Combined Density Nowcasting in an Uncertain Economic Environment," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 36(1), pages 131-145, January.
    4. Claudia Foroni & Massimiliano Marcellino, 2013. "A survey of econometric methods for mixed-frequency data," Working Paper 2013/06, Norges Bank.
    5. Nikoleta Anesti & Ana Beatriz Galvao & Silvia Miranda-Agrippino, 2018. "Uncertain Kingdom: Nowcasting GDP and its Revisions," Discussion Papers 1824, Centre for Macroeconomics (CFM).
    6. Holtemöller, Oliver & Kozyrev, Boris, 2023. "Forecasting Economic Activity with a Neural Network in Uncertain Times: Monte Carlo Evidence and Application to German GDP," VfS Annual Conference 2023 (Regensburg): Growth and the "sociale Frage" 277688, Verein für Socialpolitik / German Economic Association.
    7. Byron Botha & Geordie Reid & Tim Olds & Daan Steenkamp & Rossouw van Jaarsveld, 2021. "Nowcasting South African GDP using a suite of statistical models," Working Papers 11001, South African Reserve Bank.
    8. Stock, J.H. & Watson, M.W., 2016. "Dynamic Factor Models, Factor-Augmented Vector Autoregressions, and Structural Vector Autoregressions in Macroeconomics," Handbook of Macroeconomics, in: J. B. Taylor & Harald Uhlig (ed.), Handbook of Macroeconomics, edition 1, volume 2, chapter 0, pages 415-525, Elsevier.
    9. Jon Ellingsen & Vegard H. Larsen & Leif Anders Thorsrud, 2020. "News Media vs. FRED-MD for Macroeconomic Forecasting," CESifo Working Paper Series 8639, CESifo.
    10. Antonello D’Agostino & Domenico Giannone & Michele Lenza & Michele Modugno, 2016. "Nowcasting Business Cycles: A Bayesian Approach to Dynamic Heterogeneous Factor Models," Advances in Econometrics, in: Dynamic Factor Models, volume 35, pages 569-594, Emerald Group Publishing Limited.
    11. Götz, Thomas B. & Hecq, Alain & Urbain, Jean-Pierre, 2016. "Combining forecasts from successive data vintages: An application to U.S. growth," International Journal of Forecasting, Elsevier, vol. 32(1), pages 61-74.
    12. Knut Are Aastveit & Andrea Carriero & Todd E. Clark & Massimiliano Marcellino, 2017. "Have Standard VARS Remained Stable Since the Crisis?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 32(5), pages 931-951, August.
    13. Bańbura, Marta & Giannone, Domenico & Modugno, Michele & Reichlin, Lucrezia, 2013. "Now-Casting and the Real-Time Data Flow," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 195-237, Elsevier.
    14. Tony Chernis & Taylor Webley, 2022. "Nowcasting Canadian GDP with Density Combinations," Discussion Papers 2022-12, Bank of Canada.
    15. Andrea Carriero & Todd E. Clark & Massimiliano Marcellino, 2020. "Nowcasting Tail Risks to Economic Activity with Many Indicators," Working Papers 20-13R2, Federal Reserve Bank of Cleveland, revised 22 Sep 2020.
    16. Jack Fosten & Daniel Gutknecht, 2021. "Horizon confidence sets," Empirical Economics, Springer, vol. 61(2), pages 667-692, August.
    17. Rusnák, Marek, 2016. "Nowcasting Czech GDP in real time," Economic Modelling, Elsevier, vol. 54(C), pages 26-39.
    18. Henzel, Steffen R. & Wohlrabe, Klaus & Lehmann, Robert, 2015. "Nowcasting Regional GDP: The Case of the Free State of Saxony," Review of Economics, De Gruyter, vol. 66(1), pages 71-98, April.
    19. Soybilgen, Barış & Yazgan, Ege, 2018. "Evaluating nowcasts of bridge equations with advanced combination schemes for the Turkish unemployment rate," Economic Modelling, Elsevier, vol. 72(C), pages 99-108.
    20. Knut Are Aastveit & Claudia Foroni & Francesco Ravazzolo, 2017. "Density Forecasts With Midas Models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 32(4), pages 783-801, June.
