IDEAS home Printed from https://ideas.repec.org/a/bla/jamist/v64y2013i7p1399-1410.html
   My bibliography  Save this article

On predicting the popularity of newly emerging hashtags in Twitter

Author

Listed:
  • Zongyang Ma
  • Aixin Sun
  • Gao Cong

Abstract

Because of Twitter's popularity and the viral nature of information dissemination on Twitter, predicting which Twitter topics will become popular in the near future becomes a task of considerable economic importance. Many Twitter topics are annotated by hashtags. In this article, we propose methods to predict the popularity of new hashtags on Twitter by formulating the problem as a classification task. We use five standard classification models (i.e., Naïve bayes, k‐nearest neighbors, decision trees, support vector machines, and logistic regression) for prediction. The main challenge is the identification of effective features for describing new hashtags. We extract 7 content features from a hashtag string and the collection of tweets containing the hashtag and 11 contextual features from the social graph formed by users who have adopted the hashtag. We conducted experiments on a Twitter data set consisting of 31 million tweets from 2 million Singapore‐based users. The experimental results show that the standard classifiers using the extracted features significantly outperform the baseline methods that do not use these features. Among the five classifiers, the logistic regression model performs the best in terms of the Micro‐F1 measure. We also observe that contextual features are more effective than content features.

Suggested Citation

  • Zongyang Ma & Aixin Sun & Gao Cong, 2013. "On predicting the popularity of newly emerging hashtags in Twitter," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 64(7), pages 1399-1410, July.
  • Handle: RePEc:bla:jamist:v:64:y:2013:i:7:p:1399-1410
    DOI: 10.1002/asi.22844
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/asi.22844
    Download Restriction: no

    File URL: https://libkey.io/10.1002/asi.22844?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Cui, Hao & Kertész, János, 2023. "“Born in Rome” or “Sleeping Beauty”: Emergence of hashtag popularity on the Chinese microblog Sina Weibo," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 619(C).
    2. Wai Hong Tan & Feng Chen, 2021. "Predicting the popularity of tweets using internal and external knowledge: an empirical Bayes type approach," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 105(2), pages 335-352, June.
    3. Ali Daud & Muhammad Ahmad & M. S. I. Malik & Dunren Che, 2015. "Using machine learning techniques for rising star prediction in co-author network," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(2), pages 1687-1711, February.
    4. Sharad Goel & Ashton Anderson & Jake Hofman & Duncan J. Watts, 2016. "The Structural Virality of Online Diffusion," Management Science, INFORMS, vol. 62(1), pages 180-196, January.
    5. Jaebong Son & Jintae Lee & Kai R. Larsen & Jiyoung Woo, 2020. "Understanding the uncertainty of disaster tweets and its effect on retweeting: The perspectives of uncertainty reduction theory and information entropy," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 71(10), pages 1145-1161, October.
    6. António Fonseca & Jorge Louçã, 2018. "Explaining the emergence of online popularity through a model of information diffusion," Computational and Mathematical Organization Theory, Springer, vol. 24(2), pages 169-187, June.
    7. Paige Brown Jarreau & Imogene A Cancellare & Becky J Carmichael & Lance Porter & Daniel Toker & Samantha Z Yammine, 2019. "Using selfies to challenge public stereotypes of scientists," PLOS ONE, Public Library of Science, vol. 14(5), pages 1-23, May.
    8. Son, Jaebong & Lee, Hyung Koo & Jin, Sung & Lee, Jintae, 2019. "Content features of tweets for effective communication during disasters: A media synchronicity theory perspective," International Journal of Information Management, Elsevier, vol. 45(C), pages 56-68.
    9. Jabłońska-Sabuka, Matylda & Sitarz, Robert & Kraslawski, Andrzej, 2014. "Forecasting research trends using population dynamics model with Burgers’ type interaction," Journal of Informetrics, Elsevier, vol. 8(1), pages 111-122.
    10. Zhao, Qihang & Feng, Xiaodong, 2022. "Utilizing citation network structure to predict paper citation counts: A Deep learning approach," Journal of Informetrics, Elsevier, vol. 16(1).
    11. Arora, Anuja & Bansal, Shivam & Kandpal, Chandrashekhar & Aswani, Reema & Dwivedi, Yogesh, 2019. "Measuring social media influencer index- insights from facebook, Twitter and Instagram," Journal of Retailing and Consumer Services, Elsevier, vol. 49(C), pages 86-101.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jamist:v:64:y:2013:i:7:p:1399-1410. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.asis.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.