IDEAS home Printed from https://ideas.repec.org/a/inm/orijds/v1y2022i1p81-95.html

HeBERT and HebEMO: A Hebrew BERT Model and a Tool for Polarity Analysis and Emotion Recognition

Author

Listed:
  • Avihay Chriqui

    (Coller School of Management, Tel Aviv University, Tel Aviv 6997801, Israel)

  • Inbal Yahav

    (Coller School of Management, Tel Aviv University, Tel Aviv 6997801, Israel)

Abstract

Sentiment analysis of user-generated content (UGC) can provide valuable information across numerous domains, including marketing, psychology, and public health. Currently, there are very few Hebrew models for natural language processing in general, and for sentiment analysis in particular; indeed, it is not straightforward to develop such models because Hebrew is a morphologically rich language (MRL) with challenging characteristics. Moreover, the only available Hebrew sentiment analysis model, based on a recurrent neural network, was developed for polarity analysis (classifying text as positive, negative, or neutral) and was not used for detection of finer-grained emotions (e.g., anger, fear, or joy). To address these gaps, this paper introduces HeBERT and HebEMO. HeBERT is a transformer-based model for modern Hebrew text, which relies on a BERT (bidirectional encoder representations from transformers) architecture. BERT has been shown to outperform alternative architectures in sentiment analysis and is suggested to be particularly appropriate for MRLs. Analyzing multiple BERT specifications, we find that whereas model complexity correlates with high performance on language tasks that aim to understand terms in a sentence, a more parsimonious model better captures the sentiment of an entire sentence. Notably, regardless of the complexity of the BERT specification, our BERT-based language model outperforms all existing Hebrew alternatives on all language tasks examined. HebEMO is a tool that uses HeBERT to detect polarity and extract emotions from Hebrew UGC. HebEMO is trained on a unique COVID-19-related UGC data set that we collected and annotated for this study. Data collection and annotation followed an active learning procedure that aimed to maximize predictability. We show that HebEMO yields a better performance accuracy for polarity classification. Emotion detection reaches high performance for various target emotions, with the exception of surprise, which the model failed to capture. These results are better than the best reported performance, even among English-language models of emotion detection.

Suggested Citation

  • Avihay Chriqui & Inbal Yahav, 2022. "HeBERT and HebEMO: A Hebrew BERT Model and a Tool for Polarity Analysis and Emotion Recognition," INFORMS Joural on Data Science, INFORMS, vol. 1(1), pages 81-95, April.
  • Handle: RePEc:inm:orijds:v:1:y:2022:i:1:p:81-95
    DOI: 10.1287/ijds.2022.0016
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/ijds.2022.0016
    Download Restriction: no

    File URL: https://libkey.io/10.1287/ijds.2022.0016?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Panagiotis Adamopoulos & Anindya Ghose & Vilma Todri, 2018. "The Impact of User Personality Traits on Word of Mouth: Text-Mining Social Media Platforms," Information Systems Research, INFORMS, vol. 29(3), pages 612-640, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Kononov, Natalia & Linzen, Itai, 2026. "Consumer attitudes toward brand supportive communication during geopolitical crises," Journal of Business Research, Elsevier, vol. 203(C).
    2. Laura Panza & Yanos Zylberberg, 2025. "Nation-building and mass migration: Evidence from Mandatory Palestine," Bristol Economics Discussion Papers 25/796, School of Economics, University of Bristol, UK.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kunpeng Zhang & Wendy Moe, 2021. "Measuring Brand Favorability Using Large-Scale Social Media Data," Information Systems Research, INFORMS, vol. 32(4), pages 1128-1139, December.
    2. Warut Khern-am-nuai & Hyunji So & Maxime C. Cohen & Yossiri Adulyasak, 2024. "Selecting Cover Images for Restaurant Reviews: AI vs. Wisdom of the Crowd," Manufacturing & Service Operations Management, INFORMS, vol. 26(1), pages 330-349, January.
    3. David Bendig & Antonio Bräunche, 2025. "The role of artificial intelligence algorithms in information systems research: a conceptual overview and avenues for research," Management Review Quarterly, Springer, vol. 75(4), pages 2863-2908, December.
    4. Uttara Ananthakrishnan & Davide Proserpio & Siddhartha Sharma, 2023. "I Hear You: Does Quality Improve with Customer Voice?," Marketing Science, INFORMS, vol. 42(6), pages 1143-1161, November.
    5. Scott Schanke & Gordon Burtch & Gautam Ray, 2026. "Digital Lyrebirds: Experimental Evidence That Voice-Based Deep Fakes Influence Trust," Management Science, INFORMS, vol. 72(1), pages 386-405, January.
    6. Tamilla Triantoro & Ram Gopal & Raquel Benbunan-Fich & Guido Lang, 0. "Personality and games: enhancing online surveys through gamification," Information Technology and Management, Springer, vol. 0, pages 1-10.
    7. Kai Yang & Raymond Y. K. Lau & Ahmed Abbasi, 2023. "Getting Personal: A Deep Learning Artifact for Text-Based Measurement of Personality," Information Systems Research, INFORMS, vol. 34(1), pages 194-222, March.
    8. Tianyi Li & Xiaoquan (Michael) Zhang, 2024. "Development Trajectory of Blockchain Platforms: The Role of Multirole," Information Systems Research, INFORMS, vol. 35(3), pages 1296-1323, September.
    9. Alper Yayla & Ersin Dincelli & Srikanth Parameswaran, 2024. "A Mining Town in a Digital Land: Browser-Based Cryptocurrency Mining as an Alternative to Online Advertising," Information Systems Frontiers, Springer, vol. 26(2), pages 609-631, April.
    10. Wen Wang & Beibei Li, 2025. "Learning Personalized Privacy Preference from Public Data," Information Systems Research, INFORMS, vol. 36(2), pages 761-780, June.
    11. Rui Zhang & Menglin Cheng & Yifan Wang & Mei Xue, 2025. "Opportunity or nonsense? Examining the role of CEOs’ social media usage in raising funds," Humanities and Social Sciences Communications, Palgrave Macmillan, vol. 12(1), pages 1-12, December.
    12. Tamilla Triantoro & Ram Gopal & Raquel Benbunan-Fich & Guido Lang, 2020. "Personality and games: enhancing online surveys through gamification," Information Technology and Management, Springer, vol. 21(3), pages 169-178, September.
    13. Hongchao Zhang & Kunlu Zhou & Hang Chen & Qinghua Feng, 2026. "Does strategic return promote word-of-mouth recommendation? An empirical study based on consumption experience theory," Electronic Commerce Research, Springer, vol. 26(1), pages 275-296, February.
    14. Shankar, Shardul & Tewari, Vijayshri, 2025. "Investigating leadership discussion on social media: A NLP and machine learning perspective," Technology in Society, Elsevier, vol. 82(C).
    15. Hyelim Oh & Khim-Yong Goh & Tuan Q. Phan, 2023. "Are You What You Tweet? The Impact of Sentiment on Digital News Consumption and Social Media Sharing," Information Systems Research, INFORMS, vol. 34(1), pages 111-136, March.
    16. Grewal, Dhruv & Herhausen, Dennis & Ludwig, Stephan & Villarroel Ordenes, Francisco, 2022. "The Future of Digital Communication Research: Considering Dynamics and Multimodality," Journal of Retailing, Elsevier, vol. 98(2), pages 224-240.
    17. Zhang, Zhiyun & Zhang, Ziqiong & Liu, Sen & Zhang, Zili, 2024. "Are high-status reviewers more likely to seek anonymity? Evidence from an online review platform," Journal of Retailing and Consumer Services, Elsevier, vol. 78(C).
    18. Amir Manzoor, 2024. "Antecedents to Following Brands on Facebook and Instagram with Moderating Role of User Experience," FIIB Business Review, , vol. 13(5), pages 586-599, October.
    19. Anandasivam Gopal & Pei-yu Chen & Wonseok Oh & Sean Xin Xu & Suprateek Sarker, 2024. "On Crafting Effective Theoretical Contributions for Empirical Papers in Economics of Information Systems: Some Editorial Reflections," Information Systems Research, INFORMS, vol. 35(3), pages 917-935, September.
    20. Argyris, Young Anna & Muqaddam, Aziz & Miller, Steven, 2021. "The effects of the visual presentation of an Influencer's Extroversion on perceived credibility and purchase intentions—moderated by personality matching with the audience," Journal of Retailing and Consumer Services, Elsevier, vol. 59(C).

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orijds:v:1:y:2022:i:1:p:81-95. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.