HeBERT and HebEMO: A Hebrew BERT Model and a Tool for Polarity Analysis and Emotion Recognition

My bibliography Save this article

HeBERT and HebEMO: A Hebrew BERT Model and a Tool for Polarity Analysis and Emotion Recognition

Author

Listed:

Avihay Chriqui
(Coller School of Management, Tel Aviv University, Tel Aviv 6997801, Israel)
Inbal Yahav
(Coller School of Management, Tel Aviv University, Tel Aviv 6997801, Israel)

Registered:

Abstract

Sentiment analysis of user-generated content (UGC) can provide valuable information across numerous domains, including marketing, psychology, and public health. Currently, there are very few Hebrew models for natural language processing in general, and for sentiment analysis in particular; indeed, it is not straightforward to develop such models because Hebrew is a morphologically rich language (MRL) with challenging characteristics. Moreover, the only available Hebrew sentiment analysis model, based on a recurrent neural network, was developed for polarity analysis (classifying text as positive, negative, or neutral) and was not used for detection of finer-grained emotions (e.g., anger, fear, or joy). To address these gaps, this paper introduces HeBERT and HebEMO. HeBERT is a transformer-based model for modern Hebrew text, which relies on a BERT (bidirectional encoder representations from transformers) architecture. BERT has been shown to outperform alternative architectures in sentiment analysis and is suggested to be particularly appropriate for MRLs. Analyzing multiple BERT specifications, we find that whereas model complexity correlates with high performance on language tasks that aim to understand terms in a sentence, a more parsimonious model better captures the sentiment of an entire sentence. Notably, regardless of the complexity of the BERT specification, our BERT-based language model outperforms all existing Hebrew alternatives on all language tasks examined. HebEMO is a tool that uses HeBERT to detect polarity and extract emotions from Hebrew UGC. HebEMO is trained on a unique COVID-19-related UGC data set that we collected and annotated for this study. Data collection and annotation followed an active learning procedure that aimed to maximize predictability. We show that HebEMO yields a better performance accuracy for polarity classification. Emotion detection reaches high performance for various target emotions, with the exception of surprise, which the model failed to capture. These results are better than the best reported performance, even among English-language models of emotion detection.

Suggested Citation

Avihay Chriqui & Inbal Yahav, 2022. "HeBERT and HebEMO: A Hebrew BERT Model and a Tool for Polarity Analysis and Emotion Recognition," INFORMS Joural on Data Science, INFORMS, vol. 1(1), pages 81-95, April.

Handle: RePEc:inm:orijds:v:1:y:2022:i:1:p:81-95
DOI: 10.1287/ijds.2022.0016

Download full text from publisher

References listed on IDEAS

Panagiotis Adamopoulos & Anindya Ghose & Vilma Todri, 2018. "The Impact of User Personality Traits on Word of Mouth: Text-Mining Social Media Platforms," Information Systems Research, INFORMS, vol. 29(3), pages 612-640, September.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Kunpeng Zhang & Wendy Moe, 2021. "Measuring Brand Favorability Using Large-Scale Social Media Data," Information Systems Research, INFORMS, vol. 32(4), pages 1128-1139, December.
Uttara Ananthakrishnan & Davide Proserpio & Siddhartha Sharma, 2023. "I Hear You: Does Quality Improve with Customer Voice?," Marketing Science, INFORMS, vol. 42(6), pages 1143-1161, November.
Tamilla Triantoro & Ram Gopal & Raquel Benbunan-Fich & Guido Lang, 0. "Personality and games: enhancing online surveys through gamification," Information Technology and Management, Springer, vol. 0, pages 1-10.
Kai Yang & Raymond Y. K. Lau & Ahmed Abbasi, 2023. "Getting Personal: A Deep Learning Artifact for Text-Based Measurement of Personality," Information Systems Research, INFORMS, vol. 34(1), pages 194-222, March.
Tianyi Li & Xiaoquan (Michael) Zhang, 2024. "Development Trajectory of Blockchain Platforms: The Role of Multirole," Information Systems Research, INFORMS, vol. 35(3), pages 1296-1323, September.
Alper Yayla & Ersin Dincelli & Srikanth Parameswaran, 2024. "A Mining Town in a Digital Land: Browser-Based Cryptocurrency Mining as an Alternative to Online Advertising," Information Systems Frontiers, Springer, vol. 26(2), pages 609-631, April.
Tamilla Triantoro & Ram Gopal & Raquel Benbunan-Fich & Guido Lang, 2020. "Personality and games: enhancing online surveys through gamification," Information Technology and Management, Springer, vol. 21(3), pages 169-178, September.
Hyelim Oh & Khim-Yong Goh & Tuan Q. Phan, 2023. "Are You What You Tweet? The Impact of Sentiment on Digital News Consumption and Social Media Sharing," Information Systems Research, INFORMS, vol. 34(1), pages 111-136, March.
Grewal, Dhruv & Herhausen, Dennis & Ludwig, Stephan & Villarroel Ordenes, Francisco, 2022. "The Future of Digital Communication Research: Considering Dynamics and Multimodality," Journal of Retailing, Elsevier, vol. 98(2), pages 224-240.
Zhang, Zhiyun & Zhang, Ziqiong & Liu, Sen & Zhang, Zili, 2024. "Are high-status reviewers more likely to seek anonymity? Evidence from an online review platform," Journal of Retailing and Consumer Services, Elsevier, vol. 78(C).
Amir Manzoor, 2024. "Antecedents to Following Brands on Facebook and Instagram with Moderating Role of User Experience," FIIB Business Review, , vol. 13(5), pages 586-599, October.
Anandasivam Gopal & Pei-yu Chen & Wonseok Oh & Sean Xin Xu & Suprateek Sarker, 2024. "On Crafting Effective Theoretical Contributions for Empirical Papers in Economics of Information Systems: Some Editorial Reflections," Information Systems Research, INFORMS, vol. 35(3), pages 917-935, September.
Argyris, Young Anna & Muqaddam, Aziz & Miller, Steven, 2021. "The effects of the visual presentation of an Influencer's Extroversion on perceived credibility and purchase intentionsâ€”moderated by personality matching with the audience," Journal of Retailing and Consumer Services, Elsevier, vol. 59(C).
Xiaoning Wang & Yakov Bart & Serguei Netessine & Lynn Wu, 2025. "Impact of Multi-Platform Social Media Strategy on Sales in E-Commerce," Papers 2503.09083, arXiv.org.
Siqing Shan & Qi Yan & Yigang Wei, 2020. "Infectious or Recovered? Optimizing the Infectious Disease Detection Process for Epidemic Control and Prevention Based on Social Media," IJERPH, MDPI, vol. 17(18), pages 1-25, September.
Chenshuo Sun & Panagiotis Adamopoulos & Anindya Ghose & Xueming Luo, 2022. "Predicting Stages in Omnichannel Path to Purchase: A Deep Learning Model," Information Systems Research, INFORMS, vol. 33(2), pages 429-445, June.
Kraus, Mathias & Feuerriegel, Stefan & Oztekin, Asil, 2020. "Deep learning in business analytics and operations research: Models, applications and managerial implications," European Journal of Operational Research, Elsevier, vol. 281(3), pages 628-641.
Xiaoxi Zhu & Changhui Yang & Kai Liu & Rui Zhang & Qingquan Jiang, 2022. "Cooperation and decision making in a two-sided market motivated by the externality of a third-party social media platform," Annals of Operations Research, Springer, vol. 316(1), pages 117-142, September.
Vilma Todri, 2022. "Frontiers: The Impact of Ad-Blockers on Online Consumer Behavior," Marketing Science, INFORMS, vol. 41(1), pages 7-18, January.
Konstantin Bauman & Alexander Tuzhilin, 2022. "Know Thy Context: Parsing Contextual Information from User Reviews for Recommendation Purposes," Information Systems Research, INFORMS, vol. 33(1), pages 179-202, March.

More about this item

Keywords

; ; ; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:orijds:v:1:y:2022:i:1:p:81-95. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

HeBERT and HebEMO: A Hebrew BERT Model and a Tool for Polarity Analysis and Emotion Recognition

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data