IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0217591.html
   My bibliography  Save this article

Sentimental text mining based on an additional features method for text classification

Author

Listed:
  • Ching-Hsue Cheng
  • Hsien-Hsiu Chen

Abstract

Owing to the emergence of the Internet and its rapid growth, people can use mobile devices on many social media platforms (blogs, Facebook forums, etc.), and the platforms provide well-known websites for people to express and share their daily activities and ideas on global issues. Many consumers utilize product review websites before making a purchase. Many well-known websites are searched for relevant product reviews and experiences of product use. We can easily collect large amounts of structured and unstructured product data and further analyze the data to determine the desired product information. For this reason, many researchers are gradually focusing on sentiment analysis or opinion exploration (opinion mining) and use this technique to extract and analyze customer opinions and emotions. This paper proposes a sentimental text mining method based on an additional features method to enhance accuracy and reduce implementation time and uses singular value decomposition and principal component analysis for data dimension reduction. This study has four contributions: (1) the proposed algorithm for preprocessing the data for sentiment classification, (2) the additional features to enhance the accuracy of the sentiment classification, (3) the application of singular value decomposition and principal component analysis for data dimension reduction, and (4) the design of five modules based on different features, with or without stemming, to compare the performance results. The experimental results show that the proposed method has better accuracy than other methods and that the proposed method can decrease the implementation time.

Suggested Citation

  • Ching-Hsue Cheng & Hsien-Hsiu Chen, 2019. "Sentimental text mining based on an additional features method for text classification," PLOS ONE, Public Library of Science, vol. 14(6), pages 1-17, June.
  • Handle: RePEc:plo:pone00:0217591
    DOI: 10.1371/journal.pone.0217591
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0217591
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0217591&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0217591?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Nikolay Archak & Anindya Ghose & Panagiotis G. Ipeirotis, 2011. "Deriving the Pricing Power of Product Features by Mining Consumer Reviews," Management Science, INFORMS, vol. 57(8), pages 1485-1509, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Haoran Zhu & Lei Lei, 2022. "The Research Trends of Text Classification Studies (2000–2020): A Bibliometric Analysis," SAGE Open, , vol. 12(2), pages 21582440221, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Baxendale, Shane & Macdonald, Emma K. & Wilson, Hugh N., 2015. "The Impact of Different Touchpoints on Brand Consideration," Journal of Retailing, Elsevier, vol. 91(2), pages 235-253.
    2. Wu, Xingli & Liao, Huchang, 2021. "Modeling personalized cognition of customers in online shopping," Omega, Elsevier, vol. 104(C).
    3. Jiyeon Hong & Paul R. Hoban, 2022. "Writing More Compelling Creative Appeals: A Deep Learning-Based Approach," Marketing Science, INFORMS, vol. 41(5), pages 941-965, September.
    4. Gökçe Esenduran & James A. Hill & In Joon Noh, 2020. "Understanding the Choice of Online Resale Channel for Used Electronics," Production and Operations Management, Production and Operations Management Society, vol. 29(5), pages 1188-1211, May.
    5. Schneider, Matthew J. & Gupta, Sachin, 2016. "Forecasting sales of new and existing products using consumer reviews: A random projections approach," International Journal of Forecasting, Elsevier, vol. 32(2), pages 243-256.
    6. Yufei Zhang & Clay M. Voorhees & G. Tomas M. Hult, 2024. "Dynamic interplays between online reviews and marketing promotions," Journal of the Academy of Marketing Science, Springer, vol. 52(6), pages 1820-1841, November.
    7. Borchert, Philipp & Coussement, Kristof & De Weerdt, Jochen & De Caigny, Arno, 2024. "Industry-sensitive language modeling for business," European Journal of Operational Research, Elsevier, vol. 315(2), pages 691-702.
    8. Younghoon Lee, 2022. "Identifying Competitive Attributes Based on an Ensemble of Explainable Artificial Intelligence," Business & Information Systems Engineering: The International Journal of WIRTSCHAFTSINFORMATIK, Springer;Gesellschaft für Informatik e.V. (GI), vol. 64(4), pages 407-419, August.
    9. Jun Hwan Kim & Hyun Cheol Lee, 2019. "Understanding the Repurchase Intention of Premium Economy Passengers Using an Extended Theory of Planned Behavior," Sustainability, MDPI, vol. 11(11), pages 1-19, June.
    10. Agnieszka Zablocki & Bodo Schlegelmilch & Michael J. Houston, 2019. "How valence, volume and variance of online reviews influence brand attitudes," AMS Review, Springer;Academy of Marketing Science, vol. 9(1), pages 61-77, June.
    11. Bin Guo & Shasha Zhou, 2017. "What makes population perception of review helpfulness: an information processing perspective," Electronic Commerce Research, Springer, vol. 17(4), pages 585-608, December.
    12. Laura Toschi & Elisa Ughetto & Andrea Fronzetti Colladon, 2023. "The identity of social impact venture capitalists: exploring social linguistic positioning and linguistic distinctiveness through text mining," Small Business Economics, Springer, vol. 60(3), pages 1249-1280, March.
    13. Yanni Ping & Chelsey Hill & Yun Zhu & Jorge Fresneda, 2023. "Antecedents and consequences of the key opinion leader status: an econometric and machine learning approach," Electronic Commerce Research, Springer, vol. 23(3), pages 1459-1484, September.
    14. Zhuolan Bao & Wenwen Li & Pengzhen Yin & Michael Chau, 2021. "Examining the impact of review tag function on product evaluation and information perception of popular products," Information Systems and e-Business Management, Springer, vol. 19(2), pages 517-539, June.
    15. Tunç, Murat & Cavusoglu, Huseyin & Raghunathan, Srinivasan, 2021. "Online product reviews : Is a finer-grained rating scheme superior to a coarser one?," Other publications TiSEM ec57cbf3-7415-4427-aafc-6, Tilburg University, School of Economics and Management.
    16. Dominik Gutt, 2018. "In the Eye of the Beholder? Empirically Decomposing Different Economic Implications of the Online Rating Variance," Working Papers Dissertations 40, Paderborn University, Faculty of Business Administration and Economics.
    17. Marc R. Dotson & Joachim Büschken & Greg M. Allenby, 2020. "Explaining Preference Heterogeneity with Mixed Membership Modeling," Marketing Science, INFORMS, vol. 39(2), pages 407-426, March.
    18. Alasdair Reid, 2023. "Closing the Affordable Housing Gap: Identifying the Barriers Hindering the Sustainable Design and Construction of Affordable Homes," Sustainability, MDPI, vol. 15(11), pages 1-27, May.
    19. Mengyue Wang & Xin Li & Patrick Y. K. Chau, 2021. "Leveraging Image-Processing Techniques for Empirical Research: Feasibility and Reliability in Online Shopping Context," Information Systems Frontiers, Springer, vol. 23(3), pages 607-626, June.
    20. Christof Naumzik & Stefan Feuerriegel & Markus Weinmann, 2022. "I Will Survive: Predicting Business Failures from Customer Ratings," Marketing Science, INFORMS, vol. 41(1), pages 188-207, January.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0217591. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.