IDEAS home Printed from https://ideas.repec.org/a/igg/jdwm00/v13y2017i2p13-44.html
   My bibliography  Save this article

A Two-Dimensional Webpage Classification Model

Author

Listed:
  • Shih-Ting Yang

    (Nanhua University, Chiayi, Taiwan)

  • Chia-Wei Huang

    (Department of Information Management, Nanhua University, Chiayi, Taiwan)

Abstract

Regarding the webpage classification topics, most classification mechanisms may lack of consideration from the webpage article writer's perspective and the display characteristics of the webpage (color, graphic layout). Hence, this paper develops a Two-dimensional Webpage Classification model to analyze the webpage textual information and display characteristics from the perspectives of webpage users and designers. This model is consisted of the Webpage Block Distribution Analysis (WBDA) module, Webpage Emotion Category Determination (WECD) module and Webpage Specialty Category Determination (WSCD) module. Firstly, in WBDA module, the user and designer habits (such as the web browsing movement and the writing perspective of webpage) should be considered by combining with the eye movement tracking and tag-region judgment to determine the critical blocks and information of the webpage. Secondly, in WECD module, the webpage color codes are acquired to calculate the major colors of the webpage, and further determine the emotional category of webpage. Thirdly, the WSCD module analyzes the webpage textual information by integrating the keyword acquisition technology to identify the specialty category of the webpage. After that, the Two-dimensional category of the webpage can be obtained. In addition, this paper develops a web-based system accordingly for case verification to confirm the feasibility of the methodology. The verification results show that firstly for webpage emotion category judgment when 128 webpage files for training are imported into this system, the respondent's emotion evaluation score is increased to above Level 5 and the system recommendation success rate is increased to 75.78%. Secondly, for specialty category determination, when system uses 1010 to 1120 webpage files for training, the system performance can be increased to above 80%. Hence, the developed system has a high-performance level in webpage emotion category and specialty category determination. That is, this paper proposes a methodology of Two-dimensional Webpage Classification to classify the webpage file information contents and the effects on the emotions of the demanders to assist webpage providers in providing webpage suitable for demanders with the generated two-dimensional information of the webpage.

Suggested Citation

  • Shih-Ting Yang & Chia-Wei Huang, 2017. "A Two-Dimensional Webpage Classification Model," International Journal of Data Warehousing and Mining (IJDWM), IGI Global, vol. 13(2), pages 13-44, April.
  • Handle: RePEc:igg:jdwm00:v:13:y:2017:i:2:p:13-44
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJDWM.2017040102
    Download Restriction: no
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jdwm00:v:13:y:2017:i:2:p:13-44. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.