Semantic similarity measurement using historical google search patterns

Author

Listed:
  • Jorge Martinez-Gil

    (University of Malaga)

  • José F. Aldana-Montes

    (University of Malaga)

Abstract

Computing the semantic similarity between terms (or short text expressions) that have the same meaning but are not lexicographically similar is an important challenge in the information integration field. The problem is that techniques for textual semantic similarity measurement often fail to deal with words not covered by synonym dictionaries. In this paper, we try to solve this problem by determining the semantic similarity of terms using the knowledge inherent in the search history logs of the Google search engine. To do this, we have designed and evaluated four algorithmic methods for measuring the semantic similarity between terms using their associated historical search patterns. These methods are: a) frequent co-occurrence of terms in search patterns, b) computation of the relationship between search patterns, c) outlier coincidence in search patterns, and d) forecasting comparisons. We have shown experimentally that some of these methods correlate well with human judgment when evaluated on general-purpose benchmark datasets, and significantly outperform existing methods when evaluated on datasets containing terms that do not usually appear in dictionaries.
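
As a rough illustration of method b), the relationship between two search patterns could be scored with a correlation coefficient. The sketch below is an assumption, not the authors' published procedure: the abstract does not give the formula, so Pearson's r is used here as one plausible choice, and the function names and the sample series are hypothetical. Each term is represented by an equal-length list of search-volume values over the same time periods.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric series."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("series must have equal length of at least 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        # A flat series carries no pattern to compare (assumed convention).
        return 0.0
    return cov / sqrt(var_x * var_y)


def pattern_similarity(xs, ys):
    """Map the correlation onto [0, 1] by clamping negative values to 0.

    Clamping is an illustrative choice so the score reads as a similarity.
    """
    return max(0.0, pearson(xs, ys))


# Hypothetical weekly search volumes for two terms.
term_a = [1, 2, 3, 4]
term_b = [2, 4, 6, 8]
score = pattern_similarity(term_a, term_b)
```

Two terms whose search volumes rise and fall together receive a score near 1, while unrelated or opposing patterns receive a score near 0. Other measures, such as cosine similarity, could be substituted in `pattern_similarity` without changing the overall shape of the sketch.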

Suggested Citation

  • Jorge Martinez-Gil & José F. Aldana-Montes, 2013. "Semantic similarity measurement using historical google search patterns," Information Systems Frontiers, Springer, vol. 15(3), pages 399-410, July.
  • Handle: RePEc:spr:infosf:v:15:y:2013:i:3:d:10.1007_s10796-012-9404-7
    DOI: 10.1007/s10796-012-9404-7

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10796-012-9404-7
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    References listed on IDEAS

    1. Silke Retzer & Pak Yoong & Val Hooper, 2012. "Inter-organisational knowledge transfer in social networks: A definition of intermediate ties," Information Systems Frontiers, Springer, vol. 14(2), pages 343-361, April.
    2. Zhang, Guoqiang & Patuwo, B. Eddy & Hu, Michael Y., 1998. "Forecasting with artificial neural networks: The state of the art," International Journal of Forecasting, Elsevier, vol. 14(1), pages 35-62, March.
    3. Angelos Hliaoutakis & Giannis Varelas & Epimenidis Voutsakis & Euripides G.M. Petrakis & Evangelos Milios, 2006. "Information Retrieval by Semantic Similarity," International Journal on Semantic Web and Information Systems (IJSWIS), IGI Global, vol. 2(3), pages 55-73, July.
    4. Jiexun Li & G. Alan Wang & Hsinchun Chen, 2011. "Identity matching using personal and social identity features," Information Systems Frontiers, Springer, vol. 13(1), pages 101-113, March.
    5. Leo Egghe & Loet Leydesdorff, 2009. "The relation between Pearson's correlation coefficient r and Salton's cosine measure," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 60(5), pages 1027-1036, May.

    Citations

    Cited by:

    1. Jorge Martinez-Gil & Alejandra Lorena Paoletti & Mario Pichler, 0. "A Novel Approach for Learning How to Automatically Match Job Offers and Candidate Profiles," Information Systems Frontiers, Springer, vol. 0, pages 1-10.
    2. Malu Castellanos & Florian Daniel & Irene Garrigós & Jose-Norberto Mazón, 2013. "Business Intelligence and the Web," Information Systems Frontiers, Springer, vol. 15(3), pages 307-309, July.
    3. Jorge Martinez-Gil & Alejandra Lorena Paoletti & Mario Pichler, 2020. "A Novel Approach for Learning How to Automatically Match Job Offers and Candidate Profiles," Information Systems Frontiers, Springer, vol. 22(6), pages 1265-1274, December.
    4. Rolando Quintero & Miguel Torres-Ruiz & Magdalena Saldaña-Pérez & Carlos Guzmán Sánchez-Mejorada & Felix Mata-Rivera, 2023. "A Conceptual Graph-Based Method to Compute Information Content," Mathematics, MDPI, vol. 11(18), pages 1-22, September.
    5. Lin-Chih Chen, 0. "Interactive Topic Search System Based on Topic Cluster Technology," Information Systems Frontiers, Springer, vol. 0, pages 1-17.
    6. Lin-Chih Chen, 2021. "Interactive Topic Search System Based on Topic Cluster Technology," Information Systems Frontiers, Springer, vol. 23(5), pages 1227-1243, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chulhwan Chris Bang, 2015. "Information systems frontiers: Keyword analysis and classification," Information Systems Frontiers, Springer, vol. 17(1), pages 217-237, February.
    2. Balkin, Sandy, 2001. "On Forecasting Exchange Rates Using Neural Networks: P.H. Franses and P.V. Homelen, 1998, Applied Financial Economics, 8, 589-596," International Journal of Forecasting, Elsevier, vol. 17(1), pages 139-140.
    3. Barrow, Devon & Kourentzes, Nikolaos, 2018. "The impact of special days in call arrivals forecasting: A neural network approach to modelling special days," European Journal of Operational Research, Elsevier, vol. 264(3), pages 967-977.
    4. Daniel Buncic, 2012. "Understanding forecast failure of ESTAR models of real exchange rates," Empirical Economics, Springer, vol. 43(1), pages 399-426, August.
    5. Apostolos Ampountolas & Titus Nyarko Nde & Paresh Date & Corina Constantinescu, 2021. "A Machine Learning Approach for Micro-Credit Scoring," Risks, MDPI, vol. 9(3), pages 1-20, March.
    6. Ebrahimpour, Reza & Nikoo, Hossein & Masoudnia, Saeed & Yousefi, Mohammad Reza & Ghaemi, Mohammad Sajjad, 2011. "Mixture of MLP-experts for trend forecasting of time series: A case study of the Tehran stock exchange," International Journal of Forecasting, Elsevier, vol. 27(3), pages 804-816, July.
    7. Hewamalage, Hansika & Bergmeir, Christoph & Bandara, Kasun, 2021. "Recurrent Neural Networks for Time Series Forecasting: Current status and future directions," International Journal of Forecasting, Elsevier, vol. 37(1), pages 388-427.
    8. Leung, Philip C.M. & Lee, Eric W.M., 2013. "Estimation of electrical power consumption in subway station design by intelligent approach," Applied Energy, Elsevier, vol. 101(C), pages 634-643.
    9. Donya Rahmani & Saeed Heravi & Hossein Hassani & Mansi Ghodsi, 2016. "Forecasting time series with structural breaks with Singular Spectrum Analysis, using a general form of recurrent formula," Papers 1605.02188, arXiv.org.
    10. Wei Sun & Yujun He & Hong Chang, 2015. "Forecasting Fossil Fuel Energy Consumption for Power Generation Using QHSA-Based LSSVM Model," Energies, MDPI, vol. 8(2), pages 1-21, January.
    11. Saman, Corina, 2011. "Scenarios of the Romanian GDP Evolution With Neural Models," Journal for Economic Forecasting, Institute for Economic Forecasting, vol. 0(4), pages 129-140, December.
    12. Ghiassi, M. & Saidane, H. & Zimbra, D.K., 2005. "A dynamic artificial neural network model for forecasting time series events," International Journal of Forecasting, Elsevier, vol. 21(2), pages 341-362.
    13. Barrow, Devon K., 2016. "Forecasting intraday call arrivals using the seasonal moving average method," Journal of Business Research, Elsevier, vol. 69(12), pages 6088-6096.
    14. Jani, D.B. & Mishra, Manish & Sahoo, P.K., 2017. "Application of artificial neural network for predicting performance of solid desiccant cooling systems – A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 80(C), pages 352-366.
    15. Oscar Claveria & Salvador Torra, 2013. "Forecasting Business surveys indicators: neural networks vs. time series models," AQR Working Papers 201312, University of Barcelona, Regional Quantitative Analysis Group, revised Nov 2013.
    16. Bento, P.M.R. & Pombo, J.A.N. & Calado, M.R.A. & Mariano, S.J.P.S., 2018. "A bat optimized neural network and wavelet transform approach for short-term price forecasting," Applied Energy, Elsevier, vol. 210(C), pages 88-97.
    17. Nataša Glišović & Miloš Milenković & Nebojša Bojović & Libor Švadlenka & Zoran Avramović, 2016. "A hybrid model for forecasting the volume of passenger flows on Serbian railways," Operational Research, Springer, vol. 16(2), pages 271-285, July.
    18. Timothy Praditia & Thilo Walser & Sergey Oladyshkin & Wolfgang Nowak, 2020. "Improving Thermochemical Energy Storage Dynamics Forecast with Physics-Inspired Neural Network Architecture," Energies, MDPI, vol. 13(15), pages 1-26, July.
    19. Ata, Rasit, 2015. "Artificial neural networks applications in wind energy systems: a review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 49(C), pages 534-562.
    20. Christian Fieberg & Daniel Metko & Thorsten Poddig & Thomas Loy, 2023. "Machine learning techniques for cross-sectional equity returns’ prediction," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 45(1), pages 289-323, March.
