Printed from https://ideas.repec.org/a/bla/jinfst/v72y2021i8p1011-1027.html

Full coverage of a reader's interests in context‐based information filtering

Authors

  • Alexandra Dumitrescu
  • Simone Santini

Abstract

We present a collection of algorithms to filter a stream of documents in such a way that the filtered documents will cover as well as possible the interest of a person, keeping in mind that, at any given time, the offered documents should not only be relevant, but should also be diversified, in the sense of covering all the interests of the person. We use a modification of the WEBSOM algorithm to create a user model based on a self‐organizing network trained using a collection of documents representative of the person's interests. We introduce the concepts of freshness and coverage. A document is fresh if it belongs to a semantic area of interest to a person for which no documents were seen in the recent past; a group of documents has coverage to the extent to which it is a good representation of all the interests of a person. Our tests show that these algorithms can effectively increase the coverage of the documents that are shown to the user without overly affecting precision.
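
The abstract defines freshness and coverage only in words, so the following is a minimal sketch of one possible reading, not the authors' algorithms. It assumes the user model has already been reduced to a handful of prototype vectors (standing in for the units of a trained self-organizing network), that each document is a vector in the same space, and that "recent past" means the last few documents shown. The names `best_unit`, `coverage`, `filter_stream`, `max_dist`, and `window` are hypothetical and do not come from the paper.

```python
from collections import deque
from math import dist

def best_unit(doc, units):
    """Index of the prototype (interest area) closest to the document vector."""
    return min(range(len(units)), key=lambda i: dist(doc, units[i]))

def coverage(docs, units):
    """Fraction of interest areas matched by at least one document in the group."""
    hit = {best_unit(d, units) for d in docs}
    return len(hit) / len(units)

def filter_stream(stream, units, max_dist=0.5, window=2):
    """Yield documents that are relevant and fresh.

    Assumption: relevant means within max_dist of the nearest prototype;
    fresh means that prototype was not matched by any of the last
    `window` documents shown.
    """
    recent = deque(maxlen=window)  # interest areas of recently shown documents
    for doc in stream:
        u = best_unit(doc, units)
        relevant = dist(doc, units[u]) <= max_dist
        fresh = u not in recent
        if relevant and fresh:
            recent.append(u)
            yield doc

# Toy user model with three interest areas, and a short document stream.
units = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
stream = [(0.1, 0.0), (0.0, 0.1), (0.9, 0.1), (5.0, 5.0), (0.1, 0.9)]

shown = list(filter_stream(stream, units))
print(shown)                         # documents passed by the filter
print(coverage(shown, units))        # coverage of the filtered group
print(coverage(stream[:2], units))   # coverage of the first two raw documents
```

In this toy run the second document is dropped because it falls in the same interest area as the first, and the fourth is dropped because it is far from every prototype. The filtered group touches all three areas, whereas the first two unfiltered documents touch only one.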

Suggested Citation

  • Alexandra Dumitrescu & Simone Santini, 2021. "Full coverage of a reader's interests in context‐based information filtering," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 72(8), pages 1011-1027, August.
  • Handle: RePEc:bla:jinfst:v:72:y:2021:i:8:p:1011-1027
    DOI: 10.1002/asi.24470

    Download full text from publisher

    File URL: https://doi.org/10.1002/asi.24470
    Download Restriction: no



