IDEAS home Printed from https://ideas.repec.org/a/igg/jswis0/v12y2016i3p134-163.html
   My bibliography  Save this article

SPORTAL: Profiling the Content of Public SPARQL Endpoints

Author

Listed:
  • Ali Hasnain

    (INSIGHT Centre for Data Analytics, National University of Ireland, Galway, Ireland)

  • Qaiser Mehmood

    (INSIGHT Centre for Data Analytics, National University of Ireland, Galway, Ireland)

  • Syeda Sana e Zainab

    (INSIGHT Centre for Data Analytics, National University of Ireland, Galway, Ireland)

  • Aidan Hogan

    (Center for Semantic Web Research, Department of Computer Science, University of Chile, Santiago, Chile)

Abstract

Access to hundreds of knowledge bases has been made available on the Web through public SPARQL endpoints. Unfortunately, few endpoints publish descriptions of their content (e.g., using VoID). It is thus unclear how agents can learn about the content of a given SPARQL endpoint or, relatedly, find SPARQL endpoints with content relevant to their needs. In this paper, the authors investigate the feasibility of a system that gathers information about public SPARQL endpoints by querying them directly about their own content. With the advent of SPARQL 1.1 and features such as aggregates, it is now possible to specify queries whose results would form a detailed profile of the content of the endpoint, comparable with a large subset of VoID. In theory it would thus be feasible to build a rich centralised catalogue describing the content indexed by individual endpoints by issuing them SPARQL (1.1) queries; this catalogue could then be searched and queried by agents looking for endpoints with content they are interested in. In practice, however, the coverage of the catalogue is bounded by the limitations of public endpoints themselves: some may not support SPARQL 1.1, some may return partial responses, some may throw exceptions for expensive aggregate queries, etc. The authors' goal in this paper is thus twofold: (i) using VoID as a bar, to empirically investigate the extent to which public endpoints can describe their own content, and (ii) to build and analyse the capabilities of a best-effort online catalogue of current endpoints based on the (partial) results collected.

Suggested Citation

  • Ali Hasnain & Qaiser Mehmood & Syeda Sana e Zainab & Aidan Hogan, 2016. "SPORTAL: Profiling the Content of Public SPARQL Endpoints," International Journal on Semantic Web and Information Systems (IJSWIS), IGI Global, vol. 12(3), pages 134-163, July.
  • Handle: RePEc:igg:jswis0:v:12:y:2016:i:3:p:134-163
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJSWIS.2016070105
    Download Restriction: no
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jswis0:v:12:y:2016:i:3:p:134-163. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.