IDEAS home Printed from https://ideas.repec.org/a/igg/jswis0/v8y2012i3p42-73.html
   My bibliography  Save this article

Elementary: Large-Scale Knowledge-Base Construction via Machine Learning and Statistical Inference

Author

Listed:
  • Feng Niu

    (Computer Sciences Department, University of Wisconsin-Madison, USA)

  • Ce Zhang

    (Computer Sciences Department, University of Wisconsin-Madison, USA)

  • Christopher Ré

    (Computer Sciences Department, University of Wisconsin-Madison, USA)

  • Jude Shavlik

    (Computer Sciences Department, University of Wisconsin-Madison, USA)

Abstract

Researchers have approached knowledge-base construction (KBC) with a wide range of data resources and techniques. The authors present Elementary, a prototype KBC system that is able to combine diverse resources and different KBC techniques via machine learning and statistical inference to construct knowledge bases. Using Elementary, they have implemented a solution to the TAC-KBP challenge with quality comparable to the state of the art, as well as an end-to-end online demonstration that automatically and continuously enriches Wikipedia with structured data by reading millions of webpages on a daily basis. The authors describe several challenges and their solutions in designing, implementing, and deploying Elementary. In particular, the authors first describe the conceptual framework and architecture of Elementary to integrate different data resources and KBC techniques in a principled manner. They then discuss how they address scalability challenges to enable Web-scale deployment. The authors empirically show that this decomposition-based inference approach achieves higher performance than prior inference approaches. To validate the effectiveness of Elementary’s approach to KBC, they experimentally show that its ability to incorporate diverse signals has positive impacts on KBC quality.

Suggested Citation

  • Feng Niu & Ce Zhang & Christopher Ré & Jude Shavlik, 2012. "Elementary: Large-Scale Knowledge-Base Construction via Machine Learning and Statistical Inference," International Journal on Semantic Web and Information Systems (IJSWIS), IGI Global, vol. 8(3), pages 42-73, July.
  • Handle: RePEc:igg:jswis0:v:8:y:2012:i:3:p:42-73
    as

    Download full text from publisher

    File URL: http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/jswis.2012070103
    Download Restriction: no
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mayank Kejriwal & Pedro Szekely, 2019. "myDIG: Personalized Illicit Domain-Specific Knowledge Discovery with No Programming," Future Internet, MDPI, vol. 11(3), pages 1-23, March.
    2. Yu Zhang & Min Wang & Morteza Saberi & Elizabeth Chang, 2020. "Knowledge fusion through academic articles: a survey of definitions, techniques, applications and challenges," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 2637-2666, December.
    3. Yu Zhang & Morteza Saberi & Elizabeth Chang, 2018. "A semantic-based knowledge fusion model for solution-oriented information network development: a case study in intrusion detection field," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(2), pages 857-886, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:igg:jswis0:v:8:y:2012:i:3:p:42-73. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Journal Editor (email available below). General contact details of provider: https://www.igi-global.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.