Building a Large Knowledge Base from a Structured Source: The CIA World Fact Book

From Tetherless World Wiki

Jump to: navigation, search

Citation: Gleb Frank and Adam Farquhar and Richard Fikes. (1998) Building a Large Knowledge Base from a Structured Source: The CIA World Fact Book. In KSL-98-16, April,1998.

Publication techreport ( Edit )
type Technical Report
bibtype techreport
Bibtex basics
author Gleb Frank and Adam Farquhar and Richard Fikes
title Building a Large Knowledge Base from a Structured Source: The CIA World Fact Book
number KSL-98-16
institution Knowledge Systems, AI Laboratory
address Stanford, CA, USA
year 1998
month April
Bibtex more
Access Paper
abstract The on-line world is populated by an increasing number of knowledge-rich resources. Furthermore, there is a growing trend among authors to provide semantic markup of these resources. This presents a tantalizing prospect. Perhaps we can leverage the person-years of effort invested in building these knowledge-rich resources to create large-scale knowledge bases. The World Fact Book knowledge base has been an experiment in the construction of a large-scale knowledge base from a source authored using semantic markup. The content of the knowledge base is, in large part, derived from the CIA World Fact Book, and covers a broad range of information about the world's nations. The World Fact Book is a highly structured document with a complex underlying ontology. The structure makes it possible to parse the document in order to carry out the knowledge extraction. However, irregularities of the text written by humans and the complexity of the domain make the knowledge extraction process non-trivial. We describe the process we used to construct the World Fact Book knowledge base, including parsing the source, refining the implicit knowledge, constructing a substantial supporting ontology, and reusing existing ontologies. We also discuss some of the key representational issues addressed and show how the resulting axioms can be used to answer a variety of queries. We hope that the broad accessibility of the resulting knowledge base and its neutral representational format will enable others to work with and extend the content, as well as explore issues of structuring and inferencing in large-scale knowledge bases.

KSL Technical Report ID: KSL-98-16
Facts about Building a Large Knowledge Base from a Structured Source: The CIA World Fact BookRDF feed
Abstract The on-line world is populated by an incre The on-line world is populated by an increasing number of knowledge-rich resources. Furthermore, there is a growing trend among authors to provide semantic markup of these resources. This presents a tantalizing prospect. Perhaps we can leverage the person-years of effort invested in building these knowledge-rich resources to create large-scale knowledge bases. The World Fact Book knowledge base has been an experiment in the construction of a large-scale knowledge base from a source authored using semantic markup. The content of the knowledge base is, in large part, derived from the CIA World Fact Book, and covers a broad range of information about the world's nations. The World Fact Book is a highly structured document with a complex underlying ontology. The structure makes it possible to parse the document in order to carry out the knowledge extraction. However, irregularities of the text written by humans and the complexity of the domain make the knowledge extraction process non-trivial. We describe the process we used to construct the World Fact Book knowledge base, including parsing the source, refining the implicit knowledge, constructing a substantial supporting ontology, and reusing existing ontologies. We also discuss some of the key representational issues addressed and show how the resulting axioms can be used to answer a variety of queries. We hope that the broad accessibility of the resulting knowledge base and its neutral representational format will enable others to work with and extend the content, as well as explore issues of structuring and inferencing in large-scale knowledge bases. nferencing in large-scale knowledge bases.
Address Stanford, CA, USA  +
Author Gleb Frank and Adam Farquhar and Richard Fikes  +
Bibtype techreport  +
Has author Gleb Frank and Adam Farquhar and Richard Fikes  +
Has identifier KSL-98-16  +
Has publishing details April,1998  +
Has title Building a Large Knowledge Base from a Structured Source: The CIA World Fact Book  +
Has where published KSL-98-16  +
Has year 1998  +
Institution Knowledge Systems, AI Laboratory  +
Ksl tr id KSL-98-16  +
Month April  +
Number KSL-98-16  +
Process note NO  +
Title Building a Large Knowledge Base from a Structured Source: The CIA World Fact Book  +
Year 1998  +
Personal tools