Open Information Extraction
To address these questions, the Open IE project has been developing a Web-scale information extraction system that reads arbitrary text from any domain on the Web, extracts meaningful information, and stores it in a unified knowledge base for efficient querying. In contrast to traditional information extraction, the Open Information Extraction paradigm attempts to overcome the knowledge acquisition bottleneck by extracting a large number of relations at once.
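The pipeline described above (read text, extract assertions, store them for querying) can be pictured with a toy sketch. This is only an illustration under stated assumptions, not the project's implementation: the names `Assertion` and `AssertionStore` are hypothetical, the (argument, relation phrase, argument) triple layout is assumed for the example, and the sample assertions are written by hand rather than produced by any extractor.

```python
from collections import defaultdict
from typing import NamedTuple


class Assertion(NamedTuple):
    """One extracted assertion (hypothetical layout): arg1, relation phrase, arg2."""
    arg1: str
    relation: str
    arg2: str


class AssertionStore:
    """Toy in-memory knowledge base with one index per field of the triple."""

    def __init__(self) -> None:
        self._all = set()
        self._by_field = {
            "arg1": defaultdict(set),
            "relation": defaultdict(set),
            "arg2": defaultdict(set),
        }

    def add(self, assertion: Assertion) -> None:
        """Store the assertion and index it under each lowercased field value."""
        self._all.add(assertion)
        for field, index in self._by_field.items():
            index[getattr(assertion, field).lower()].add(assertion)

    def query(self, arg1=None, relation=None, arg2=None):
        """Return, in sorted order, the assertions matching every field that is not None."""
        constraints = {"arg1": arg1, "relation": relation, "arg2": arg2}
        result = set(self._all)
        for field, value in constraints.items():
            if value is not None:
                # .get avoids creating empty index entries for unseen values
                result &= self._by_field[field].get(value.lower(), set())
        return sorted(result)


# Hand-written sample assertions, standing in for extractor output.
store = AssertionStore()
store.add(Assertion("Edison", "invented", "the phonograph"))
store.add(Assertion("Edison", "was born in", "Ohio"))
store.add(Assertion("Bell", "invented", "the telephone"))

inventions = store.query(relation="invented")
for a in inventions:
    print(a.arg1, "|", a.relation, "|", a.arg2)
```

Because no relation vocabulary is fixed in advance, the relation phrase is stored as an ordinary string and indexed like the arguments, so a query can constrain any combination of the three fields.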
Demo: TextRunner extracted over 500 million assertions from 100 million Web pages.
Software: ReVerb Open Information Extraction Software and additional information.
Data: Horn-clause inference rules learned by the Sherlock system.
Demo: Selectional Preferences from Web Text compute admissible argument values for a relation.
Data: 10,000 Functional Relations learned from Web Text predict the functionality of a phrase.
People
Publications
- Identifying Relations for Open Information Extraction (2011)
- Open Information Extraction: The Second Generation (2011)
- Inference Over the Web (2011)
- Commonsense from the Web: Relation Properties (2010)
- Identifying Functional Relations in Web Text (2010)
- Learning First-Order Horn Clauses from Web Text (2010)
- A Latent Dirichlet Allocation Method for Selectional Preferences (2010)
- Extracting Sequences from the Web (2010)
- Open Information Extraction using Wikipedia (2010)
- Unsupervised Ontology Induction from Text (2010)
- Machine Reading at the University of Washington (2010)
- Semantic Role Labeling for Open Information Extraction (2010)
- Analysis of a Probabilistic Model of Redundancy in Unsupervised Information Extraction (2010)
- Identifying Interesting Assertions from the Web (2009)
- Unsupervised Semantic Parsing (2009)
- What Is This, Anyway: Automatic Hypernym Discovery (2009)
- Unsupervised Methods for Determining Object and Relation Synonyms on the Web (2009)
- Open Information Extraction from the Web (2008)
- It's a Contradiction -- No, It's Not: A Case Study using Functional Relations (2008)
- Scaling Textual Inference to the Web (2008)
- Information Extraction from Wikipedia: Moving Down the Long Tail (2008)
- The Tradeoffs Between Open and Traditional Relation Extraction (2008)
- Strategies for Lifelong Knowledge Extraction from the Web (2007)
- Sparse Information Extraction: Unsupervised Language Models to the Rescue (2007)
- Unsupervised Resolution of Objects and Relations on the Web (2007)
- Machine Reading (2007)
- Open Information Extraction from the Web (2007)
- Machine Reading (2006)
- A Probabilistic Model of Redundancy in Information Extraction (2005)
