Home

Awesome

Open Information Extraction (OIE) Resources

A curated list of Open Information Extraction (OIE) resources: research papers, code, data, applications, etc. The list is not limited to Open Information Extraction systems exclusively. It also includes work highly related to OIE, such as taxonomizing open relations and using OIE in downstream applications.

Table of contents

Introduction to OIE

Open Information Extraction (OIE) systems aim to extract unseen relations and their arguments from unstructured text in unsupervised manner. In its simplest form, given a natural language sentence, they extract information in the form of a triple, consisted of subject (S), relation (R) and object (O).

Suppose we have the following input sentence:

AMD, which is based in U.S., is a technology company.

An OIE system aims to make the following extractions:

("AMD"; "is based in"; "U.S.")
("AMD"; "is"; "technology company")

Papers sorted in chronological order

2006

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

Papers grouped by category

Surveys

Evaluation

OIE for downstream applications

OIE's output has been shown to be a useful input for many downstream tasks. In this section, several downstream tasks that benefited from OIE output are listed.

Question Answering

Slot Filling

Event Extraction

Text Summarization

Knowledge Base Population

Knowledge Base Construction

Entity Linking

Relation Linking

Open Link Prediction

Relation Extraction

Relating Entities

Story Comprehension

Text Generation

Video Grounding

OIE in Different Languages

Most of the OIE systems are focused on extractions made from text written on English. However, some OIE systems either are focused on a language other than English, or are multilingual. In this section, OIE systems on languages other than English or multilingual OIE systems are listed.

Multilingual OIE Systems

OIE Systems for German Language

OIE Systems for Portugese Language

OIE Systems for Spanish Language

OIE Systems for Chinese Language

OIE Systems for Persian Language

OIE Systems for Italian Language

OIE Systems for Indonesian Language

OIE Systems for Greek Language

Supervised OIE

Canonicalization of OIE

Slides

Talks

Code

Data

OIE output is used as a useful input in many other downstream tasks, such as question answering, event schema induction or generating inference rules. Moreover, OIE output can be used as a "fuel" to derive further resources. Here, the data is organized into two major categories: 1) OIE corpora; 2) Resources derived from OIE output.

OIE corpora

Resources derived from OIE output

PhD theses

Demos