Unsupervised Discovery of Relations and Discriminative Extraction Patterns

Alan Akbik, Larysa Visengeriyeva, Priska Herger, Holmer Hemsen and Alexander Löser:

Unsupervised Discovery of Relations and Discriminative Extraction Patterns

In: ACL Anthology – A Digital Archive of Research Papers in Computational Linguistics. Proceedings of COLING 2012, December 2012, Mumbai, India, pp.17-32, https://aclweb.org/anthology/C/C12/

Abstract:

Unsupervised Relation Extraction (URE) is the task of extracting relations of a priori unknown semantic types using clustering methods on a vector space model of entity pairs and patterns. In this paper, we show that an informed feature generation technique based on dependency trees significantly improves clustering quality, as measured by the F-score, and therefore the ability of the URE method to discover relations in text. Furthermore, we extend URE to produce a set of weighted patterns for each identified relation that can be used by an information extraction system to find further instances of this relation. Each pattern is assigned to one or multiple relations with different confidence strengths, indicating how reliably a pattern evokes a relation, using the theory of Discriminative Category Matching. We evaluate our findings in two tasks against strong baselines and show significant improvements both in relation discovery and information extraction.

Download: Link

 

Leave a Reply