Knowledge bases (KBs) such as Freebase and Yago are rather incomplete, and the situation is more serious in non-English KBs, such as Chinese KBs. In this paper, we present a
language-independent framework to tackle the slot-filling task by searching the Web with high-precision queries, and deriving lightweight extraction
patterns. The
patterns are based on string matching, and since they make no use of complex NLP resources, which may be unavailable in some
languages, they are very
language-independent.
We use a traditional bootstrapping approach for extraction, but also use a novel approach to suppress the noise associated with distant supervision: in particular, we use a pseudo-testing method to validate the patterns derived from different sentences. Experiments show that our framework achieves very encouraging results.