In this paper, we examine the task of extracting information about terrorism related events hidden in a large document collection.
The task assumes that a terrorism related event can be described by a set of entity and relation instances. To reduce the
amount of time and efforts in extracting these event related instances, one should ideally perform the task on the relevant documents only. We
... [Show full abstract] have therefore proposed some document selection strategies based on information extraction (IE) patterns.
Each strategy attempts to select one document at a time such that the gain of event related instance information is maximized. Our IE-based document selection strategies assume that some IE patterns are given to extract event instances. We conducted some experiments for one terrorism related event. Experiments have shown that our proposed IE based document selection strategies
work well in the extraction task for news collections of various size.