Knowledge and Information Systems (2023) 65:827–853
https://doi.org/10.1007/s10115-022-01781-7
REGULAR PAPER
A storytree-based model for inter-document causal relation
extraction from news articles
Chong Zhang1 · Jiagao Lyu1 · Ke Xu1
Received: 3 March 2021 / Revised: 9 October 2022 / Accepted: 16 October 2022 /
Published online: 3 November 2022
© The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2022
Abstract
As more and more news articles appear on the Internet, discovering causal relations
between news articles is important for helping people understand how news stories develop.
Extracting the causal relations between news articles is an inter-document relation extraction
task. Existing work on relation extraction cannot solve it well for two reasons:
(1) Most relation extraction models are intra-document models, which focus
on relation extraction between entities. However, news articles are many times longer and
more complex than entities, which makes inter-document relation extraction harder
than intra-document extraction. (2) Existing inter-document relation extraction models rely on
similarity information between news articles, which can limit the performance of extraction
methods. In this paper, we propose an inter-document model based on storytree information
to extract causal relations between news articles. We incorporate storytree information into integer
linear programming (ILP) and design storytree constraints for the ILP objective function.
Experimental results show that all the constraints are effective and that the proposed method
outperforms widely used machine learning models and a state-of-the-art deep learning model,
with F1 improved by more than 5% on three different datasets. Further analysis shows that
five constraints in our model improve the results to varying degrees and that their effects
differ across the three datasets. An experiment on link features also suggests that link
information has a positive influence.
Keywords Relation classification · News article · Causal relation · Constraint
1 Introduction
News keeps people informed about events happening around the world. With the increase in
the amount of information, the amount of news on news websites has exploded. Understanding
the relations between news articles allows us to better sort out the development of
events and gain a deeper understanding of the news. Therefore, it is meaningful and
Ke Xu
kexu@buaa.edu.cn
1State Key Lab of Software Development Environment, School of Computer Science and Engineering,
Beihang University, Beijing, China