Chapter

A Hybrid Sentence Ordering Strategy in Multi-document Summarization

10/2006; DOI:10.1007/11912873_35 pp.339-349

ABSTRACT In extractive summarization, a proper arrangement of extracted sentences must be found if we want to generate a logical, coherent
and readable summary. This issue is special in multi-document summarization. In this paper, several existing methods each
of which generate a reference relation are combined through linear combination of the resulting relations. We use 4 types
of relationships between sentences (chronological relation, positional relation, topical relation and dependent relation)
to build a graph model where the vertices are sentences and edges are weighed relationships of the 4 types. And then apply
a variation of page rank to get the ordering of sentences for multi-document summaries. We tested our hybrid model with two
automatic methods: distance to manual ordering and ROUGE score. Evaluation results show a significant improvement of the ordering
over strategies losing some relations. The results also indicate that this hybrid model is robust for articles with different
genre which were used on DUC2004 and DUC2005.

0 0
 · 
0 Bookmarks
 · 
23 Views

Keywords

4 types
 
articles
 
automatic methods
 
chronological relation
 
dependent relation
 
Evaluation results
 
extractive summarization
 
graph model
 
hybrid model
 
logical
 
multi-document summaries
 
multi-document summarization
 
page rank
 
positional relation
 
proper arrangement
 
reference relation
 
relationships
 
resulting relations
 
ROUGE score
 
vertices
 

Yanxiang He