Conference Proceedings

International Conference on Advances in Information Processing and Communication Technology - IPCT 2014

Outlier Document Filtering Applied to the Extractive Summarization

Author(s) : COSKUN SONMEZ   , METIN TURAN   

Abstract

Summarization requires selection of the more informative sentences within a set of documents. Generally, process assumes the document set includes related topics to a subject. However, some of the documents may be outlier and the effect of an outlier document might affect the success of extractive summary. Research is focused on filtering documents at the extraction stage these are outlier. Extraction finds the outlier documents far distance from representative document set word vector (DSWV). DUC 2006 data set is used for tests. System summaries are compared with the systems generated by DUC participants. Results points out that filtering outlier documents overwhelm all the systems fairly.

Conference Title : International Conference on Advances in Information Processing and Communication Technology - IPCT 2014
Conference Date(s) : 07- 08 June,2014
Place : Hotel Novotel Roma La Rustica, Rome, Italy
No fo Author(s) : 2
DOI : 10.15224/978-1-63248-021-7-67
Page(s) : 1 - 4
Electronic ISBN : 978-1-63248-021-7
Views : 818   |   Download(s) : 235