Conference Proceedings

Third International Conference on Advances in Computing, Electronics and Communication - ACEC 2015

A Semantic approach for Text Clustering using WordNet based on Multi-Objective Genetic Algorithms

Author(s) : HAN HEE HAHM, JONG JOO LEE, JUNG SONG LEE, SOON CHEOL PARK

Abstract

In this paper, we propose a method of Multi- Objective Genetic Algorithms (MOGAs), NSGA-II and SPEA2, for document clustering with semantic similarity measures based on WordNet. The MOGAs showed a high performance compared to other clustering algorithms. The main problem in the application of MOGAs for document clustering in the Vector Space Model (VSM) is that it ignores relationships between important terms or words. The hierarchical structure of WordNet as thesaurus-based ontology is an effective technique, which is used in semantic similarity measure. We tested these algorithms on Reuter-21578 collection data sets and compared them with Genetic Algorithms (GA) in conjunction with the semantic similarity measures based on WordNet. Also, we used F-measure to evaluate the performance of these clustering algorithms. The experimental results show that the performance of MOGAs based on WordNet is superior to those of the other clustering algorithms in the same similarity environments.

Conference Title : Third International Conference on Advances in Computing, Electronics and Communication - ACEC 2015
Conference Date(s) : 10-11 October, 2015
Place : Hotel Novotel Zurich City-West, Zurich, Switzerland
No fo Author(s) : 4
DOI : 10.15224/978-1-63248-064-4-27
Page(s) : 133 - 137
Electronic ISBN : 978-1-63248-064-4
Views : 480   |   Download(s) : 127