Conference Proceedings

Second International Conference on Advances in Information Processing and Communication Technology - IPCT 2015

Document Classification Using Distributed Machine Learning

Author(s) : GALIP AYDIN, IBRAHIM RIZA HALLAC

Abstract

In this paper, we investigate several machine learning algorithms for the automatic classification of Turkish news into predetermined categories such as economy, life, and health. We use Apache big data technologies, including Hadoop, HDFS, Spark, and Mahout, and distributed machine learning frameworks.
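
As an illustration of the classification task described in the abstract, the following is a minimal single-machine sketch of a multinomial Naive Bayes text classifier. It is not the authors' implementation: the abstract does not name the algorithms evaluated, so the choice of Naive Bayes is an assumption, and the sketch uses plain Python rather than Spark or Mahout. The function names (`tokenize`, `train`, `classify`) and the toy English training sentences are hypothetical.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Lowercase and split on whitespace. A real pipeline for Turkish news
    # would need language-aware normalization, which is not attempted here.
    return text.lower().split()


def train(labeled_docs):
    """Count documents per category and word occurrences per category."""
    doc_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in labeled_docs:
        doc_counts[label] += 1
        for token in tokenize(text):
            word_counts[label][token] += 1
            vocab.add(token)
    return doc_counts, word_counts, vocab


def classify(text, model):
    """Return the category with the highest log-posterior score."""
    doc_counts, word_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label in doc_counts:
        # Log prior: fraction of training documents in this category.
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for token in tokenize(text):
            if token not in vocab:
                continue  # ignore words never seen during training
            # Laplace (add-one) smoothing avoids zero probabilities.
            count = word_counts[label][token]
            score += math.log((count + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy training data (English stand-ins for news headlines).
TRAINING = [
    ("central bank raises interest rates", "economy"),
    ("stock market falls as inflation grows", "economy"),
    ("new vaccine reduces flu risk", "health"),
    ("doctors recommend daily exercise", "health"),
]

model = train(TRAINING)
print(classify("inflation and interest rates", model))
```

In this toy run the query shares words only with the economy examples, so it is assigned to that category. A distributed version would compute the same per-category counts in parallel across partitions of the corpus and merge them, which is the kind of workload frameworks such as Spark and Mahout are designed for.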

Conference Title : Second International Conference on Advances in Information Processing and Communication Technology - IPCT 2015
Conference Date(s) : 18 - 19 April, 2015
Place : Hotel Novotel Roma La Rustica, Rome, Italy
No. of Author(s) : 2
DOI : 10.15224/978-1-63248-044-6-129
Page(s) : 166 - 169
Electronic ISBN : 978-1-63248-044-6