Clustering in the Cloud: Clustering Algorithms to Hadoop Map/Reduce Framework
Date
2010-05-04
Authors
Wang, Xuan
Abstract
Cloud computing has gained increasing popularity over the years for its great potential. It is a logical and forward-thinking solution for addressing key business demands. Cloud computing represents what enterprise IT has always needed: a way to increase capacity or add capabilities on the fly without investing in new infrastructure, training new personnel, or licensing new software. Cloud computing encompasses any subscription-based or pay-per-use service that extends IT's existing capabilities in real time over the Internet. This study investigates how clustering algorithms in data mining can benefit from running in the "Cloud".
Keywords
Hadoop, MapReduce, Amazon EC2, clustering, K-means, algorithms, cloud computing, framework, clustering algorithms, data mining, Computer Science
Citation
Wang, X. (2010). Clustering in the cloud (Report No. TXSTATE-CS-TR-2010-24). Texas State University-San Marcos, Department of Computer Science.