Clustering in the Cloud: Clustering Algorithms to Hadoop Map/Reduce Framework

Date

2010-05-04

Authors

Wang, Xuan

Abstract

Cloud computing has grown steadily in popularity over the years because of its potential. It is a logical, forward-looking way to address key business demands. It offers what enterprise IT has always needed: a way to increase capacity or add capabilities on the fly without investing in new infrastructure, training new personnel, or licensing new software. Cloud computing encompasses any subscription-based or pay-per-use service that extends IT's existing capabilities in real time over the Internet. This study investigates how clustering algorithms in data mining can benefit from running in the "Cloud".

Keywords

Hadoop, MapReduce, Amazon EC2, clustering, K-means, algorithms, cloud computing, framework, clustering algorithms, data mining, Computer Science

Citation

Wang, X. (2010). Clustering in the cloud (Report No. TXSTATE-CS-TR-2010-24). Texas State University-San Marcos, Department of Computer Science.