Clustering in the Cloud: Clustering Algorithms to Hadoop Map/Reduce Framework
Abstract
Cloud computing has gained increasing popularity over the years for its great potential. It is a logical and forward-thinking solution for addressing key business demands. Cloud computing represents what enterprise IT has always needed: a way to increase capacity or add capabilities on the fly without investing in new infrastructure, training new personnel, or licensing new software. It encompasses any subscription-based or pay-per-use service that, in real time over the Internet, extends IT's existing capabilities. This study investigates how clustering algorithms in data mining can benefit from running in the "Cloud".