Parallel K-Means Algorithm based on Hadoop-MapReduce for Mining - Lays Helena Lopes Veloso,Luciano José Senger
-30% ar kodu BOOKS
Piegāde 12-18 darba dienu laikā
30 dienu atgriešanas politika
This work aimed to investigate the use of a parallel K-Means clustering algorithm, based on the MapReduce programming model, to improve the response time of data mining. The algorithm's performance was evaluated in terms of SpeedUp and ScaleUp. To this end, experiments were performed on a Hadoop cluster consisting of six computers with standard hardware. The clustered data are measurements from flow towers ... Pilns apraksts
Jums varētu patikt arī
Aprašymas
This work aimed to investigate the use of a parallel K-Means clustering algorithm, based on the MapReduce programming model, to improve the response time of data mining. The algorithm's performance was evaluated in terms of SpeedUp and ScaleUp. To this end, experiments were performed on a Hadoop cluster consisting of six computers with standard hardware. The clustered data are measurements from flow towers in agricultural regions and belong to Ameriflux. The experiments were performed using 3, 4, and 6 machines, respectively. The results showed that with the increase in the number of machines, there was a gain in performance, with the best time obtained using six machines, reaching a SpeedUp of 3.25. It was found that the application scales well with the equivalent increase in data size and number of machines in the cluster, achieving similar performance in the tests.
Vairāk informācijas
| Autors | Lays Helena Lopes Veloso, Luciano José Senger |
|---|---|
| Izdevējs | Our Knowledge Publishing |
| Izlaides gads | 2025 |
| Vāka tips | Mīkstais vāks |
| EAN | 9786209114083 |