題名: An Efficient Distributed Hierarchical-Clustering Algorithm for Large Scale Data
作者: Tang, Cheng-Hsien Jr
Huang, An-Ching Jr
Tsai, Meng-Feng Jr
Wang, Wei-Jen Jr
關鍵字: Hierarchical Clustering
Parallel Computing
期刊名/會議名稱: 2010 ICS會議
摘要: The data-classification process can possibly involve a huge amount of data in today’s cloud computing environment. It could take a long time for processing, and could consume many resources for computation and storage. This study focuses on the problem of using the traditional hierarchical agglomerative clustering algorithm on a distributed environment since hierarchical agglomerative clustering has high applicability and efficiency. A parallel hierarchical agglomerative clustering algorithm is proposed in this study. The proposed algorithm divides the whole computation into several small tasks, distribute the tasks to message-passing processes, and merge the results to form a hierarchical cluster. A threshold is used to reduce the storage requirement during the computation. To evaluate the performance and limitation of our algorithm, this study has conducted several experiments using real astronomical data, the main asteroid belt catalog. The experimental results confirm that the proposed parallel algorithm is efficient.
日期: 2011-02-18T03:25:48Z
分類:2010年 ICS 國際計算機會議(如需查看全文,請連結至IEEE Xplore網站)

文件中的檔案:
沒有與此文件相關的檔案。


在 DSpace 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。