[an error occurred while processing the directive]
[an error occurred while processing the directive]
[an error occurred while processing the directive]
Home > 教育と研究 > 研究科案内 > 専攻と講義科目 > 数理情報学
教育と研究

数理情報学 movie
専攻の目的Department Objective教員と研究室Faculty and Labs講義
学位論文(修士)学位論文(博士)入試案内Admissionsお知らせ
Technical Reports計数工学科数理情報工学コース

>> 学位論文(修士)へ戻る

Parallelizing Spectral K-Means Algorithm with Map-Reduce Framework (Map-ReduceによるSpectral K-Meansアルゴリズムの並列化に関する研究)

胡 ヱ康

(指導教員:武市正人 教授)
資料PDF(wkhu.pdf
研究概要

scalability on multi-nodes
Map-reduce is the hottest parallel programming framework which enables automatic parallelization and distribution of large scale computations on distributed systems composed with a number of general purpose machines via network. Meanwhile, implementing spectral clustering based algorithms with map-reduce framework was usually not applicable because the memory usage of establishing dense similarity matrix of large datasets is too heavy for general purpose machines. In this thesis, we proposed a spectral K-means algorithm using approximated sparse similarity matrix to make it suitable for general purpose machines and implemented it with Hadoop's map-reduce framework. By experiments on TDT-2 and RCV1 corpus, we obtained both good clustering quality and scalability with this implementation.
修論の感想

During the two years in the master program, I tried implementing various algorithms on different parallel programming enviroments and finally decided to do the above work as my thesis. Writing the thesis is an interesting but challenging work, and thanks to my supervisor and all our lab members for their instructions and advices.
page top


[an error occurred while processing the directive]