Parallelizing data mining algorithms in GT4
Hai Ly-Hoang
Faculty of Information Technology, Ho Chi Minh City University of Technology
Hieu Duong Ngoc
Faculty of Information Technology, Ho Chi Minh City University of Technology Tran Khanh Dang
Faculty of Information Technology, Ho Chi Minh City University of Technology Duc-Cuong Nguyen
School of Computer Science and Engineering, National University of HCMC Van Hoai Tran
Faculty of Information Technology, Ho Chi Minh City University of Technology Nam Thoai
Faculty of Information Technology, Ho Chi Minh City University of Technology
Abstract
Grid is emerging as a suitable environment to solve data and computing intensive problems. Typically, data mining algorithms process large datasets and need strong computing resources. This is not usually provided by a single personal computer or even massive CPU systems due to its high cost. In this paper we introduce a solution to parallel data mining algorithms in grid environment. Experimental results with real datasets prove the efficiency of our proposed solution.
|