Computational Technology Resources - CCP

Keywords: data grid, data management, scheduling, OptorSim, access cost.

Summary

The static replication strategy replicates a file whenever there is a miss without considering file popularity at the source site. As a result of replication of even non-popular files there is an increase in the replica management cost and the overall job execution cost. The existing strategies for dynamic replication, replicates a file when a file is popular at the source site. Request for access to non-popular files is done through remote access causing increased file access latency and increased average job execution time. Relocating such files to the point of consumption would result in decreased file access latency and hence decreased overall job execution time. Consideration of the replication approach alone does not optimize the scenario. Hence this necessitates a mechanism that incorporates relocation along with replication dynamically. The proposed dynamic hybrid data-management (DHD) mechanism is implemented based on the access behaviour of files to decide on creating a replica or relocating. The mechanism is implemented in the replica optimizer component of OptorSim. The simulation was carried for different sets of jobs for different job schedulers to evaluate the implication of pure replication, pure relocation and our PDIR strategy on the overall disk space utilization and overall job execution time. Our scheme DHD has been evaluated with different schedulers such as the random scheduler, the queue length scheduler, the access cost scheduler and the queue access cost scheduler against mechanisms such as complete relocation, complete replication using the optimizers LRU, LFU and the economic model Zipf distribution. Average job execution times and the percentage of storage element filled graphs were drawn for the mechanisms considered. Also with the number of jobs being increased the variation of average job execution time and percentage of storage element is shown. It is concluded that the enhanced dynamic replication strategy results in a performance-efficient system in terms of overall job execution time and overall percentage of the storage element filled.

purchase the full-text of this paper (price £20)

go to the previous paper
go to the next paper
return to the table of contents
return to the book description
purchase this book (price £40 +P&P)

	Computational & Technology Resources an online resource for computational, engineering & technology publications
	not logged in - login
Front Page Browse CCP CSETS CTR IJRT Other Authors Search Purchase Guide FAQ Contact us	Civil-Comp Proceedings ISSN 1759-3433 CCP: 101 PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, GRID AND CLOUD COMPUTING FOR ENGINEERING Edited by: B.H.V. Topping and P. Iványi Paper 35 A Dynamic Hybrid Data-Management Mechanism for Data Grids K.K. Pattanaik and U.B. Reddy Atal Bihari Vajpayee-Indian Institute of Information Technology and Management, Gwalior, Madhya Pradesh, India doi:10.4203/ccp.101.35 purchase the full-text of this paper Full Bibliographic Reference for this paper K.K. Pattanaik, U.B. Reddy, "A Dynamic Hybrid Data-Management Mechanism for Data Grids", in B.H.V. Topping, P. Iványi, (Editors), "Proceedings of the Third International Conference on Parallel, Distributed, Grid and Cloud Computing for Engineering", Civil-Comp Press, Stirlingshire, UK, Paper 35, 2013. doi:10.4203/ccp.101.35 Keywords: data grid, data management, scheduling, OptorSim, access cost. Summary The static replication strategy replicates a file whenever there is a miss without considering file popularity at the source site. As a result of replication of even non-popular files there is an increase in the replica management cost and the overall job execution cost. The existing strategies for dynamic replication, replicates a file when a file is popular at the source site. Request for access to non-popular files is done through remote access causing increased file access latency and increased average job execution time. Relocating such files to the point of consumption would result in decreased file access latency and hence decreased overall job execution time. Consideration of the replication approach alone does not optimize the scenario. Hence this necessitates a mechanism that incorporates relocation along with replication dynamically. The proposed dynamic hybrid data-management (DHD) mechanism is implemented based on the access behaviour of files to decide on creating a replica or relocating. The mechanism is implemented in the replica optimizer component of OptorSim. The simulation was carried for different sets of jobs for different job schedulers to evaluate the implication of pure replication, pure relocation and our PDIR strategy on the overall disk space utilization and overall job execution time. Our scheme DHD has been evaluated with different schedulers such as the random scheduler, the queue length scheduler, the access cost scheduler and the queue access cost scheduler against mechanisms such as complete relocation, complete replication using the optimizers LRU, LFU and the economic model Zipf distribution. Average job execution times and the percentage of storage element filled graphs were drawn for the mechanisms considered. Also with the number of jobs being increased the variation of average job execution time and percentage of storage element is shown. It is concluded that the enhanced dynamic replication strategy results in a performance-efficient system in terms of overall job execution time and overall percentage of the storage element filled. purchase the full-text of this paper (price £20) go to the previous paper go to the next paper return to the table of contents return to the book description purchase this book (price £40 +P&P)
Back to top	©Civil-Comp Limited 2023 - terms & conditions