A Light-weight Data Replication for Cloud Data Centers Environment
Unlike traditional high-performance computing environments, such as clusters and supercomputers, cloud computing is a collection of interconnected and virtualized computing resources that are managed as one unified high-performance computing resource. However, the cloud is a heterogeneous and highly dynamic environment. Failures of data center nodes are the norm rather than the exception because of the large scale of physical resources and data. As a result, the cloud environment requires efficient, adaptive data replication management in order to cope with these inherent characteristics. In this paper, we propose a data replication strategy that adaptively selects the data files for replication in order to improve the overall reliability of the system and to meet the required quality of service. Further, the proposed strategy dynamically decides the number of replicas as well as the effective data nodes for replication. Popular data files are selected for replication using a lightweight time-series technique, which analyzes the recent pattern of data file requests and predicts future data requests. Experimental results show that the proposed strategy effectively improves the reliability of the cloud system under study.
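To make the popularity-prediction idea concrete, the following is a minimal sketch of how recent request counts might be turned into a forecast and then into a replication candidate list. The abstract does not name the time-series technique, so single exponential smoothing is used here purely as an assumed stand-in; the function names, the smoothing factor `alpha`, and the popularity `threshold` are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List


def predict_next_requests(history: List[int], alpha: float = 0.5) -> float:
    """Forecast the next interval's request count.

    Assumption: single exponential smoothing stands in for the paper's
    unspecified lightweight time-series technique.
    """
    if not history:
        return 0.0
    forecast = float(history[0])
    for observed in history[1:]:
        # Recent observations weigh more heavily than older ones.
        forecast = alpha * observed + (1 - alpha) * forecast
    return forecast


def select_files_for_replication(
    request_log: Dict[str, List[int]],
    threshold: float,
    alpha: float = 0.5,
) -> List[str]:
    """Return files whose predicted demand meets a hypothetical threshold,
    ordered from most to least popular."""
    predictions = {
        name: predict_next_requests(counts, alpha)
        for name, counts in request_log.items()
    }
    popular = [name for name, p in predictions.items() if p >= threshold]
    return sorted(popular, key=lambda name: predictions[name], reverse=True)


if __name__ == "__main__":
    # Per-interval request counts for three illustrative files.
    log = {"a.dat": [2, 4, 8], "b.dat": [8, 4, 2], "c.dat": []}
    print(select_files_for_replication(log, threshold=5.0))
```

In this sketch a file with rising demand (`a.dat`) is selected while one with falling demand (`b.dat`) is not, even though both received the same total number of requests. Choosing the number of replicas and the target data nodes, which the proposed strategy also decides, is outside the scope of this example.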