Optimization of Backup Storage by Reducing Fragmentation in Distributed Environment

Abstract

In modern backup systems, Deduplication plays a vital role in the elimination of duplicate data in a storage system which one of the technique to reduce storage costs. Deduplication divides a backup stream into variable sized chunks of data used to map stored chunks to their physical addresses. These chunks of data are physically distributed and create a problem known as fragmentation. Primarily fragmentation categorized into sparse and out-of-order containers. The sparse container adversely affects the performance while restoring the database and garbage collection effectively, while the out-of-order container brings an adverse effect on the performance issue if the restore cache built is small. To minimize the fragmentation problem, we implement the History-Aware Rewriting algorithm (HAR) and Cache-Aware Filter (CAF). HAR will collect the historical information in backup systems to identify and reduce sparse containers, and CAF to restore cache knowledge to find the out-of-order containers that impacts restore performance. We exploit Container-Marker Algorithm (CMA) to gather valid containers instead of valid chunks which help in garbage collection. My results help to prove how HAR, CMA improves the restore performance.

Authors and Affiliations

Sandeep Wagh, Prof. Sagar Bhakre

Keywords

Related Articles

Effect of Poly Vinyl Acetate and Poly Vinyl Alcohol as Cement Admixture on Strength of Concrete

This paper reviews the observations of addition of poly vinyl alcohol and poly vinyl acetate the poly fibers together to cement bond matrix. Polymer fiber serves as superplastitisizer which results in low rate of water...

An Analysis on Cyber Crime, Cyber Threats and Role of Cyber Analyst

Computer crime is a general term that embraces such crimes as phishing, credit card frauds, bank robbery, illegal downloading, industrial espionage, child pornography, kidnapping children via chat rooms, scams, cyber te...

Contact Stress Analysis and Stress Optimization of Spur Gear

The gears for transmission develop stresses at the mating positions over the teeth. A pair of spur gear teeth is generally subjected to two types of cyclic stresses as bending stresses and contact stresses. These stress...

A Least Path Matrix Concept to Detect Isomorphism in Planar Kinematic chain’s

In the early stage of mechanism design, it is helpful to have all possible kinematic chain with required number of links and degree of freedom. The mechanism will lead to systematic development of enumeration, identific...

Optical Character Recognition.

Character Recognition (CR) is the process of recognizing handwritten characters. It is an active area of research and it is used in various applications such as reading license plate numbers, document processing, readin...

Download PDF file
  • EP ID EP24329
  • DOI -
  • Views 239
  • Downloads 8

How To Cite

Sandeep Wagh, Prof. Sagar Bhakre (2017). Optimization of Backup Storage by Reducing Fragmentation in Distributed Environment. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 5(5), -. https://europub.co.uk/articles/-A-24329