Data storage has grown exponentially over the last decade, as the success of storage companies such as NetApp shows. In many organizations, data growth is out of control, and server sprawl is a significant problem as well. Environmental concerns are also pressing, since many organizations are running out of power and cooling capacity in the data center. According to a report published by the US Environmental Protection Agency in 2007, data centers across the US have almost doubled their power consumption. One of the reasons for this increase is the high demand for data storage. Thus, there is an ever-growing need for an effective way to cope with that demand.
If you search online, you will find various documents describing deduplication and how it works, but most of them are fairly technical. I thought it might be helpful to describe deduplication in a way that makes sense to anyone who wants to understand what it does.