Efficient Inline Deduplication on VM Images in Desktop Virtualization Environment

Article Preview

Abstract:

Enterprise service is transforming from traditional physical computing nodes to virtual machines which provides better isolation and more effective use of computing ability of the hardware. However, the widely deployment of virtual machines also increase the pressure of storage significantly with the fact that each must has at list one multi-gigabytes image file to store. To address the high pressure of storage from virtual machine images, we developed a user level inline deduplication file system with Content Addressable Storage (CAS). We use the open-source framework FUSE to encapsulate the deduplication process so as to achieve portability and flexibility. Compared to an ordinary file system without deduplication, we show that our file system can save at least 30% of space of single VM image and even more of multiple VM images while achieving considerable run time performance.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

488-493

Citation:

Online since:

February 2013

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2013 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Partho Nath, Michael A. Kozuch, David R. O'Hallaron. Design Tradeoffs in Applying Content Addressable Storage to Enterprise-scale Systems Based on Virtual Machines. Annual Tech '06: 2006 USENIX Annual Technical Conference.

Google Scholar

[2] C. Ungureanu, B. Atkin, A. Aranya, S. Gokhale, S. Rago, G. Calkowski, C. Dubnicki, and A. Bohra. HydraFS: a High-Throughput File System for the HYDRAstor Content Addressable Storage System. In Proc. of USENIX FAST, (2010).

Google Scholar

[3] C. Dubnicki, L. Gryz, L. Heldt, M. Kaczmarczyk, W. Kilian, P. Strzelczak, J. Szczepkowski, C. Ungureanu, and M. Welnicki. Hydrastor. A scalable secondary storage. In Proc. USENIX FAST, (2009).

Google Scholar

[4] S. Quinlan and S. Dorward. Venti: a new approach to archival storage. In Proc. USENIX FAST, (2002).

Google Scholar

[5] B. Zhu, K. Li, and H. Patterson. Avoiding the disk bottleneck in the data domain deduplication file system. In Proc. USENIX FAST, (2008).

Google Scholar

[6] K. Jin and E. L. Miller. The effectiveness of deduplication on virtual machine disk images. In Proc. ACM SYSTOR, (2009).

DOI: 10.1145/1534530.1534540

Google Scholar

[7] Liguori and E. Van Hensbergen. Experiences with Content Addressable Storage and Virtual Disks. In WIOV'08, (2008).

Google Scholar

[8] Chun-Ho Ng, Mingcao Ma, Tsz-YeungWong. Live Deduplication Storage of Virtual Machine Images in an Open-Source Cloud. Middleware (2011).

Google Scholar

[9] File System in User Space. http: /fuse. sourceforge. net.

Google Scholar