• Frontiers of Optoelectronics
  • Vol. 3, Issue 3, 308 (2010)
Xuejun NIE, Leihua QIN*, Jingli ZHOU, Ke LIU, Jianfeng ZHU, and Yu WANG
Author Affiliations
  • School of Computer Science and Technology, Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan 430074, China
  • show less
    DOI: 10.1007/s12200-010-0103-z Cite this Article
    Xuejun NIE, Leihua QIN, Jingli ZHOU, Ke LIU, Jianfeng ZHU, Yu WANG. Optimization for data de-duplication algorithm based on file content[J]. Frontiers of Optoelectronics, 2010, 3(3): 308 Copy Citation Text show less
    References

    [1] Tony A, Biggar H. Data De-Duplication and Disk-to-Disk Backup Systems: Technical and Business Considerations. The Enterprise Strategy Group Technical Report. 2007

    [2] Biggar H. Experiencing in Data De-Duplication: Improving Efficiency and Reducing Capacity Requirements. The Enterprise Strategy Group Technical Report. 2007

    [3] Lillibridge M, Eshghi K, Bhagwat D, Deolalikar V, Trezise G, Camble P. Sparse indexing: large scale, inline deduplication using sampling and locality. In: Proceedings of the 7th USERNIX Conference on File and Storage Technologies. 2009

    [4] Cox L P, Murray C D, Noble B D. Pastiche: making backup cheap and easy. In: Proceedings of the 5th Symposium on Operating Systems Design and Implementation. 2002, 285-298

    [5] Quinlan S, Dorward S. Venti: a new approach to archival storage. In: Proceedings of the Conference on File and Storage Technologies. 2002, 89-101

    [6] Jain N, Dahlia M, Tewari R. TAPER: tiered approach for eliminating redundancy in replica synchronization. In: Proceedings of the 4th USENIX Conference on File and Storage Technologies. 2005, 4: 21

    [7] Bobbarjung D R, Jagannathan S, Dubnicki C. Improving duplicate elimination in storage systems. ACM Transactions on Storage, 2006, 2(4): 424-448

    [8] Zhu B, Kai L, Patterson H. Avoiding the disk bottleneck in the data domain deduplication file system. In: Proceedings of the 6th USENIX Conference on File and Storage Technologies. 2008, 18

    [9] You L L, Karamanolis C. Evaluation of efficient archival storage techniques. In: Proceedings of the 21st IEEE Symposium on Mass Storage Systems and Technologies. 2004, 227-232

    [10] Manber U. Finding similar files in a large file system. In: Proceedings of the USENIX Winter 1994 Technical Conference. 1994, 1-10

    [11] Rabin M O. Fingerprinting by Random Polynomials. Center for Research in Computing Technology. Harvard University Technical Report TR-15-81. 1981

    [12] Brin S, Davis J, Garcia-Molina H. Copy detection mechanisms for digital documents. In: Proceedings of the ACM SIGMOD International Conference on Management of Data. 1995, 398-409

    Xuejun NIE, Leihua QIN, Jingli ZHOU, Ke LIU, Jianfeng ZHU, Yu WANG. Optimization for data de-duplication algorithm based on file content[J]. Frontiers of Optoelectronics, 2010, 3(3): 308
    Download Citation