Remote Differential Compression

Remote Differential Compression

Remote Differential Compression (RDC) is a client-server synchronization algorithm that allows the contents of two files to be synchronized by communicating only the differences between them. It was introduced with Windows Server 2003 R2 and is included with later Windows client and server operating systems.

Unlike Binary Delta Compression (BDC), which is designed to operate only on known versions of a single file, RDC does not make assumptions about file similarity or versioning. The differences between files are computed on the fly, therefore RDC is suitable for efficient synchronization of files that have been updated independently, network bandwidth is small or in scenarios where the files are large but the differences between them are small.

The algorithm used is based on fingerprinting blocks on each file locally at both ends of the replication partners. Since many types of file changes can cause the file contents to move (for example, a small insertion or deletion at the beginning of a file can cause the rest of the file to become misaligned to the original content) the blocks used for comparison are not based on static arbitrary cut points but on cut points defined by the contents of each file segment. This means that if a part of a file changes in lenght or blocks of the contents get moved to other parts of the file, the block boundaries for the parts that have not changed remain fixed related to the contents, and thus the series of fingerprints for those blocks don't change either, they just change position. By comparing all hashes in a file to the hashes for the same file at the other end of the replication pair, RDC is able to identify which blocks of the file have changed and which haven't, even if the contents of the file has been significantly reshuffled. Since comparing large files could imply making large numbers of signature comparisons, the algorithm is recursively applied to the hash sets to detect which blocks of hashes have changed or moved around, significantly reducing the amount of thata that needs to be transmitted for comparing files.

The Client Side Caching (CSC) feature in Windows Vista makes use of the technology for the first time, allowing file types such as Microsoft Outlook personal folders (*.pst) to be made available offline. Previously, Windows XP used only file metadata to test if a file such as a .pst had changed. When the application "touches" a .PST file's date, even when it does not make any changes, it triggers an update of the file in Windows XP causing CSC to recopy these large files unnecessarily.In Windows Vista the file will be updated ony if it has actually been modified, and only the actual parts of the file that have been changed are transmitted.

External links

* [http://technet2.microsoft.com/windowsserver/en/library/8c4cf2e7-0b92-4643-acbd-abfa9f189d031033.mspx?mfr=true Introduction to DFS replication]
* [http://msdn2.microsoft.com/en-us/library/aa372948.aspx About Remote Differential Compression]
* [ftp://ftp.research.microsoft.com/pub/tr/TR-2006-157.pdf Optimizing File Replication over Limited-Bandwidth Networks using Remote Differential Compression]


Wikimedia Foundation. 2010.

Игры ⚽ Нужен реферат?

Look at other dictionaries:

  • Remote Desktop Services — Developer(s) Microsoft Stable release 7.0 (6.1.7600) / October 27, 2009 …   Wikipedia

  • Binary delta compression — (BDC) is a technology used in software deployment for distributing patches. Explanation Downloading large amounts of data over the internet for software updates can induce high network traffic problems, especially when a network of computers is… …   Wikipedia

  • Remote backup service — Types of Internet hosting service Full featured hosting Virtual private server · Dedicated hosting · …   Wikipedia

  • Windows Vista networking technologies — This article is part of a series on Windows Vista New features Overview Technical and core system Security and safety Networking technologies I/O technologies Management and administration Removed features …   Wikipedia

  • rsync — Original author(s) Andrew Tridgell, Paul Mackerras Developer(s) Wayne Davison Initia …   Wikipedia

  • NTFS — Developer Microsoft Full name New Technology File System[1] Introduced July 1993 (Windows NT 3.1) Partition identifier 0x07 (MBR) EBD0A0A2 B9E5 4433 87C0 68B6 …   Wikipedia

  • Technical features new to Windows Vista — This article is part of a series on Windows Vista New features Overview Technical and core system Security and safety Networking technologies I/O technologies Management and administration Removed features …   Wikipedia

  • Distributed File System (Microsoft) — This article is about Microsoft s implementation of DFS. For general discussion of the concept and other implementations, see Distributed file system. Distributed File System (DFS) is a set of client and server services that allow an organization …   Wikipedia

  • List of Microsoft Windows components — The following is a list of Microsoft Windows components. Contents 1 Configuration and maintenance 2 User interface 3 Applications and utilities 4 Windows Server components …   Wikipedia

  • Roaming user profile — C:Documents and Settings{username}   Application Data   …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”