[sllug-members]: easy backups
Bart Whiteley
bwhiteley at novell.com
Mon May 1 20:06:01 MDT 2006
On Mon, 2006-05-01 at 18:41 -0600, Mac Newbold wrote:
> Today at 6:05pm, Bart Whiteley said:
>
> > On Mon, 2006-05-01 at 17:50 -0600, Scott Patten wrote:
> >> I'd like to add a second vote for rdiff-backup. It combines the
> >> advantages of diff with those of rsync. It will very efficiently
> >> create
> >> a copy of a directory and also keep compressed backups of the changes
> >> that occurred since the last backup. When you want to restore a
> >> file,
> >> you just copy it. When you want to restore an older copy then you
> >> issue
> >> an rdiff-backup command that decompresses and recombines the files to
> >> give you the version that you are after.
> >
> > What are the advantages of rdiff-backup over
> > http://www.mikerubel.org/computers/rsync_snapshots/ ?
>
> I'll answer a different question and say that rsnapshot has the advantage
> that every file in any of your snapshots is a complete file, usually a
> hard-link to identical versions of the same file. You can easily search
> your backups with grep or whatever you normally like to use, and restoring
> a certain version is as easy as doing a cp (I like cp -p too). The file
> permissions and ownership are perfectly preserved. The biggest advantage
> I've found in that is knowing that I can quickly and very easily do large
> sweeping restores in case of emergency. Once I know which version I want
> (like the most recent backup) it is extremely easy to get it and do
> whatever you want with it. Since it uses rsync, it also only transfers the
> files that have actually changed, and you can turn on compression for
> remote backups, so that it saves on bandwidth and time. (It doesn't store
> the files compressed, just gzips them during transfer over the network.)
>
> I suspect that an rdiff backup to a remote server would have to transfer
> at least the smaller of the two file versions to find the difference
> between them, and if so, it wouldn't provide any bandwidth/speed savings
> over an rsync/rsnapshot.
>
> The place it could provide an advantage is in the space required for a
> backup, especially if you have lots of large files that get small changes
> (like log files for example). The diffs could also be bigger than the file
> itself, though, if a majority of the file changed, since it shows the old
> way and the new way in the diff. And generally diffs are ridiculously bad
> with binary files, like images, compressed files, non-text documents (like
> the MS Office formats), etc.
>
> The rdiff method can make it more fragile too. This method also adds more
> complications for restoring or examining files in your backups, since
> they're not whole files, just patches, basically. To restore, you need to
> have a valid version of every patch in the dependency chain. I don't know
> how they do it in rdiff-backup, but if the full backup was on April 1st,
> and you wanted the version from April 27th, you might need 27 patch files,
> and if you didn't have all of them, perfect and without any corruption,
> you wouldn't be able to recover the file fully. If they do fulls monthly,
> an incremental to the full weekly, and an incremental to the weekly each
> day, then you'll only need 3 patches for everything to get back perfectly.
> All of this is avoided by storing only whole file versions with
> rsync-based methods.
>
That's what I was thinking. As I read his description of rdiff-backup,
I could only think of ways in which it was inferior to rsnapshot-like
solutions. I was just wondering if I missed something...
More information about the sllug-members
mailing list