* [linux-lvm] Snapshot weirdness
@ 2005-11-03 23:53 kelsey hudson
2005-11-04 1:20 ` Dan Stromberg
0 siblings, 1 reply; 3+ messages in thread
From: kelsey hudson @ 2005-11-03 23:53 UTC (permalink / raw)
To: linux-lvm
Hello.
I'm building a disk-to-disk-to-tape backup appliance here, and decided
that for maximum flexibility I'd use LVM2 (mainly because of its
snapshot feature and the ability to hot-add disks and extend volumes
seamlessly. Good stuff.)
Anyhow, I have a 600GB primary physical volume configured with a single
logical volume utilizing 99% of the extents. I have the system set to
take a snapshot every night so there's always a live copy of the data
available for backup. Three such snapshots are used in rotation (the
oldest snapshot is deleted and recreated as the newest); each occupy 25
extents. The problem is, after some time, I'll have a bunch of errors
regarding the snapshot volumes spewed to the system logs and console. If
I subsequently try and read from the filesystem, the kernel shuts the
filesystem down (XFS feature).
This makes it rather inconvenient to back up a snapshot -- if I can't
read it, it doesn't do me much good to have it. I'm basically using the
snapshots read-only, and the filesystems are mounted as such, as well.
So, can anyone shed some insight on why I have self-corrupting
filesystems on my snapshot volumes?
Thanks in advance.
-Kelsey
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: [linux-lvm] Snapshot weirdness
2005-11-03 23:53 [linux-lvm] Snapshot weirdness kelsey hudson
@ 2005-11-04 1:20 ` Dan Stromberg
2005-11-04 7:07 ` kelsey hudson
0 siblings, 1 reply; 3+ messages in thread
From: Dan Stromberg @ 2005-11-04 1:20 UTC (permalink / raw)
To: LVM general discussion and development
Is disk to disk to tape cost effective compared to disk to disk?
I don't mean to assert that disk to disk to tape is more or less
expensive than disk to disk, but it seems worth comparing.
I'm thinking disk to disk to tape'd be faster in some sense than disk to
tape, in fact that's what we use here at UCI for a lot of our backups,
but if you're starting over from scratch, how about comparing the cost
of something like:
1) A bunch of opterons with RAID 5 volumes built via md, tacked together
with Ibrix, or perhaps just a bunch of gnbd's md'd together into a huge
xfs
2) One of those many rsync front-ends that stores only one copy of a
given file? backuppc seems to have the most sophisticated user
interface, but honorable mention goes to rdiff-backup for using rdiff
(binary diff based on the rsync algorithm) and reverse deltas (which
ISTR was the big plus of CVS over prior source code control systems, so
the diff'ing gets deeper as you go back in time, not forward in time,
and since you're more likely to need contemporary files...).
Before you decide that the hashing involved in rsync would cause too
many collisions, bear in mind that with a sufficiently strong hash, you
can have a lower probability of a collision than the probability of a
tape failure...
On Thu, 2005-11-03 at 15:53 -0800, kelsey hudson wrote:
> Hello.
>
> I'm building a disk-to-disk-to-tape backup appliance here, and decided
> that for maximum flexibility I'd use LVM2 (mainly because of its
> snapshot feature and the ability to hot-add disks and extend volumes
> seamlessly. Good stuff.)
>
> Anyhow, I have a 600GB primary physical volume configured with a single
> logical volume utilizing 99% of the extents. I have the system set to
> take a snapshot every night so there's always a live copy of the data
> available for backup. Three such snapshots are used in rotation (the
> oldest snapshot is deleted and recreated as the newest); each occupy 25
> extents. The problem is, after some time, I'll have a bunch of errors
> regarding the snapshot volumes spewed to the system logs and console. If
> I subsequently try and read from the filesystem, the kernel shuts the
> filesystem down (XFS feature).
>
> This makes it rather inconvenient to back up a snapshot -- if I can't
> read it, it doesn't do me much good to have it. I'm basically using the
> snapshots read-only, and the filesystems are mounted as such, as well.
>
> So, can anyone shed some insight on why I have self-corrupting
> filesystems on my snapshot volumes?
>
> Thanks in advance.
> -Kelsey
>
> _______________________________________________
> linux-lvm mailing list
> linux-lvm@redhat.com
> https://www.redhat.com/mailman/listinfo/linux-lvm
> read the LVM HOW-TO at http://tldp.org/HOWTO/LVM-HOWTO/
>
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: [linux-lvm] Snapshot weirdness
2005-11-04 1:20 ` Dan Stromberg
@ 2005-11-04 7:07 ` kelsey hudson
0 siblings, 0 replies; 3+ messages in thread
From: kelsey hudson @ 2005-11-04 7:07 UTC (permalink / raw)
To: LVM general discussion and development
Dan Stromberg wrote:
> Is disk to disk to tape cost effective compared to disk to disk?
It is. We have need for both online and archival backups. The online
backups in case we need to hot-restore a file or to quickly go back to
an old snapshot of a database or something, and an archive in case
things go terribly wrong, and for (as the name implies) an archive.
> I don't mean to assert that disk to disk to tape is more or less
> expensive than disk to disk, but it seems worth comparing.
It's always important to weigh the options. :)
> I'm thinking disk to disk to tape'd be faster in some sense than disk to
> tape, in fact that's what we use here at UCI for a lot of our backups,
> but if you're starting over from scratch, how about comparing the cost
> of something like:
Funny you mention that -- I work on the UCI campus, in the research park
near the corner of Bison and California. Small world ...
Anyhow, yes. Disk to disk to tape is much faster. Most everything I'm
backing up is over the network, mainly on large NAS volumes mounted over
100MBit ethernet (yes, our network admin needs his head examined). So,
in order to keep the tape buffer from under-running and causing the tape
head to back up and retrain (costly), I cache the information locally.
But, in the case that a sync is still in progress, I take a snapshot
when the sync is over. The tape process goes and backs up that snapshot,
while the original volume is (again) being synchronized.
At least, this is how it works in *concept*. Then again, Itanium looked
good on paper, too :)
> 1) A bunch of opterons with RAID 5 volumes built via md, tacked together
> with Ibrix, or perhaps just a bunch of gnbd's md'd together into a huge
> xfs
If only money were no object. :) I had a $500 budget, and some old
hardware to munge together in order to achieve a working solution. You'd
absolutely cringe if you found out what I ended up doing. Fortunately,
the LTO autoloader is new, though.
> 2) One of those many rsync front-ends that stores only one copy of a
> given file? backuppc seems to have the most sophisticated user
> interface, but honorable mention goes to rdiff-backup for using rdiff
> (binary diff based on the rsync algorithm) and reverse deltas (which
> ISTR was the big plus of CVS over prior source code control systems, so
> the diff'ing gets deeper as you go back in time, not forward in time,
> and since you're more likely to need contemporary files...).
I'm in the process of writing my own front-ends and processes to do most
of this; what I need to do is largely custom (but when I'm done you can
bet I'll release what I've done in case someone else may find it useful)
:D
> Before you decide that the hashing involved in rsync would cause too
> many collisions, bear in mind that with a sufficiently strong hash, you
> can have a lower probability of a collision than the probability of a
> tape failure...
eh, rsync is great, I use it all over the place. And, for sufficiently
different files, the hashing doesn't cause too many collisions,
especially when using a strong hash like MD5 (which is what IIRC rsync
uses). I actually use rsync internally to do most of the network (and
local!) copying.
Anyways, I managed to get the kernel to panic twice today when removing
snapshot volumes which had been corrupted by the phantom metadata eater.
I'll see what I can do about reproducing it tomorrow.
Anyhow, off to sleep for me.
Cheers,
-Kelsey
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2005-11-04 7:07 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2005-11-03 23:53 [linux-lvm] Snapshot weirdness kelsey hudson
2005-11-04 1:20 ` Dan Stromberg
2005-11-04 7:07 ` kelsey hudson
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.