linux-lvm.redhat.com archive mirror
 help / color / mirror / Atom feed
* [linux-lvm] Can LVM block I/O and hang a system?
@ 2007-09-12  7:09 Maurice Volaski
  2007-09-12 10:15 ` Lars Ellenberg
  0 siblings, 1 reply; 3+ messages in thread
From: Maurice Volaski @ 2007-09-12  7:09 UTC (permalink / raw)
  To: linux-lvm; +Cc: linux-kernel

A working system begins hanging and it seems to be stuck on I/O 
processes that use ext3 partitions that are running on top of LVM. 
The system is AMD 64-bit running Gentoo. Kernel is Gentoo 2.6.22-r3 
and LVM lvm2-2.02.27. Here is the disk setup:

Boot disk, attached to motherboard via SATA
1) some partitions accessed via ext3 -> hardware partition.
2) some partitions accessed via ext3 -> drbd, which is version 8.0.5, 
-> hardware partition.

External SATA-SCSI RAID, attached to via an LSI Logic card,
3) one partition accessed via ext3 -> drbd -> hardware partition.
4) some partitions accessed via ext3 -> LVM -> drbd -> hardware partition.

On repeated reboots, #1) boots fine, and I can fsck #2) no problem. I 
can also fsck #3, but the fsck processes on #4, which all are trying 
to recover the journals, just seem to not do anything. There is no 
evidence of I/O and there are no errors reported anywhere. The frozen 
fsck processes cannot even be killed and the system ignores the 
shutdown command.

That the hanging fsck processes are all occurring on just the LVM 
partitions seems to imply that LVM is responsible.

drbd had been unattached to its peer during this time, and when I 
reattached it, it had no trouble syncing to the peer. That system, 
which should basically be identical, however, has no trouble running 
running fsck everywhere. I'm not sure, though, if that lets LVM off 
the hook.
-- 

Maurice Volaski, mvolaski@aecom.yu.edu
Computing Support, Rose F. Kennedy Center
Albert Einstein College of Medicine of Yeshiva University

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [linux-lvm] Can LVM block I/O and hang a system?
  2007-09-12  7:09 [linux-lvm] Can LVM block I/O and hang a system? Maurice Volaski
@ 2007-09-12 10:15 ` Lars Ellenberg
  2007-09-17 21:23   ` Alasdair G Kergon
  0 siblings, 1 reply; 3+ messages in thread
From: Lars Ellenberg @ 2007-09-12 10:15 UTC (permalink / raw)
  To: linux-lvm

On Wed, Sep 12, 2007 at 03:09:14AM -0400, Maurice Volaski wrote:
> A working system begins hanging and it seems to be stuck on I/O 
> processes that use ext3 partitions that are running on top of LVM. 
> The system is AMD 64-bit running Gentoo. Kernel is Gentoo 2.6.22-r3 
> and LVM lvm2-2.02.27. Here is the disk setup:
> 
> Boot disk, attached to motherboard via SATA
> 1) some partitions accessed via ext3 -> hardware partition.
> 2) some partitions accessed via ext3 -> drbd, which is version 8.0.5, 

please use drbd 8.0.6 with kernels that do not allow recursion in
generic_make_request, otherwise any lock is likely to be caused by this.

then, if there are still lockups, these may be caused by lvm, and may be
due to some similar problem, namely doing synchronous (housekeeping) io
from within the generic_make_request path within the same "thread".

> -> hardware partition.
> 
> External SATA-SCSI RAID, attached to via an LSI Logic card,
> 3) one partition accessed via ext3 -> drbd -> hardware partition.
> 4) some partitions accessed via ext3 -> LVM -> drbd -> hardware partition.
> 
> On repeated reboots, #1) boots fine, and I can fsck #2) no problem. I 
> can also fsck #3, but the fsck processes on #4, which all are trying 
> to recover the journals, just seem to not do anything. There is no 
> evidence of I/O and there are no errors reported anywhere. The frozen 
> fsck processes cannot even be killed and the system ignores the 
> shutdown command.
> 
> That the hanging fsck processes are all occurring on just the LVM 
> partitions seems to imply that LVM is responsible.
> 
> drbd had been unattached to its peer during this time, and when I 
> reattached it, it had no trouble syncing to the peer. That system, 
> which should basically be identical, however, has no trouble running 
> running fsck everywhere. I'm not sure, though, if that lets LVM off 
> the hook.
> -- 
> 
> Maurice Volaski, mvolaski@aecom.yu.edu
> Computing Support, Rose F. Kennedy Center
> Albert Einstein College of Medicine of Yeshiva University

-- 
: Lars Ellenberg                            Tel +43-1-8178292-0  :
: LINBIT Information Technologies GmbH      Fax +43-1-8178292-82 :
: Vivenotgasse 48, A-1120 Vienna/Europe    http://www.linbit.com :
__
please use the "List-Reply" function of your email client.

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [linux-lvm] Can LVM block I/O and hang a system?
  2007-09-12 10:15 ` Lars Ellenberg
@ 2007-09-17 21:23   ` Alasdair G Kergon
  0 siblings, 0 replies; 3+ messages in thread
From: Alasdair G Kergon @ 2007-09-17 21:23 UTC (permalink / raw)
  To: LVM general discussion and development

On Wed, Sep 12, 2007 at 12:15:43PM +0200, Lars Ellenberg wrote:
> then, if there are still lockups, these may be caused by lvm, and may be
> due to some similar problem, namely doing synchronous (housekeeping) io
> from within the generic_make_request path within the same "thread".
 
We moved the housekeeping out of the way, but there can still be
problems when an incoming piece of I/O needs to be split between
devices.

Alasdair
-- 
agk@redhat.com

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2007-09-17 21:23 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-09-12  7:09 [linux-lvm] Can LVM block I/O and hang a system? Maurice Volaski
2007-09-12 10:15 ` Lars Ellenberg
2007-09-17 21:23   ` Alasdair G Kergon

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).