linux-ide.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Re: 2.6.20-rc4-mm1
       [not found] <20070111222627.66bb75ab.akpm@osdl.org>
@ 2007-01-14  9:48 ` Thomas Gleixner
  2007-01-14 10:46   ` 2.6.20-rc4-mm1 Thomas Gleixner
  0 siblings, 1 reply; 9+ messages in thread
From: Thomas Gleixner @ 2007-01-14  9:48 UTC (permalink / raw)
  To: Andrew Morton; +Cc: linux-kernel, jgarzik, linux-ide

On Thu, 2007-01-11 at 22:26 -0800, Andrew Morton wrote:
> - Merged the "filesystem AIO patches".

Hotfixes alreday applied.

BUG: at /home/tglx/work/kernel/vanilla/linux-2.6.20-rc4-mm1/arch/i386/mm/highmem.c:60 kmap_atomic()
 [<c0105fba>] show_trace_log_lvl+0x1a/0x2f
 [<c01065ff>] show_trace+0x12/0x14
 [<c01066b1>] dump_stack+0x16/0x18
 [<c011fad8>] kmap_atomic+0x12f/0x1c8
 [<f88db29c>] ata_scsi_rbuf_get+0x22/0x37 [libata]
 [<f88db773>] atapi_qc_complete+0x1ee/0x240 [libata]
 [<f88d666b>] __ata_qc_complete+0x86/0x8d [libata]
 [<f88d670a>] ata_qc_complete+0x98/0x9e [libata]
 [<f88d98f4>] ata_qc_complete_multiple+0x8a/0xa4 [libata]
 [<f88b3a2c>] ahci_interrupt+0x2bd/0x3b9 [ahci]
 [<c0154a2d>] handle_IRQ_event+0x21/0x48
 [<c0155971>] handle_edge_irq+0xd1/0x115
 [<c01071af>] do_IRQ+0x6c/0x89
 [<c0105a0b>] common_interrupt+0x23/0x28
 [<c01031c2>] mwait_idle+0xd/0xf
 [<c010343f>] cpu_idle+0xb7/0xf1
 [<c010141d>] rest_init+0x37/0x3a
 [<c04629bd>] start_kernel+0x3cc/0x3ef
 [<00000000>] 0x0

ata_scsi_rbuf_get requests KM_IRQ0 type memory and calls kmap_atomic
with interrupts enabled.

Boot proceeds, but gets stuck hard at:
"Remounting root filesystem in read-write mode:"

No SysRq-T, nothing.

The above BUG seems unrelated to that. Investigating further.

	tglx



^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: 2.6.20-rc4-mm1
  2007-01-14  9:48 ` 2.6.20-rc4-mm1 Thomas Gleixner
@ 2007-01-14 10:46   ` Thomas Gleixner
  2007-01-14 14:40     ` 2.6.20-rc4-mm1 Thomas Gleixner
  0 siblings, 1 reply; 9+ messages in thread
From: Thomas Gleixner @ 2007-01-14 10:46 UTC (permalink / raw)
  To: Andrew Morton; +Cc: linux-kernel, jgarzik, linux-ide, axboe

On Sun, 2007-01-14 at 10:48 +0100, Thomas Gleixner wrote:
> ata_scsi_rbuf_get requests KM_IRQ0 type memory and calls kmap_atomic
> with interrupts enabled.
> 
> Boot proceeds, but gets stuck hard at:
> "Remounting root filesystem in read-write mode:"
> 
> No SysRq-T, nothing.
> 
> The above BUG seems unrelated to that. Investigating further.

Bisect identified: git-block.patch

	tglx



^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: 2.6.20-rc4-mm1
  2007-01-14 10:46   ` 2.6.20-rc4-mm1 Thomas Gleixner
@ 2007-01-14 14:40     ` Thomas Gleixner
  2007-01-14 22:05       ` 2.6.20-rc4-mm1 Jens Axboe
  2007-01-14 22:20       ` 2.6.20-rc4-mm1 Jens Axboe
  0 siblings, 2 replies; 9+ messages in thread
From: Thomas Gleixner @ 2007-01-14 14:40 UTC (permalink / raw)
  To: Andrew Morton; +Cc: linux-kernel, jgarzik, linux-ide, axboe

On Sun, 2007-01-14 at 11:46 +0100, Thomas Gleixner wrote:
> > Boot proceeds, but gets stuck hard at:
> > "Remounting root filesystem in read-write mode:"
> > 
> > No SysRq-T, nothing.
> > 
> > The above BUG seems unrelated to that. Investigating further.
> 
> Bisect identified: git-block.patch

Does only happen on 2 systems. Both have sata + raid1 setup. I managed 
to get a stacktrace from the SMP box. Sits there and sleeps forever.

	tglx

[<c032ac64>] io_schedule+0x7a/0x9a
[<c0157f89>] sleep_on_page+0x8/0xc
[<c032ae45>] __wait_on_bit+0x36/0x5d
[<c01580d8>] wait_on_page_bit+0x5b/0x61
[<c0158a2b>] wait_on_page_writeback_range+0x4f/0xef
[<c0158b0f>] filemap_fdatawait+0x44/0x49
[<c0158da0>] filemap_write_and_wait+0x22/0x2d
[<c0190e39>] sync_blockdev+0x17/0x1d
[<c01a27af>] quota_sync_sb+0x33/0xd6
[<c01a2874>] sync_dquots+0x22/0xfa
[<c01757cf>] __fsync_super+0x17/0x66
[<c0175829>] fsync_super+0xb/0x19
[<c0175880>] do_remount_sb+0x49/0x101
[<c0187f98>] do_mount+0x1ad/0x678
[<c01884d2>] sys_mount+0x6f/0xa4
[<c0103f6a>] sysenter_past_esp+0x5f/0x99



^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: 2.6.20-rc4-mm1
  2007-01-14 14:40     ` 2.6.20-rc4-mm1 Thomas Gleixner
@ 2007-01-14 22:05       ` Jens Axboe
  2007-01-14 22:31         ` 2.6.20-rc4-mm1 Thomas Gleixner
  2007-01-14 22:20       ` 2.6.20-rc4-mm1 Jens Axboe
  1 sibling, 1 reply; 9+ messages in thread
From: Jens Axboe @ 2007-01-14 22:05 UTC (permalink / raw)
  To: Thomas Gleixner; +Cc: Andrew Morton, linux-kernel, jgarzik, linux-ide

On Sun, Jan 14 2007, Thomas Gleixner wrote:
> On Sun, 2007-01-14 at 11:46 +0100, Thomas Gleixner wrote:
> > > Boot proceeds, but gets stuck hard at:
> > > "Remounting root filesystem in read-write mode:"
> > > 
> > > No SysRq-T, nothing.
> > > 
> > > The above BUG seems unrelated to that. Investigating further.
> > 
> > Bisect identified: git-block.patch
> 
> Does only happen on 2 systems. Both have sata + raid1 setup. I managed 
> to get a stacktrace from the SMP box. Sits there and sleeps forever.
> 
> 	tglx
> 
> [<c032ac64>] io_schedule+0x7a/0x9a
> [<c0157f89>] sleep_on_page+0x8/0xc
> [<c032ae45>] __wait_on_bit+0x36/0x5d
> [<c01580d8>] wait_on_page_bit+0x5b/0x61
> [<c0158a2b>] wait_on_page_writeback_range+0x4f/0xef
> [<c0158b0f>] filemap_fdatawait+0x44/0x49
> [<c0158da0>] filemap_write_and_wait+0x22/0x2d
> [<c0190e39>] sync_blockdev+0x17/0x1d
> [<c01a27af>] quota_sync_sb+0x33/0xd6
> [<c01a2874>] sync_dquots+0x22/0xfa
> [<c01757cf>] __fsync_super+0x17/0x66
> [<c0175829>] fsync_super+0xb/0x19
> [<c0175880>] do_remount_sb+0x49/0x101
> [<c0187f98>] do_mount+0x1ad/0x678
> [<c01884d2>] sys_mount+0x6f/0xa4
> [<c0103f6a>] sysenter_past_esp+0x5f/0x99

raid seems to have severe problems with the plugging change. I'll try
and find Neil and have a chat with him, hopefully we can work it out.

-- 
Jens Axboe

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: 2.6.20-rc4-mm1
  2007-01-14 14:40     ` 2.6.20-rc4-mm1 Thomas Gleixner
  2007-01-14 22:05       ` 2.6.20-rc4-mm1 Jens Axboe
@ 2007-01-14 22:20       ` Jens Axboe
  1 sibling, 0 replies; 9+ messages in thread
From: Jens Axboe @ 2007-01-14 22:20 UTC (permalink / raw)
  To: Thomas Gleixner; +Cc: Andrew Morton, linux-kernel, jgarzik, linux-ide

On Sun, Jan 14 2007, Thomas Gleixner wrote:
> On Sun, 2007-01-14 at 11:46 +0100, Thomas Gleixner wrote:
> > > Boot proceeds, but gets stuck hard at:
> > > "Remounting root filesystem in read-write mode:"
> > > 
> > > No SysRq-T, nothing.
> > > 
> > > The above BUG seems unrelated to that. Investigating further.
> > 
> > Bisect identified: git-block.patch
> 
> Does only happen on 2 systems. Both have sata + raid1 setup. I managed 
> to get a stacktrace from the SMP box. Sits there and sleeps forever.
> 
> 	tglx
> 
> [<c032ac64>] io_schedule+0x7a/0x9a
> [<c0157f89>] sleep_on_page+0x8/0xc
> [<c032ae45>] __wait_on_bit+0x36/0x5d
> [<c01580d8>] wait_on_page_bit+0x5b/0x61
> [<c0158a2b>] wait_on_page_writeback_range+0x4f/0xef
> [<c0158b0f>] filemap_fdatawait+0x44/0x49
> [<c0158da0>] filemap_write_and_wait+0x22/0x2d
> [<c0190e39>] sync_blockdev+0x17/0x1d
> [<c01a27af>] quota_sync_sb+0x33/0xd6
> [<c01a2874>] sync_dquots+0x22/0xfa
> [<c01757cf>] __fsync_super+0x17/0x66
> [<c0175829>] fsync_super+0xb/0x19
> [<c0175880>] do_remount_sb+0x49/0x101
> [<c0187f98>] do_mount+0x1ad/0x678
> [<c01884d2>] sys_mount+0x6f/0xa4
> [<c0103f6a>] sysenter_past_esp+0x5f/0x99

raid seems to have severe problems with the plugging change. I'll try
and find Neil and have a chat with him, hopefully we can work it out.

-- 
Jens Axboe

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: 2.6.20-rc4-mm1
  2007-01-14 22:31         ` 2.6.20-rc4-mm1 Thomas Gleixner
@ 2007-01-14 22:30           ` Jens Axboe
  2007-01-15  8:22             ` 2.6.20-rc4-mm1 Ingo Molnar
  0 siblings, 1 reply; 9+ messages in thread
From: Jens Axboe @ 2007-01-14 22:30 UTC (permalink / raw)
  To: Thomas Gleixner; +Cc: Andrew Morton, linux-kernel, jgarzik, linux-ide

On Sun, Jan 14 2007, Thomas Gleixner wrote:
> On Mon, 2007-01-15 at 09:05 +1100, Jens Axboe wrote:
> > raid seems to have severe problems with the plugging change. I'll try
> > and find Neil and have a chat with him, hopefully we can work it out.
> 
> Some hints:
> 
> mount(1899): WRITE block 16424 on md3
> call md_write_start
> md3_raid1(438): WRITE block 40965504 on sdb6
> md3_raid1(438): WRITE block 40965504 on sda6
> First Write sector 16424 disks 2
> 
> Stuck.
> 
> Note, that neither end_buffer_async_write() nor
> raid1_end_write_request() are invoked, 
> 
> In a previous write invoked by:
> fsck.ext3(1896): WRITE block 8552 on sdb1
> end_buffer_async_write() is invoked.
> 
> sdb1 is not a part of a raid device.

When I briefly tested this before I left (and found it broken), doing a
cat /proc/mdstat got things going again. Hard if that's your rootfs,
it's just a hint :-)

> Hope that helps,

I can reproduce, so that's not a problem. I can't do much about it until
I'm back next week, but Neil might be able to help. We shall see. Thanks
for testing.

-- 
Jens Axboe


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: 2.6.20-rc4-mm1
  2007-01-14 22:05       ` 2.6.20-rc4-mm1 Jens Axboe
@ 2007-01-14 22:31         ` Thomas Gleixner
  2007-01-14 22:30           ` 2.6.20-rc4-mm1 Jens Axboe
  0 siblings, 1 reply; 9+ messages in thread
From: Thomas Gleixner @ 2007-01-14 22:31 UTC (permalink / raw)
  To: Jens Axboe; +Cc: Andrew Morton, linux-kernel, jgarzik, linux-ide

On Mon, 2007-01-15 at 09:05 +1100, Jens Axboe wrote:
> raid seems to have severe problems with the plugging change. I'll try
> and find Neil and have a chat with him, hopefully we can work it out.

Some hints:

mount(1899): WRITE block 16424 on md3
call md_write_start
md3_raid1(438): WRITE block 40965504 on sdb6
md3_raid1(438): WRITE block 40965504 on sda6
First Write sector 16424 disks 2

Stuck.

Note, that neither end_buffer_async_write() nor
raid1_end_write_request() are invoked, 

In a previous write invoked by:
fsck.ext3(1896): WRITE block 8552 on sdb1
end_buffer_async_write() is invoked.

sdb1 is not a part of a raid device.

Hope that helps,

	tglx

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: 2.6.20-rc4-mm1
  2007-01-14 22:30           ` 2.6.20-rc4-mm1 Jens Axboe
@ 2007-01-15  8:22             ` Ingo Molnar
  2007-01-15 23:55               ` 2.6.20-rc4-mm1 Jens Axboe
  0 siblings, 1 reply; 9+ messages in thread
From: Ingo Molnar @ 2007-01-15  8:22 UTC (permalink / raw)
  To: Jens Axboe
  Cc: Thomas Gleixner, Andrew Morton, linux-kernel, jgarzik, linux-ide


* Jens Axboe <jens.axboe@oracle.com> wrote:

> > In a previous write invoked by: fsck.ext3(1896): WRITE block 8552 on 
> > sdb1 end_buffer_async_write() is invoked.
> > 
> > sdb1 is not a part of a raid device.
> 
> When I briefly tested this before I left (and found it broken), doing 
> a cat /proc/mdstat got things going again. Hard if that's your rootfs, 
> it's just a hint :-)

hm, so you knew it's broken, still you let Andrew pick it up, or am i 
misunderstanding something?

	Ingo

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: 2.6.20-rc4-mm1
  2007-01-15  8:22             ` 2.6.20-rc4-mm1 Ingo Molnar
@ 2007-01-15 23:55               ` Jens Axboe
  0 siblings, 0 replies; 9+ messages in thread
From: Jens Axboe @ 2007-01-15 23:55 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: Thomas Gleixner, Andrew Morton, linux-kernel, jgarzik, linux-ide

On Mon, Jan 15 2007, Ingo Molnar wrote:
> 
> * Jens Axboe <jens.axboe@oracle.com> wrote:
> 
> > > In a previous write invoked by: fsck.ext3(1896): WRITE block 8552 on 
> > > sdb1 end_buffer_async_write() is invoked.
> > > 
> > > sdb1 is not a part of a raid device.
> > 
> > When I briefly tested this before I left (and found it broken), doing 
> > a cat /proc/mdstat got things going again. Hard if that's your rootfs, 
> > it's just a hint :-)
> 
> hm, so you knew it's broken, still you let Andrew pick it up, or am i 
> misunderstanding something?

Well the raid issue wasn't known before it was in -mm.

-- 
Jens Axboe


^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2007-01-15 23:54 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <20070111222627.66bb75ab.akpm@osdl.org>
2007-01-14  9:48 ` 2.6.20-rc4-mm1 Thomas Gleixner
2007-01-14 10:46   ` 2.6.20-rc4-mm1 Thomas Gleixner
2007-01-14 14:40     ` 2.6.20-rc4-mm1 Thomas Gleixner
2007-01-14 22:05       ` 2.6.20-rc4-mm1 Jens Axboe
2007-01-14 22:31         ` 2.6.20-rc4-mm1 Thomas Gleixner
2007-01-14 22:30           ` 2.6.20-rc4-mm1 Jens Axboe
2007-01-15  8:22             ` 2.6.20-rc4-mm1 Ingo Molnar
2007-01-15 23:55               ` 2.6.20-rc4-mm1 Jens Axboe
2007-01-14 22:20       ` 2.6.20-rc4-mm1 Jens Axboe

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).