* Oops while booting 2.6.34-rc0 (block pull busted)
@ 2010-03-02 0:15 Dmitry Torokhov
2010-03-02 7:56 ` Jens Axboe
0 siblings, 1 reply; 14+ messages in thread
From: Dmitry Torokhov @ 2010-03-02 0:15 UTC (permalink / raw)
To: linux-kernel, Jens Axboe
Hi,
It looks like block tree that has been pulled today into mainline is
busted, I am getting the Oops below on boot with the following commit:
commit b1bf9368407ae7e89d8a005bb40beb70a41df539
Merge: 524df55 4671a13
Author: Linus Torvalds <torvalds@linux-foundation.org>
Date: Mon Mar 1 09:00:29 2010 -0800
Merge branch 'for-2.6.34' of git://git.kernel.dk/linux-2.6-block
but not with the previous one:
commit 524df55725217b13d5a232fb5badb5846418ea0e
Merge: 0f45339 6679ee1
Author: Linus Torvalds <torvalds@linux-foundation.org>
Date: Mon Mar 1 08:58:44 2010 -0800
Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tiwai/sound-2.6
This is on plain Fedora 12 VM.
Thanks.
--
Dmitry
sd 2:0:0:0: Attached scsi generic sg1 type 0
sd 2:0:0:0: [sda] 16777216 512-byte logical blocks: (8.58 GB/8.00 GiB)
sd 2:0:0:0: [sda] Write Protect is off
sd 2:0:0:0: [sda] Cache data unavailable
sd 2:0:0:0: [sda] Assuming drive cache: write through
sd 2:0:0:0: [sda] Cache data unavailable
sd 2:0:0:0: [sda] Assuming drive cache: write through
sda: sda1 sda2
sd 2:0:0:0: [sda] Cache data unavailable
sd 2:0:0:0: [sda] Assuming drive cache: write through
sd 2:0:0:0: [sda] Attached SCSI disk
device-mapper: multipath: version 1.1.1 loaded
dracut: Scanning devices sda2 for LVM volume groups
dracut: Reading all physical volumes. This may take a while...
dracut: Found volume group "VolGroup" using metadata type lvm2
dracut: 2 logical volume(s) in volume group "VolGroup" now active
EXT4-fs (dm-0): mounted filesystem with ordered data mode
BUG: unable to handle kernel NULL pointer dereference at (null)
IP: [<ffffffff81128ee1>] mpage_end_io_read+0x45/0x6f
PGD 3b776067 PUD 3b7b1067 PMD 0
Oops: 0002 [#1] SMP
last sysfs file: /sys/kernel/uevent_seqnum
CPU 0
Modules linked in: dm_multipath mptspi mptscsih mptbase scsi_transport_spi floppy [last unloaded: scsi_wait_scan]
Pid: 1, comm: init Not tainted 2.6.33 #4 440BX Desktop Reference Platform/VMware Virtual Platform
RIP: 0010:[<ffffffff81128ee1>] [<ffffffff81128ee1>] mpage_end_io_read+0x45/0x6f
RSP: 0018:ffff88003ea957b8 EFLAGS: 00010202
RAX: ffffffff81128e9c RBX: ffff880037740dd0 RCX: 0000000000000000
RDX: ffff880037f9c088 RSI: 0000000000000000 RDI: 0000000000000000
RBP: ffff88003ea957d8 R08: 0000000000000000 R09: ffff880037e93c08
R10: 0000000000000001 R11: 0000000000000001 R12: ffff880037740d80
R13: 0000000000000001 R14: 0000000000000000 R15: ffff880037740d80
FS: 00007f7cde1ec700(0000) GS:ffff880001e00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 000000003b79e000 CR4: 00000000000006f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process init (pid: 1, threadinfo ffff88003ea94000, task ffff88003ea98000)
Stack:
ffff88003ea957e8 ffff880037fe9208 ffff880037f9c000 0000000000000000
<0> ffff88003ea957e8 ffffffff811246cd ffff88003ea95838 ffffffff81352f8c
<0> ffff88003ea95828 ffff88003b7072f8 ffff88003ea95838 ffff880037f9c000
Call Trace:
[<ffffffff811246cd>] bio_endio+0x2b/0x2d
[<ffffffff81352f8c>] dec_pending+0x13d/0x15c
[<ffffffff81353bd2>] __split_and_process_bio+0x510/0x52b
[<ffffffff81353f8c>] dm_request+0x1cd/0x1e0
[<ffffffff811eb999>] generic_make_request+0x23b/0x2b0
[<ffffffff81356c78>] ? linear_merge+0x0/0x5d
[<ffffffff813540bf>] ? dm_merge_bvec+0xcb/0xec
[<ffffffff811ebae0>] submit_bio+0xd2/0xef
[<ffffffff81128e25>] mpage_bio_submit+0x27/0x2b
[<ffffffff811293c6>] do_mpage_readpage+0x3e0/0x483
[<ffffffff810cb385>] ? ____pagevec_lru_add+0x138/0x14f
[<ffffffff81129590>] mpage_readpages+0xc5/0x104
[<ffffffff81175f53>] ? ext4_get_block+0x0/0xe9
[<ffffffff81175f53>] ? ext4_get_block+0x0/0xe9
[<ffffffff81173880>] ext4_readpages+0x1d/0x1f
[<ffffffff810ca855>] __do_page_cache_readahead+0x103/0x176
[<ffffffff8100a5ce>] ? apic_timer_interrupt+0xe/0x20
[<ffffffff810ca8e9>] ra_submit+0x21/0x25
[<ffffffff810cab55>] ondemand_readahead+0x18e/0x1a1
[<ffffffff810cac25>] page_cache_sync_readahead+0x1c/0x1e
[<ffffffff810c4209>] generic_file_aio_read+0x201/0x504
[<ffffffff81101625>] do_sync_read+0xc4/0x101
[<ffffffff81205803>] ? might_fault+0x21/0x23
[<ffffffff811c98f3>] ? selinux_file_permission+0x5c/0xb3
[<ffffffff811bfcfd>] ? security_file_permission+0x16/0x18
[<ffffffff81101c8c>] vfs_read+0xab/0x108
[<ffffffff81101da9>] sys_read+0x4a/0x6e
[<ffffffff81009c32>] system_call_fastpath+0x16/0x1b
Code: 49 89 fc 41 83 e5 01 48 ff cb 48 c1 e3 04 48 03 5f 48 48 8b 3b 48 83 eb 10 49 3b 5c 24 48 72 06 48 8b 03 0f 0d 08 45 85 ed 74 06 <3e> 80 0f 08 eb 08 3e 80 27 f7 3e 80 0f
RIP [<ffffffff81128ee1>] mpage_end_io_read+0x45/0x6f
RSP <ffff88003ea957b8>
CR2: 0000000000000000
---[ end trace ffacf7730488df2f ]---
Kernel panic - not syncing: Attempted to kill init!
Pid: 1, comm: init Tainted: G D 2.6.33 #4
Call Trace:
[<ffffffff8142fd51>] panic+0x7a/0x13d
[<ffffffff8105628b>] ? exit_ptrace+0x38/0x121
[<ffffffff8104f5b9>] do_exit+0x7a/0x6f3
[<ffffffff8104bfc9>] ? spin_unlock_irqrestore+0xe/0x10
[<ffffffff8104cbe2>] ? kmsg_dump+0x12b/0x145
[<ffffffff81432ff6>] oops_end+0xbf/0xc7
[<ffffffff8102f8f5>] no_context+0x1fc/0x20b
[<ffffffff8100f967>] ? nommu_map_sg+0xd1/0xe5
[<ffffffff8102fa88>] __bad_area_nosemaphore+0x184/0x1a7
[<ffffffff8100a5ce>] ? apic_timer_interrupt+0xe/0x20
[<ffffffff8102fb08>] __bad_area+0x48/0x4f
[<ffffffff81434aab>] ? do_page_fault+0x1bd/0x2a0
[<ffffffff8102fb22>] bad_area+0x13/0x15
[<ffffffff81434ab9>] do_page_fault+0x1cb/0x2a0
[<ffffffff81432475>] page_fault+0x25/0x30
[<ffffffff81128e9c>] ? mpage_end_io_read+0x0/0x6f
[<ffffffff81128ee1>] ? mpage_end_io_read+0x45/0x6f
[<ffffffff811246cd>] bio_endio+0x2b/0x2d
[<ffffffff81352f8c>] dec_pending+0x13d/0x15c
[<ffffffff81353bd2>] __split_and_process_bio+0x510/0x52b
[<ffffffff81353f8c>] dm_request+0x1cd/0x1e0
[<ffffffff811eb999>] generic_make_request+0x23b/0x2b0
[<ffffffff81356c78>] ? linear_merge+0x0/0x5d
[<ffffffff813540bf>] ? dm_merge_bvec+0xcb/0xec
[<ffffffff811ebae0>] submit_bio+0xd2/0xef
[<ffffffff81128e25>] mpage_bio_submit+0x27/0x2b
[<ffffffff811293c6>] do_mpage_readpage+0x3e0/0x483
[<ffffffff810cb385>] ? ____pagevec_lru_add+0x138/0x14f
[<ffffffff81129590>] mpage_readpages+0xc5/0x104
[<ffffffff81175f53>] ? ext4_get_block+0x0/0xe9
[<ffffffff81175f53>] ? ext4_get_block+0x0/0xe9
[<ffffffff81173880>] ext4_readpages+0x1d/0x1f
[<ffffffff810ca855>] __do_page_cache_readahead+0x103/0x176
[<ffffffff8100a5ce>] ? apic_timer_interrupt+0xe/0x20
[<ffffffff810ca8e9>] ra_submit+0x21/0x25
[<ffffffff810cab55>] ondemand_readahead+0x18e/0x1a1
[<ffffffff810cac25>] page_cache_sync_readahead+0x1c/0x1e
[<ffffffff810c4209>] generic_file_aio_read+0x201/0x504
[<ffffffff81101625>] do_sync_read+0xc4/0x101
[<ffffffff81205803>] ? might_fault+0x21/0x23
[<ffffffff811c98f3>] ? selinux_file_permission+0x5c/0xb3
[<ffffffff811bfcfd>] ? security_file_permission+0x16/0x18
[<ffffffff81101c8c>] vfs_read+0xab/0x108
[<ffffffff81101da9>] sys_read+0x4a/0x6e
[<ffffffff81009c32>] system_call_fastpath+0x16/0x1b
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Oops while booting 2.6.34-rc0 (block pull busted)
2010-03-02 0:15 Oops while booting 2.6.34-rc0 (block pull busted) Dmitry Torokhov
@ 2010-03-02 7:56 ` Jens Axboe
2010-03-02 8:15 ` Jens Axboe
0 siblings, 1 reply; 14+ messages in thread
From: Jens Axboe @ 2010-03-02 7:56 UTC (permalink / raw)
To: Dmitry Torokhov; +Cc: linux-kernel
On Mon, Mar 01 2010, Dmitry Torokhov wrote:
> Hi,
>
> It looks like block tree that has been pulled today into mainline is
> busted, I am getting the Opps below on boot with the following commit:
>
> commit b1bf9368407ae7e89d8a005bb40beb70a41df539
> Merge: 524df55 4671a13
> Author: Linus Torvalds <torvalds@linux-foundation.org>
> Date: Mon Mar 1 09:00:29 2010 -0800
>
> Merge branch 'for-2.6.34' of git://git.kernel.dk/linux-2.6-block
>
>
> but not with the previous one:
>
> commit 524df55725217b13d5a232fb5badb5846418ea0e
> Merge: 0f45339 6679ee1
> Author: Linus Torvalds <torvalds@linux-foundation.org>
> Date: Mon Mar 1 08:58:44 2010 -0800
>
> Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tiwai/sound-2.6
>
> This is on plain Fedora 12 VM.
>
> Thanks.
>
> --
> Dmitry
>
> sd 2:0:0:0: Attached scsi generic sg1 type 0
> sd 2:0:0:0: [sda] 16777216 512-byte logical blocks: (8.58 GB/8.00 GiB)
> sd 2:0:0:0: [sda] Write Protect is off
> sd 2:0:0:0: [sda] Cache data unavailable
> sd 2:0:0:0: [sda] Assuming drive cache: write through
> sd 2:0:0:0: [sda] Cache data unavailable
> sd 2:0:0:0: [sda] Assuming drive cache: write through
> sda: sda1 sda2
> sd 2:0:0:0: [sda] Cache data unavailable
> sd 2:0:0:0: [sda] Assuming drive cache: write through
> sd 2:0:0:0: [sda] Attached SCSI disk
> device-mapper: multipath: version 1.1.1 loaded
> dracut: Scanning devices sda2 for LVM volume groups
> dracut: Reading all physical volumes. This may take a while...
> dracut: Found volume group "VolGroup" using metadata type lvm2
> dracut: 2 logical volume(s) in volume group "VolGroup" now active
> EXT4-fs (dm-0): mounted filesystem with ordered data mode
> BUG: unable to handle kernel NULL pointer dereference at (null)
> IP: [<ffffffff81128ee1>] mpage_end_io_read+0x45/0x6f
> PGD 3b776067 PUD 3b7b1067 PMD 0
> Oops: 0002 [#1] SMP
> last sysfs file: /sys/kernel/uevent_seqnum
> CPU 0
> Modules linked in: dm_multipath mptspi mptscsih mptbase scsi_transport_spi floppy [last unloaded: scsi_wait_scan]
>
> Pid: 1, comm: init Not tainted 2.6.33 #4 440BX Desktop Reference Platform/VMware Virtual Platform
> RIP: 0010:[<ffffffff81128ee1>] [<ffffffff81128ee1>] mpage_end_io_read+0x45/0x6f
Can you check where that is? Just do a gdb vmlinux and then an
l *mpage_end_io_read+0x45
--
Jens Axboe
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Oops while booting 2.6.34-rc0 (block pull busted)
2010-03-02 7:56 ` Jens Axboe
@ 2010-03-02 8:15 ` Jens Axboe
2010-03-02 8:39 ` Jens Axboe
2010-03-02 10:13 ` walt
0 siblings, 2 replies; 14+ messages in thread
From: Jens Axboe @ 2010-03-02 8:15 UTC (permalink / raw)
To: Dmitry Torokhov; +Cc: linux-kernel
On Tue, Mar 02 2010, Jens Axboe wrote:
> On Mon, Mar 01 2010, Dmitry Torokhov wrote:
> > Hi,
> >
> > It looks like block tree that has been pulled today into mainline is
> > busted, I am getting the Opps below on boot with the following commit:
> >
> > commit b1bf9368407ae7e89d8a005bb40beb70a41df539
> > Merge: 524df55 4671a13
> > Author: Linus Torvalds <torvalds@linux-foundation.org>
> > Date: Mon Mar 1 09:00:29 2010 -0800
> >
> > Merge branch 'for-2.6.34' of git://git.kernel.dk/linux-2.6-block
> >
> >
> > but not with the previous one:
> >
> > commit 524df55725217b13d5a232fb5badb5846418ea0e
> > Merge: 0f45339 6679ee1
> > Author: Linus Torvalds <torvalds@linux-foundation.org>
> > Date: Mon Mar 1 08:58:44 2010 -0800
> >
> > Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tiwai/sound-2.6
> >
> > This is on plain Fedora 12 VM.
> >
> > Thanks.
> >
> > --
> > Dmitry
> >
> > sd 2:0:0:0: Attached scsi generic sg1 type 0
> > sd 2:0:0:0: [sda] 16777216 512-byte logical blocks: (8.58 GB/8.00 GiB)
> > sd 2:0:0:0: [sda] Write Protect is off
> > sd 2:0:0:0: [sda] Cache data unavailable
> > sd 2:0:0:0: [sda] Assuming drive cache: write through
> > sd 2:0:0:0: [sda] Cache data unavailable
> > sd 2:0:0:0: [sda] Assuming drive cache: write through
> > sda: sda1 sda2
> > sd 2:0:0:0: [sda] Cache data unavailable
> > sd 2:0:0:0: [sda] Assuming drive cache: write through
> > sd 2:0:0:0: [sda] Attached SCSI disk
> > device-mapper: multipath: version 1.1.1 loaded
> > dracut: Scanning devices sda2 for LVM volume groups
> > dracut: Reading all physical volumes. This may take a while...
> > dracut: Found volume group "VolGroup" using metadata type lvm2
> > dracut: 2 logical volume(s) in volume group "VolGroup" now active
> > EXT4-fs (dm-0): mounted filesystem with ordered data mode
> > BUG: unable to handle kernel NULL pointer dereference at (null)
> > IP: [<ffffffff81128ee1>] mpage_end_io_read+0x45/0x6f
> > PGD 3b776067 PUD 3b7b1067 PMD 0
> > Oops: 0002 [#1] SMP
> > last sysfs file: /sys/kernel/uevent_seqnum
> > CPU 0
> > Modules linked in: dm_multipath mptspi mptscsih mptbase scsi_transport_spi floppy [last unloaded: scsi_wait_scan]
> >
> > Pid: 1, comm: init Not tainted 2.6.33 #4 440BX Desktop Reference Platform/VMware Virtual Platform
> > RIP: 0010:[<ffffffff81128ee1>] [<ffffffff81128ee1>] mpage_end_io_read+0x45/0x6f
>
> Can you check where that is? Just do a gdb vmlinux and then an
> l *mpage_end_io_read+0x45
I tried checking mine here, but we must be using vastly different gcc
versions. So I'd like that output. Can you also try and see if reverting
9f7cdbc33f36d28e57eaba0093f68f0d14c38c5b makes it work?
--
Jens Axboe
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Oops while booting 2.6.34-rc0 (block pull busted)
2010-03-02 8:15 ` Jens Axboe
@ 2010-03-02 8:39 ` Jens Axboe
2010-03-02 9:35 ` Dmitry Torokhov
2010-03-02 10:13 ` walt
1 sibling, 1 reply; 14+ messages in thread
From: Jens Axboe @ 2010-03-02 8:39 UTC (permalink / raw)
To: Dmitry Torokhov; +Cc: linux-kernel
On Tue, Mar 02 2010, Jens Axboe wrote:
> On Tue, Mar 02 2010, Jens Axboe wrote:
> > On Mon, Mar 01 2010, Dmitry Torokhov wrote:
> > > Hi,
> > >
> > > It looks like block tree that has been pulled today into mainline is
> > > busted, I am getting the Opps below on boot with the following commit:
> > >
> > > commit b1bf9368407ae7e89d8a005bb40beb70a41df539
> > > Merge: 524df55 4671a13
> > > Author: Linus Torvalds <torvalds@linux-foundation.org>
> > > Date: Mon Mar 1 09:00:29 2010 -0800
> > >
> > > Merge branch 'for-2.6.34' of git://git.kernel.dk/linux-2.6-block
> > >
> > >
> > > but not with the previous one:
> > >
> > > commit 524df55725217b13d5a232fb5badb5846418ea0e
> > > Merge: 0f45339 6679ee1
> > > Author: Linus Torvalds <torvalds@linux-foundation.org>
> > > Date: Mon Mar 1 08:58:44 2010 -0800
> > >
> > > Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tiwai/sound-2.6
> > >
> > > This is on plain Fedora 12 VM.
> > >
> > > Thanks.
> > >
> > > --
> > > Dmitry
> > >
> > > sd 2:0:0:0: Attached scsi generic sg1 type 0
> > > sd 2:0:0:0: [sda] 16777216 512-byte logical blocks: (8.58 GB/8.00 GiB)
> > > sd 2:0:0:0: [sda] Write Protect is off
> > > sd 2:0:0:0: [sda] Cache data unavailable
> > > sd 2:0:0:0: [sda] Assuming drive cache: write through
> > > sd 2:0:0:0: [sda] Cache data unavailable
> > > sd 2:0:0:0: [sda] Assuming drive cache: write through
> > > sda: sda1 sda2
> > > sd 2:0:0:0: [sda] Cache data unavailable
> > > sd 2:0:0:0: [sda] Assuming drive cache: write through
> > > sd 2:0:0:0: [sda] Attached SCSI disk
> > > device-mapper: multipath: version 1.1.1 loaded
> > > dracut: Scanning devices sda2 for LVM volume groups
> > > dracut: Reading all physical volumes. This may take a while...
> > > dracut: Found volume group "VolGroup" using metadata type lvm2
> > > dracut: 2 logical volume(s) in volume group "VolGroup" now active
> > > EXT4-fs (dm-0): mounted filesystem with ordered data mode
> > > BUG: unable to handle kernel NULL pointer dereference at (null)
> > > IP: [<ffffffff81128ee1>] mpage_end_io_read+0x45/0x6f
> > > PGD 3b776067 PUD 3b7b1067 PMD 0
> > > Oops: 0002 [#1] SMP
> > > last sysfs file: /sys/kernel/uevent_seqnum
> > > CPU 0
> > > Modules linked in: dm_multipath mptspi mptscsih mptbase scsi_transport_spi floppy [last unloaded: scsi_wait_scan]
> > >
> > > Pid: 1, comm: init Not tainted 2.6.33 #4 440BX Desktop Reference Platform/VMware Virtual Platform
> > > RIP: 0010:[<ffffffff81128ee1>] [<ffffffff81128ee1>] mpage_end_io_read+0x45/0x6f
> >
> > Can you check where that is? Just do a gdb vmlinux and then an
> > l *mpage_end_io_read+0x45
>
> I tried checking mine here, but we must be using vastly different gcc
> versions. So I'd like that output. Can you also try and see if reverting
> 9f7cdbc33f36d28e57eaba0093f68f0d14c38c5b makes it work?
OK, so disasm of that reveals that
12: 3e 80 0f 08 orb $0x8,%ds:(%rdi)
is the start of the faulting instruction. You are running UP. 0x8 is the
4th bit, so I'd be surprised if that isn't SetPageUptodate(page).
--
Jens Axboe
^ permalink raw reply [flat|nested] 14+ messages in thread
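Editorial sketch: the reading above (a store through %rdi faulting, with RDI and CR2 both zero in the report) is consistent with the `Oops: 0002` line. On x86 the number printed after "Oops:" is the page-fault error code, and its low three bits say whether the page was present, whether the access was a write, and whether the fault came from user mode. The helper below is a minimal user-space illustration; `decode_pf_error` and `struct pf_error` are hypothetical names, not kernel symbols.

```c
#include <stdbool.h>

/* Decoded view of the low bits of an x86 page-fault error code. */
struct pf_error {
    bool protection; /* bit 0: 1 = protection violation, 0 = page not present */
    bool write;      /* bit 1: 1 = write access, 0 = read access */
    bool user;       /* bit 2: 1 = fault raised in user mode, 0 = kernel mode */
};

/* Hypothetical helper for illustration only. */
struct pf_error decode_pf_error(unsigned long code)
{
    struct pf_error e;

    e.protection = (code & 0x1) != 0;
    e.write      = (code & 0x2) != 0;
    e.user       = (code & 0x4) != 0;
    return e;
}
```

Feeding it 0x0002 gives a kernel-mode write to a not-present page, which is what an `orb` to a NULL address would be expected to produce.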
* Re: Oops while booting 2.6.34-rc0 (block pull busted)
2010-03-02 8:39 ` Jens Axboe
@ 2010-03-02 9:35 ` Dmitry Torokhov
2010-03-02 22:51 ` Dmitry Torokhov
0 siblings, 1 reply; 14+ messages in thread
From: Dmitry Torokhov @ 2010-03-02 9:35 UTC (permalink / raw)
To: Jens Axboe; +Cc: linux-kernel
On Tue, Mar 02, 2010 at 09:39:07AM +0100, Jens Axboe wrote:
> On Tue, Mar 02 2010, Jens Axboe wrote:
> > On Tue, Mar 02 2010, Jens Axboe wrote:
> > > On Mon, Mar 01 2010, Dmitry Torokhov wrote:
> > > > Hi,
> > > >
> > > > It looks like block tree that has been pulled today into mainline is
> > > > busted, I am getting the Opps below on boot with the following commit:
> > > >
> > > > commit b1bf9368407ae7e89d8a005bb40beb70a41df539
> > > > Merge: 524df55 4671a13
> > > > Author: Linus Torvalds <torvalds@linux-foundation.org>
> > > > Date: Mon Mar 1 09:00:29 2010 -0800
> > > >
> > > > Merge branch 'for-2.6.34' of git://git.kernel.dk/linux-2.6-block
> > > >
> > > >
> > > > but not with the previous one:
> > > >
> > > > commit 524df55725217b13d5a232fb5badb5846418ea0e
> > > > Merge: 0f45339 6679ee1
> > > > Author: Linus Torvalds <torvalds@linux-foundation.org>
> > > > Date: Mon Mar 1 08:58:44 2010 -0800
> > > >
> > > > Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tiwai/sound-2.6
> > > >
> > > > This is on plain Fedora 12 VM.
> > > >
> > > > Thanks.
> > > >
> > > > --
> > > > Dmitry
> > > >
> > > > sd 2:0:0:0: Attached scsi generic sg1 type 0
> > > > sd 2:0:0:0: [sda] 16777216 512-byte logical blocks: (8.58 GB/8.00 GiB)
> > > > sd 2:0:0:0: [sda] Write Protect is off
> > > > sd 2:0:0:0: [sda] Cache data unavailable
> > > > sd 2:0:0:0: [sda] Assuming drive cache: write through
> > > > sd 2:0:0:0: [sda] Cache data unavailable
> > > > sd 2:0:0:0: [sda] Assuming drive cache: write through
> > > > sda: sda1 sda2
> > > > sd 2:0:0:0: [sda] Cache data unavailable
> > > > sd 2:0:0:0: [sda] Assuming drive cache: write through
> > > > sd 2:0:0:0: [sda] Attached SCSI disk
> > > > device-mapper: multipath: version 1.1.1 loaded
> > > > dracut: Scanning devices sda2 for LVM volume groups
> > > > dracut: Reading all physical volumes. This may take a while...
> > > > dracut: Found volume group "VolGroup" using metadata type lvm2
> > > > dracut: 2 logical volume(s) in volume group "VolGroup" now active
> > > > EXT4-fs (dm-0): mounted filesystem with ordered data mode
> > > > BUG: unable to handle kernel NULL pointer dereference at (null)
> > > > IP: [<ffffffff81128ee1>] mpage_end_io_read+0x45/0x6f
> > > > PGD 3b776067 PUD 3b7b1067 PMD 0
> > > > Oops: 0002 [#1] SMP
> > > > last sysfs file: /sys/kernel/uevent_seqnum
> > > > CPU 0
> > > > Modules linked in: dm_multipath mptspi mptscsih mptbase scsi_transport_spi floppy [last unloaded: scsi_wait_scan]
> > > >
> > > > Pid: 1, comm: init Not tainted 2.6.33 #4 440BX Desktop Reference Platform/VMware Virtual Platform
> > > > RIP: 0010:[<ffffffff81128ee1>] [<ffffffff81128ee1>] mpage_end_io_read+0x45/0x6f
> > >
> > > Can you check where that is? Just do a gdb vmlinux and then an
> > > l *mpage_end_io_read+0x45
> >
> > I tried checking mine here, but we must be using vastly different gcc
> > versions. So I'd like that output. Can you also try and see if reverting
> > 9f7cdbc33f36d28e57eaba0093f68f0d14c38c5b makes it work?
>
> OK, so disasm of that reveals that
>
> 12: 3e 80 0f 08 orb $0x8,%ds:(%rdi)
>
> is the start of the faulting instruction. You are running UP. 0x8 is the
> 4th bit, so I'd be surprised if that isn't SetPageUptodate(page).
>
Sorry, don't have access to that box at the moment... Will try checking
tomorrow.
--
Dmitry
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Oops while booting 2.6.34-rc0 (block pull busted)
2010-03-02 9:35 ` Dmitry Torokhov
@ 2010-03-02 22:51 ` Dmitry Torokhov
2010-03-03 7:31 ` Jens Axboe
0 siblings, 1 reply; 14+ messages in thread
From: Dmitry Torokhov @ 2010-03-02 22:51 UTC (permalink / raw)
To: Jens Axboe; +Cc: linux-kernel
On Tue, Mar 02, 2010 at 01:35:48AM -0800, Dmitry Torokhov wrote:
> On Tue, Mar 02, 2010 at 09:39:07AM +0100, Jens Axboe wrote:
> > On Tue, Mar 02 2010, Jens Axboe wrote:
> > > On Tue, Mar 02 2010, Jens Axboe wrote:
> > > > On Mon, Mar 01 2010, Dmitry Torokhov wrote:
> > > > > Hi,
> > > > >
> > > > > It looks like block tree that has been pulled today into mainline is
> > > > > busted, I am getting the Opps below on boot with the following commit:
> > > > >
> > > > > commit b1bf9368407ae7e89d8a005bb40beb70a41df539
> > > > > Merge: 524df55 4671a13
> > > > > Author: Linus Torvalds <torvalds@linux-foundation.org>
> > > > > Date: Mon Mar 1 09:00:29 2010 -0800
> > > > >
> > > > > Merge branch 'for-2.6.34' of git://git.kernel.dk/linux-2.6-block
> > > > >
> > > > >
> > > > > but not with the previous one:
> > > > >
> > > > > commit 524df55725217b13d5a232fb5badb5846418ea0e
> > > > > Merge: 0f45339 6679ee1
> > > > > Author: Linus Torvalds <torvalds@linux-foundation.org>
> > > > > Date: Mon Mar 1 08:58:44 2010 -0800
> > > > >
> > > > > Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tiwai/sound-2.6
> > > > >
> > > > > This is on plain Fedora 12 VM.
> > > > >
> > > > > Thanks.
> > > > >
> > > > > --
> > > > > Dmitry
> > > > >
> > > > > sd 2:0:0:0: Attached scsi generic sg1 type 0
> > > > > sd 2:0:0:0: [sda] 16777216 512-byte logical blocks: (8.58 GB/8.00 GiB)
> > > > > sd 2:0:0:0: [sda] Write Protect is off
> > > > > sd 2:0:0:0: [sda] Cache data unavailable
> > > > > sd 2:0:0:0: [sda] Assuming drive cache: write through
> > > > > sd 2:0:0:0: [sda] Cache data unavailable
> > > > > sd 2:0:0:0: [sda] Assuming drive cache: write through
> > > > > sda: sda1 sda2
> > > > > sd 2:0:0:0: [sda] Cache data unavailable
> > > > > sd 2:0:0:0: [sda] Assuming drive cache: write through
> > > > > sd 2:0:0:0: [sda] Attached SCSI disk
> > > > > device-mapper: multipath: version 1.1.1 loaded
> > > > > dracut: Scanning devices sda2 for LVM volume groups
> > > > > dracut: Reading all physical volumes. This may take a while...
> > > > > dracut: Found volume group "VolGroup" using metadata type lvm2
> > > > > dracut: 2 logical volume(s) in volume group "VolGroup" now active
> > > > > EXT4-fs (dm-0): mounted filesystem with ordered data mode
> > > > > BUG: unable to handle kernel NULL pointer dereference at (null)
> > > > > IP: [<ffffffff81128ee1>] mpage_end_io_read+0x45/0x6f
> > > > > PGD 3b776067 PUD 3b7b1067 PMD 0
> > > > > Oops: 0002 [#1] SMP
> > > > > last sysfs file: /sys/kernel/uevent_seqnum
> > > > > CPU 0
> > > > > Modules linked in: dm_multipath mptspi mptscsih mptbase scsi_transport_spi floppy [last unloaded: scsi_wait_scan]
> > > > >
> > > > > Pid: 1, comm: init Not tainted 2.6.33 #4 440BX Desktop Reference Platform/VMware Virtual Platform
> > > > > RIP: 0010:[<ffffffff81128ee1>] [<ffffffff81128ee1>] mpage_end_io_read+0x45/0x6f
> > > >
> > > > Can you check where that is? Just do a gdb vmlinux and then an
> > > > l *mpage_end_io_read+0x45
> > >
> > > I tried checking mine here, but we must be using vastly different gcc
> > > versions. So I'd like that output. Can you also try and see if reverting
> > > 9f7cdbc33f36d28e57eaba0093f68f0d14c38c5b makes it work?
> >
> > OK, so disasm of that reveals that
> >
> > 12: 3e 80 0f 08 orb $0x8,%ds:(%rdi)
> >
> > is the start of the faulting instruction. You are running UP. 0x8 is the
> > 4th bit, so I'd be surprised if that isn't SetPageUptodate(page).
> >
>
> Sorry, don't have access to that box at the moment... Will try checking
> tomorrow.
>
You are absolutely right, it crashes in SetPageUptodate():
(gdb) l *bio_endio+0x2b
0xffffffff8112209d is in bio_endio (fs/bio.c:1433).
1428        else if (!test_bit(BIO_UPTODATE, &bio->bi_flags))
1429            error = -EIO;
1430
1431        if (bio->bi_end_io)
1432            bio->bi_end_io(bio, error);
1433    }
1434    EXPORT_SYMBOL(bio_endio);
1435
1436    void bio_pair_release(struct bio_pair *bp)
1437    {
(gdb) l *mpage_end_io_read+0x45
0xffffffff811268b1 is in mpage_end_io_read (/home/dtor/kernel/linus/arch/x86/include/asm/bitops.h:63).
58       */
59      static __always_inline void
60      set_bit(unsigned int nr, volatile unsigned long *addr)
61      {
62          if (IS_IMMEDIATE(nr)) {
63              asm volatile(LOCK_PREFIX "orb %1,%0"
64                  : CONST_MASK_ADDR(nr, addr)
65                  : "iq" ((u8)CONST_MASK(nr))
66                  : "memory");
67          } else {
(gdb) l *mpage_end_io_read+0x44
0xffffffff811268b0 is in mpage_end_io_read (fs/mpage.c:53).
48              struct page *page = bvec->bv_page;
49
50              if (--bvec >= bio->bi_io_vec)
51                  prefetchw(&bvec->bv_page->flags);
52
53              if (uptodate) {
54                  SetPageUptodate(page);
55              } else {
56                  ClearPageUptodate(page);
57                  SetPageError(page);
--
Dmitry
^ permalink raw reply [flat|nested] 14+ messages in thread
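Editorial sketch: the `set_bit()` listing above shows why a page-flag update with a constant bit number shows up as a single `orb` with an immediate. The bit number is split into a byte offset and a mask within that byte. The two helpers below mirror that arithmetic in user space, assuming `CONST_MASK(nr)` is `1 << (nr & 7)` and `CONST_MASK_ADDR` advances the address by `nr >> 3` bytes, and assuming `PG_uptodate` was bit 3 of `page->flags` in kernels of this era. Both assumptions come from memory of the headers, not from this thread; `const_mask` and `const_mask_byte` are hypothetical names.

```c
#include <stdint.h>

/* Assumed value: PG_uptodate as bit 3 of page->flags (not stated in the thread). */
enum { PG_UPTODATE_BIT = 3 };

/* Byte offset, from the start of the flags word, that a constant-nr set_bit() targets. */
unsigned int const_mask_byte(unsigned int nr)
{
    return nr >> 3;
}

/* Immediate operand of the orb instruction for bit number nr. */
uint8_t const_mask(unsigned int nr)
{
    return (uint8_t)(1u << (nr & 7));
}
```

With bit 3 this yields offset 0 and mask 0x08, matching `orb $0x8,(%rdi)`. The complement, 0xf7, matches the `3e 80 27 f7` bytes that follow in the Code line, which would be the clear-bit path.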
* Re: Oops while booting 2.6.34-rc0 (block pull busted)
2010-03-02 22:51 ` Dmitry Torokhov
@ 2010-03-03 7:31 ` Jens Axboe
0 siblings, 0 replies; 14+ messages in thread
From: Jens Axboe @ 2010-03-03 7:31 UTC (permalink / raw)
To: Dmitry Torokhov; +Cc: linux-kernel
On Tue, Mar 02 2010, Dmitry Torokhov wrote:
> On Tue, Mar 02, 2010 at 01:35:48AM -0800, Dmitry Torokhov wrote:
> > On Tue, Mar 02, 2010 at 09:39:07AM +0100, Jens Axboe wrote:
> > > On Tue, Mar 02 2010, Jens Axboe wrote:
> > > > On Tue, Mar 02 2010, Jens Axboe wrote:
> > > > > On Mon, Mar 01 2010, Dmitry Torokhov wrote:
> > > > > > Hi,
> > > > > >
> > > > > > It looks like block tree that has been pulled today into mainline is
> > > > > > busted, I am getting the Opps below on boot with the following commit:
> > > > > >
> > > > > > commit b1bf9368407ae7e89d8a005bb40beb70a41df539
> > > > > > Merge: 524df55 4671a13
> > > > > > Author: Linus Torvalds <torvalds@linux-foundation.org>
> > > > > > Date: Mon Mar 1 09:00:29 2010 -0800
> > > > > >
> > > > > > Merge branch 'for-2.6.34' of git://git.kernel.dk/linux-2.6-block
> > > > > >
> > > > > >
> > > > > > but not with the previous one:
> > > > > >
> > > > > > commit 524df55725217b13d5a232fb5badb5846418ea0e
> > > > > > Merge: 0f45339 6679ee1
> > > > > > Author: Linus Torvalds <torvalds@linux-foundation.org>
> > > > > > Date: Mon Mar 1 08:58:44 2010 -0800
> > > > > >
> > > > > > Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tiwai/sound-2.6
> > > > > >
> > > > > > This is on plain Fedora 12 VM.
> > > > > >
> > > > > > Thanks.
> > > > > >
> > > > > > --
> > > > > > Dmitry
> > > > > >
> > > > > > sd 2:0:0:0: Attached scsi generic sg1 type 0
> > > > > > sd 2:0:0:0: [sda] 16777216 512-byte logical blocks: (8.58 GB/8.00 GiB)
> > > > > > sd 2:0:0:0: [sda] Write Protect is off
> > > > > > sd 2:0:0:0: [sda] Cache data unavailable
> > > > > > sd 2:0:0:0: [sda] Assuming drive cache: write through
> > > > > > sd 2:0:0:0: [sda] Cache data unavailable
> > > > > > sd 2:0:0:0: [sda] Assuming drive cache: write through
> > > > > > sda: sda1 sda2
> > > > > > sd 2:0:0:0: [sda] Cache data unavailable
> > > > > > sd 2:0:0:0: [sda] Assuming drive cache: write through
> > > > > > sd 2:0:0:0: [sda] Attached SCSI disk
> > > > > > device-mapper: multipath: version 1.1.1 loaded
> > > > > > dracut: Scanning devices sda2 for LVM volume groups
> > > > > > dracut: Reading all physical volumes. This may take a while...
> > > > > > dracut: Found volume group "VolGroup" using metadata type lvm2
> > > > > > dracut: 2 logical volume(s) in volume group "VolGroup" now active
> > > > > > EXT4-fs (dm-0): mounted filesystem with ordered data mode
> > > > > > BUG: unable to handle kernel NULL pointer dereference at (null)
> > > > > > IP: [<ffffffff81128ee1>] mpage_end_io_read+0x45/0x6f
> > > > > > PGD 3b776067 PUD 3b7b1067 PMD 0
> > > > > > Oops: 0002 [#1] SMP
> > > > > > last sysfs file: /sys/kernel/uevent_seqnum
> > > > > > CPU 0
> > > > > > Modules linked in: dm_multipath mptspi mptscsih mptbase scsi_transport_spi floppy [last unloaded: scsi_wait_scan]
> > > > > >
> > > > > > Pid: 1, comm: init Not tainted 2.6.33 #4 440BX Desktop Reference Platform/VMware Virtual Platform
> > > > > > RIP: 0010:[<ffffffff81128ee1>] [<ffffffff81128ee1>] mpage_end_io_read+0x45/0x6f
> > > > >
> > > > > Can you check where that is? Just do a gdb vmlinux and then an
> > > > > l *mpage_end_io_read+0x45
> > > >
> > > > I tried checking mine here, but we must be using vastly different gcc
> > > > versions. So I'd like that output. Can you also try and see if reverting
> > > > 9f7cdbc33f36d28e57eaba0093f68f0d14c38c5b makes it work?
> > >
> > > OK, so disasm of that reveals that
> > >
> > > 12: 3e 80 0f 08 orb $0x8,%ds:(%rdi)
> > >
> > > is the start of the faulting instruction. You are running UP. 0x8 is the
> > > 4th bit, so I'd be surprised if that isn't SetPageUptodate(page).
> > >
> >
> > Sorry, don't have access to that box at the moment... Will try checking
> > tomorrow.
> >
>
> You are absolutely right, it crashes in SetPageUptodate():
I think what happens here is that since the add_page logic got borked,
mpage_end_io_read() barfs on a bio that doesn't actually contain any
pages. It's reverted now, so everything should be fine in current -git.
--
Jens Axboe
^ permalink raw reply [flat|nested] 14+ messages in thread
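Editorial sketch: the explanation above (a completion handler tripping over a bio with no pages) can be modeled in user space. The gdb listing earlier in the thread shows only lines 48-57 of fs/mpage.c, so the loop shape used here is an assumption: the handler starts at the last `bio_vec` and walks backwards in a do/while loop, which means the body runs once before any bounds check. Indices stand in for the kernel's `bvec` pointer, and `model_end_io_walk` and `struct walk_result` are hypothetical names.

```c
#include <stddef.h>

/* Result of simulating the completion walk over a bio with vcnt segments. */
struct walk_result {
    ptrdiff_t first_index; /* index of the first bio_vec the loop would dereference */
    int iterations;        /* number of times the loop body runs */
};

/* Hypothetical model; the do/while shape is assumed, not taken from the thread. */
struct walk_result model_end_io_walk(int vcnt)
{
    struct walk_result r;
    ptrdiff_t i = (ptrdiff_t)vcnt - 1; /* bvec = bi_io_vec + bi_vcnt - 1 */

    r.first_index = i;
    r.iterations = 0;
    do {
        r.iterations++;                /* body: page = bvec->bv_page; update page flags */
        --i;                           /* --bvec */
    } while (i >= 0);                  /* while (bvec >= bio->bi_io_vec) */
    return r;
}
```

For a bio with two segments the model visits indices 1 and 0. For an empty bio it still runs the body once, at index -1, so under these assumptions the handler would read a page pointer from outside the array, which fits the NULL in %rdi seen in the report.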
* Re: Oops while booting 2.6.34-rc0 (block pull busted)
2010-03-02 8:15 ` Jens Axboe
2010-03-02 8:39 ` Jens Axboe
@ 2010-03-02 10:13 ` walt
2010-03-02 16:50 ` Michael Breuer
1 sibling, 1 reply; 14+ messages in thread
From: walt @ 2010-03-02 10:13 UTC (permalink / raw)
To: linux-kernel
On 03/02/2010 12:15 AM, Jens Axboe wrote:
>> On Mon, Mar 01 2010, Dmitry Torokhov wrote:
>>> It looks like block tree that has been pulled today into mainline is
>>> busted, I am getting the Opps below on boot with the following commit:
>>>
>>> commit b1bf9368407ae7e89d8a005bb40beb70a41df539
>....Can you also try and see if reverting
> 9f7cdbc33f36d28e57eaba0093f68f0d14c38c5b makes it work?
I'm getting the same oops and reverting that commit fixes it, thanks.
I'm happy to test patches, etc.
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Oops while booting 2.6.34-rc0 (block pull busted)
2010-03-02 10:13 ` walt
@ 2010-03-02 16:50 ` Michael Breuer
2010-03-02 17:42 ` Steven Rostedt
0 siblings, 1 reply; 14+ messages in thread
From: Michael Breuer @ 2010-03-02 16:50 UTC (permalink / raw)
To: walt; +Cc: linux-kernel
On 3/2/2010 5:13 AM, walt wrote:
> On 03/02/2010 12:15 AM, Jens Axboe wrote:
>>> On Mon, Mar 01 2010, Dmitry Torokhov wrote:
>
>>>> It looks like block tree that has been pulled today into mainline is
>>>> busted, I am getting the Opps below on boot with the following commit:
>>>>
>>>> commit b1bf9368407ae7e89d8a005bb40beb70a41df539
>
>
>> ....Can you also try and see if reverting
>> 9f7cdbc33f36d28e57eaba0093f68f0d14c38c5b makes it work?
>
> I'm getting the same oops and reverting that commit fixes it, thanks.
> I'm happy to test patches, etc.
>
Same here - was unable to boot - revert of this solved the issue.
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Oops while booting 2.6.34-rc0 (block pull busted)
2010-03-02 16:50 ` Michael Breuer
@ 2010-03-02 17:42 ` Steven Rostedt
2010-03-02 17:49 ` Steven Rostedt
0 siblings, 1 reply; 14+ messages in thread
From: Steven Rostedt @ 2010-03-02 17:42 UTC (permalink / raw)
To: Michael Breuer; +Cc: walt, linux-kernel
On Tue, Mar 02, 2010 at 11:50:15AM -0500, Michael Breuer wrote:
> >
> >I'm getting the same oops and reverting that commit fixes it, thanks.
> >I'm happy to test patches, etc.
> >
Seems we have a winner!
I had the same bug:
http://pastebin.com/iiLgJMwy
and reverting this commit fixes it.
-- Steve
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Oops while booting 2.6.34-rc0 (block pull busted)
2010-03-02 17:42 ` Steven Rostedt
@ 2010-03-02 17:49 ` Steven Rostedt
2010-03-02 18:21 ` [GIT PULL] single block IO revert (Was "Re: Oops while booting 2.6.34-rc0 (block pull busted)") Jens Axboe
0 siblings, 1 reply; 14+ messages in thread
From: Steven Rostedt @ 2010-03-02 17:49 UTC (permalink / raw)
To: walt
Cc: linux-kernel, Michael Breuer, Dmitry Torokhov, Jens Axboe, Linus Torvalds
Ug, Walt, do not remove Cc's when replying to LKML!
This looks urgent that we revert this commit:
9f7cdbc33f36d28e57eaba0093f68f0d14c38c5b
or find a fix real quick!
-- Steve
Subject: Oops while booting 2.6.34-rc0 (block pull busted)
On Tue, Mar 02, 2010 at 12:42:51PM -0500, Steven Rostedt wrote:
> On Tue, Mar 02, 2010 at 11:50:15AM -0500, Michael Breuer wrote:
> > >
> > >I'm getting the same oops and reverting that commit fixes it, thanks.
> > >I'm happy to test patches, etc.
> > >
>
> Seems we have a winner!
>
> I had the same bug:
>
> http://pastebin.com/iiLgJMwy
>
> and reverting this commit fixes it.
>
> -- Steve
* [GIT PULL] single block IO revert (Was "Re: Oops while booting 2.6.34-rc0 (block pull busted)")
2010-03-02 17:49 ` Steven Rostedt
@ 2010-03-02 18:21 ` Jens Axboe
2010-03-02 19:17 ` Steven Rostedt
0 siblings, 1 reply; 14+ messages in thread
From: Jens Axboe @ 2010-03-02 18:21 UTC (permalink / raw)
To: Steven Rostedt
Cc: walt, linux-kernel, Michael Breuer, Dmitry Torokhov, Linus Torvalds, dmonakhov
On Tue, Mar 02 2010, Steven Rostedt wrote:
> Ug, Walt, do not remove Cc's when replying to LKML!
>
> This looks urgent that we revert this commit:
>
> 9f7cdbc33f36d28e57eaba0093f68f0d14c38c5b
>
> or find a fix real quick!
We'll revert it asap, no point in wasting time debugging it first.
Linus, please pull:
git://git.kernel.dk/linux-2.6-block.git for-linus
Jens Axboe (1):
Revert "blkdev: fix merge_bvec_fn return value checks"
fs/bio.c | 4 ++--
1 files changed, 2 insertions(+), 2 deletions(-)
--
Jens Axboe
* Re: [GIT PULL] single block IO revert (Was "Re: Oops while booting 2.6.34-rc0 (block pull busted)")
2010-03-02 18:21 ` [GIT PULL] single block IO revert (Was "Re: Oops while booting 2.6.34-rc0 (block pull busted)") Jens Axboe
@ 2010-03-02 19:17 ` Steven Rostedt
2010-03-02 19:21 ` Jens Axboe
0 siblings, 1 reply; 14+ messages in thread
From: Steven Rostedt @ 2010-03-02 19:17 UTC (permalink / raw)
To: Jens Axboe
Cc: walt, linux-kernel, Michael Breuer, Dmitry Torokhov, Linus Torvalds, dmonakhov
On Tue, 2010-03-02 at 19:21 +0100, Jens Axboe wrote:
> On Tue, Mar 02 2010, Steven Rostedt wrote:
>
> We'll revert it asap, no point in wasting time debugging it first.
Thanks!
Since I have a box that triggers this issue, let me know if there's a
git branch you would like me to test.
-- Steve
* Re: [GIT PULL] single block IO revert (Was "Re: Oops while booting 2.6.34-rc0 (block pull busted)")
2010-03-02 19:17 ` Steven Rostedt
@ 2010-03-02 19:21 ` Jens Axboe
0 siblings, 0 replies; 14+ messages in thread
From: Jens Axboe @ 2010-03-02 19:21 UTC (permalink / raw)
To: Steven Rostedt
Cc: walt, linux-kernel, Michael Breuer, Dmitry Torokhov, Linus Torvalds, dmonakhov
On Tue, Mar 02 2010, Steven Rostedt wrote:
> On Tue, 2010-03-02 at 19:21 +0100, Jens Axboe wrote:
> > On Tue, Mar 02 2010, Steven Rostedt wrote:
> >
>
> > We'll revert it asap, no point in wasting time debugging it first.
>
> Thanks!
>
> Since I have a box that triggers this issue, let me know if there's a
> git branch you would like me to test.
Thanks, will let you know!
--
Jens Axboe