* [Drbd-dev] DRBD8: Panic in drbd_bm_write_sect() after an io error during resync.
@ 2007-02-14 18:03 Montrose, Ernest
2007-02-15 15:28 ` Philipp Reisner
From: Montrose, Ernest @ 2007-02-14 18:03 UTC (permalink / raw)
To: drbd-dev
Hi all,
We are overwhelmed with panics after I/O errors. It seems mdev->bc is NULL
due to some race condition. Here is one instance:
Two-node cluster, node A and node B. The SyncSource is node A. While syncing,
reads are issued on node B. I/O errors start to occur
on node A, and node A panics:
Feb 11 03:15:49 drbd0: Sending NegRSDReply. sector 11620032s.
Feb 11 03:15:49 drbd0: Notified peer that my disk is broken.
Feb 11 03:15:58 end_request: I/O error, dev sda, sector 59856375
Feb 11 03:16:10 end_request: I/O error, dev sda, sector 59856383
Feb 11 03:16:10 drbd0: Local IO failed. Detaching...
Feb 11 03:16:10 Unable to handle kernel NULL pointer dereference at
virtual address 00000010
Feb 11 03:16:10 printing eip:
Feb 11 03:16:10 ee3d0b0b
Feb 11 03:16:10 299ba000 -> *pde = 00000000:7f512001
Feb 11 03:16:10 00512000 -> *pme = 00000000:00000000
Feb 11 03:16:10 Oops: 0000 [#1]
Feb 11 03:16:10 SMP
Feb 11 03:16:10 Modules linked in: drbd cn bridge ipv6 ipmi_devintf
ipmi_si ipmi_msghandler i2c_dev i2c_core binfmt_misc dm_mirror video
thermal processor fan container button battery ac shpchp pci_hotplug
e1000 piix ide_cd cdrom sg raid1 dm_mod ide_disk mptscsih mptsas mptspi
mptfc mptscsi mptbase sd_mod scsi_mod
Feb 11 03:16:10 CPU: 0
Feb 11 03:16:10 EIP: 0061:[<ee3d0b0b>] Tainted: GF VLI
Feb 11 03:16:10 EFLAGS: 00010292 (2.6.16.29-xen #1)
Feb 11 03:16:10 EIP is at drbd_bm_write_sect+0x1b/0x1f0 [drbd]
Feb 11 03:16:10 eax: 00000000 ebx: eb925000 ecx: 00000000 edx:
000001bd
Feb 11 03:16:10 esi: eb925000 edi: eb92502c ebp: eae77f50 esp:
eae77f1c
Feb 11 03:16:10 ds: 007b es: 007b ss: 0069
Feb 11 03:16:10 Process drbd0_worker (pid: 5777, threadinfo=eae76000
task=c060b570)
Feb 11 03:16:10 Stack: <0>c0136f00 eae77f20 eae77f20 ffffffff ffffffff
e7ea3550 00000000 000001bd
Feb 11 03:16:10 00000000 000001bd eb925000 eb667980 eb92502c
eae77f74 ee3e2080 eb385d40
Feb 11 03:16:10 eb92502c eae77f74 ee3e6606 00000005 eb667980
eb925000 eae77fc0 ee3d4cae
Feb 11 03:16:10 Call Trace:
Feb 11 03:16:10 [<c0105401>] show_stack_log_lvl+0xa1/0xe0
Feb 11 03:16:10 [<c01055f1>] show_registers+0x181/0x200
Feb 11 03:16:10 [<c0105810>] die+0x100/0x1a0
Feb 11 03:16:10 [<c01156f6>] do_page_fault+0x3c6/0x8b1
Feb 11 03:16:10 [<c0105067>] error_code+0x2b/0x30
Feb 11 03:16:10 [<ee3e2080>] w_update_odbm+0x100/0x220 [drbd]
Feb 11 03:16:10 [<ee3d4cae>] drbd_worker+0x2de/0x4b5 [drbd]
Feb 11 03:16:10 [<ee3e70fc>] drbd_thread_setup+0x8c/0x100 [drbd]
Feb 11 03:16:10 [<c0102e95>] kernel_thread_helper+0x5/0x10
Feb 11 03:16:10 Code: 89 c8 5b 5d c3 8d 74 26 00 8d bc 27 00 00 00 00 55
89 e5 57 56 89 c6 53 83 ec 28 89 55 e8 c7 45 ec 00 00 00 00 89 55 f0 8b
40 14 <8b> 50 10 8b 48 14 01 55 e8 11 4d ec 8b 40 54 c7 45 e0 00 00 00
Feb 11 03:16:10 <0>Fatal exception: panic in 5 seconds
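The "Code:" bytes above can be read by hand. The following is a minimal Python sketch, not part of the original report, that splits the byte dump at the <..> marker the kernel puts on the first byte of the faulting instruction. It assumes only the usual oops "Code:" format and the "+0x1b" offset from the "EIP is at" line; the helper name split_at_fault is made up for illustration.

```python
# Bytes from the "Code:" line of the oops (log prefix and line wraps removed).
CODE_LINE = (
    "89 c8 5b 5d c3 8d 74 26 00 8d bc 27 00 00 00 00 55 "
    "89 e5 57 56 89 c6 53 83 ec 28 89 55 e8 c7 45 ec 00 00 00 00 89 55 f0 8b "
    "40 14 <8b> 50 10 8b 48 14 01 55 e8 11 4d ec 8b 40 54 c7 45 e0 00 00 00"
)

EIP_OFFSET = 0x1B  # from "EIP is at drbd_bm_write_sect+0x1b/0x1f0"


def split_at_fault(code_line):
    """Return (bytes_before_fault, bytes_from_fault) as lists of ints.

    Hypothetical helper: the token wrapped in <..> is the first byte of
    the instruction that faulted.
    """
    tokens = code_line.split()
    fault_index = next(i for i, t in enumerate(tokens) if t.startswith("<"))
    values = [int(t.strip("<>"), 16) for t in tokens]
    return values[:fault_index], values[fault_index:]


before, after = split_at_fault(CODE_LINE)

# Index of the first byte of drbd_bm_write_sect inside the dump:
# the dump shows len(before) bytes ahead of EIP, and EIP is 0x1b bytes
# into the function.
func_start = len(before) - EIP_OFFSET
```

With these inputs func_start lands on the byte 55 (push %ebp), which is a plausible function prologue. In standard IA-32 encoding the last three bytes before the marker, 8b 40 14, are mov 0x14(%eax),%eax, and the faulting 8b 50 10 is mov 0x10(%eax),%edx. With eax = 00000000 that read hits address 0x10, matching the reported fault address. Which structure member sits at offset 0x14 depends on the build, so this alone does not confirm that the NULL pointer is mdev->bc.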
Thanks,
EM--
* Re: [Drbd-dev] DRBD8: Panic in drbd_bm_write_sect() after an io error during resync.
From: Philipp Reisner @ 2007-02-15 15:28 UTC (permalink / raw)
To: drbd-dev
On Wednesday, 14 February 2007 19:03, Montrose, Ernest wrote:
> Hi all,
> We are overwhelmed with panics after I/O errors. It seems mdev->bc is NULL
> due to some race condition. Here is one instance:
>
> Two-node cluster, node A and node B. The SyncSource is node A. While syncing,
> reads are issued on node B. I/O errors start to occur
> on node A, and node A panics:
>
[...OOPS... ]
Hi Ernest,
I was not able to understand the cause of the oops at first
glance.
Could you provide the output of ksymoops when you feed this
oops to it? (I am interested in the disassembled code.)
AND
I do this debugging by comparing it to the assembler output
of the compiler.
Please provide the .s files from the machine where you build
your DRBD (with your compiler, kernel config and kernel source).
Remake DRBD with "make V=1".
Then create the .s file:
Replace the "-c" option with "-gstabs+ -S" and the -o "foo.o" with
-o "foo.s" in the call of the compiler.
Something like this:
(cd $KDIR ; gcc ... /some/path/foo.c )
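The flag substitution described above can be scripted. The following is a minimal Python sketch, not something from the original mail: it takes one compiler invocation copied from the "make V=1" output and applies exactly the two replacements named above. The function name to_asm_command and the sample command line are made up for illustration; a real kbuild command line carries many more options, which this sketch passes through unchanged.

```python
import shlex


def to_asm_command(compile_cmd):
    """Rewrite a compile command so it emits assembly instead of an object.

    Hypothetical helper: replaces "-c" with "-gstabs+ -S" and turns the
    "-o something.o" target into "-o something.s". All other arguments
    are kept as they are.
    """
    args = shlex.split(compile_cmd)
    out = []
    i = 0
    while i < len(args):
        arg = args[i]
        if arg == "-c":
            out += ["-gstabs+", "-S"]
        elif arg == "-o" and i + 1 < len(args):
            target = args[i + 1]
            if target.endswith(".o"):
                target = target[:-2] + ".s"
            out += ["-o", target]
            i += 1  # skip the output file name, it was handled here
        else:
            out.append(arg)
        i += 1
    return shlex.join(out)


# Made-up sample; the real line comes from the "make V=1" output.
sample = "gcc -O2 -c -o foo.o /some/path/foo.c"
asm_cmd = to_asm_command(sample)
```

For the sample line this yields "gcc -O2 -gstabs+ -S -o foo.s /some/path/foo.c". The resulting command still has to be run from the kernel source directory, as in the (cd $KDIR ; ...) form shown above.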
Thanks,
Philipp
--
: Dipl-Ing Philipp Reisner Tel +43-1-8178292-50 :
: LINBIT Information Technologies GmbH Fax +43-1-8178292-82 :
: Vivenotgasse 48, 1120 Vienna, Austria http://www.linbit.com :