From mboxrd@z Thu Jan 1 00:00:00 1970 From: benco Subject: Re: kernel BUG at arch/x86/xen/mmu.c:1860! Date: Fri, 11 Mar 2011 19:38:00 +0100 Message-ID: <20110311183800.GD32084@acid.sk> References: <20110303221639.GB12175@dumpdata.com> <20110308192950.GA4562@dumpdata.com> <20110308201002.GA5721@dumpdata.com> <1299617407852-3414620.post@n5.nabble.com> <4D76C48F.2050006@leuphana.de> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Return-path: Content-Disposition: inline In-Reply-To: <4D76C48F.2050006@leuphana.de> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Andreas Olsowski Cc: xen-devel@lists.xensource.com List-Id: xen-devel@lists.xenproject.org Hello, I can confirm this bug, I'm using very similar configuration - Debian Lenny/Squeeze on several servers connected to FC storage with enabled multipathing. One more thing - it was quite long ago, but I did not see this bug with X= en 4.0 pre-release versions with 2.6.31 kernel from Jeremy's tree. I'm regularly updating my system to up-to-date version of Xen/PV-OPS kernel a= nd=20 this bug is here across whole development line from Xen 4.0/2.6.32 kernel= =20 until now:( Roman On Wed, Mar 09, 2011 at 01:06:39AM +0100, Andreas Olsowski wrote: > Well, this is too bad. >=20 > I encountered this bug when xen 4.0 was released, around the time=20 > development on 2.6.31 was halted. >=20 > That is why i stuck with 2.6.31 when everyone else went with 2.6.32, > because i determined 2.6.32 was not fit for duty and im guessing it=20 > still isnt today. >=20 > The bug occures on 2.6.32 xen kernels ( maybe even newer ones) and is=20 > distribution unrelated, i was running debian 5.0 then, i am running 6.0= =20 > testing now and even have tried compiling all the userland stuff myself= . >=20 > This is error can be encountered during a number of different actions: > 1.) any action with lvm (start, stop, create, delete) > 2.) while starting multipathd (restarting too, of course) >=20 > Sometimes the box only hangs there and no further device mapper=20 > interactions are possible. This is where i got my syslog entry from. >=20 > Back in 2010 i had to serial console the server and stuff like that to=20 > see the whole error. >=20 >=20 > my guess is everything one does with the device mapper can and will=20 > trigger this sooner or later. >=20 > Does anybody have any kind of insight on what the problem may be? >=20 > ------------ > Here is my syslog part when i ran "/etc/init.d/multipath-tools restart"= : >=20 > Mar 9 00:24:10 memoryana multipathd: mpatha: stop event checker thread= =20 > (140606587918080) > Mar 9 00:24:10 memoryana multipathd: mpathb: stop event checker thread= =20 > (140606587885312) > Mar 9 00:24:10 memoryana multipathd: mpathc: stop event checker thread= =20 > (140606587852544) > Mar 9 00:24:10 memoryana kernel: ------------[ cut here ]------------ > Mar 9 00:24:10 memoryana kernel: kernel BUG at arch/x86/xen/mmu.c:1872= ! > Mar 9 00:24:10 memoryana kernel: invalid opcode: 0000 [#1] SMP > Mar 9 00:24:10 memoryana kernel: last sysfs file:=20 > /sys/devices/pci0000:00/0000:00:07.0/0000:04:00.1/host3/rport-3:0-2/tar= get3:0:2/3:0:2:0/state > Mar 9 00:24:10 memoryana kernel: CPU 1 > Mar 9 00:24:10 memoryana kernel: Modules linked in: dm_round_robin=20 > dm_multipath qla2xxx > Mar 9 00:24:10 memoryana kernel: Pid: 10662, comm: multipath-tools Not= =20 > tainted 2.6.32.28-xen0 #4 PowerEdge R610 > Mar 9 00:24:10 memoryana kernel: RIP: e030:[]=20 > [] pin_pagetable_pfn+0x31/0x60 > Mar 9 00:24:10 memoryana kernel: RSP: e02b:ffff8800c3101df8 EFLAGS:=20 > 00010282 > Mar 9 00:24:10 memoryana kernel: RAX: 00000000ffffffea RBX:=20 > ffff8800cc4c3400 RCX: 0000000000000003 > Mar 9 00:24:10 memoryana kernel: RDX: 0000000000000000 RSI:=20 > 0000000000000001 RDI: ffff8800c3101df8 > Mar 9 00:24:10 memoryana kernel: RBP: ffff8800c3135b60 R08:=20 > 00003ffffffff000 R09: ffff880000000000 > Mar 9 00:24:10 memoryana kernel: R10: 0000000000007ff0 R11:=20 > 0000000000000246 R12: 00000000000cc302 > Mar 9 00:24:10 memoryana kernel: R13: 0000000000000000 R14:=20 > ffff8800c374cc60 R15: ffff8800c374cc60 > Mar 9 00:24:10 memoryana kernel: FS: 00007f60add15700(0000)=20 > GS:ffff880028055000(0000) knlGS:0000000000000000 > Mar 9 00:24:10 memoryana kernel: CS: e033 DS: 0000 ES: 0000 CR0:=20 > 000000008005003b > Mar 9 00:24:10 memoryana kernel: CR2: 00007f60ad841876 CR3:=20 > 00000000cef79000 CR4: 0000000000002660 > Mar 9 00:24:10 memoryana kernel: DR0: 0000000000000000 DR1:=20 > 0000000000000000 DR2: 0000000000000000 > Mar 9 00:24:10 memoryana kernel: DR3: 0000000000000000 DR6:=20 > 00000000ffff0ff0 DR7: 0000000000000400 > Mar 9 00:24:10 memoryana kernel: Process multipath-tools (pid: 10662,=20 > threadinfo ffff8800c3100000, task ffff8800cc01cbc0) > Mar 9 00:24:10 memoryana kernel: Stack: > Mar 9 00:24:10 memoryana kernel: 0000000000000000 00000000008e8302=20 > ffff8800cc4c3400 ffff8800c3135b60 > Mar 9 00:24:10 memoryana kernel: <0> 00000000000cc302 ffffffff810b0382= =20 > 00007f60ad841876 ffff8800c30b4c10 > Mar 9 00:24:10 memoryana kernel: <0> 00000000000100e0 0000000000000000= =20 > ffff8800c374cc60 ffffffff810b3595 > Mar 9 00:24:10 memoryana kernel: Call Trace: > Mar 9 00:24:10 memoryana kernel: [] ?=20 > __pte_alloc+0xf2/0x120 > Mar 9 00:24:10 memoryana kernel: [] ?=20 > handle_mm_fault+0xa45/0xab0 > Mar 9 00:24:10 memoryana kernel: [] ?=20 > page_fault+0x25/0x30 > Mar 9 00:24:10 memoryana kernel: [] ?=20 > error_exit+0x2a/0x60 > Mar 9 00:24:10 memoryana kernel: [] ?=20 > retint_restore_args+0x5/0x6 > Mar 9 00:24:10 memoryana kernel: [] ?=20 > do_page_fault+0x121/0x3c0 > Mar 9 00:24:10 memoryana kernel: [] ?=20 > __put_user_4+0x1d/0x30 > Mar 9 00:24:10 memoryana kernel: [] ?=20 > page_fault+0x25/0x30 > Mar 9 00:24:10 memoryana kernel: Code: 57 c7 75 00 00 48 89 f0 89 3c 2= 4=20 > 74 27 48 89 44 24 08 48 89 e7 be 01 00 00 00 31 d2 41 ba f0 7f 00 00 e8= =20 > d3 be ff ff 85 c0 74 04 <0f> 0b eb fe 48 83 c4 28 c3 48 89 f7 e8 6e f7=20 > ff ff 48 83 f8 ff > Mar 9 00:24:10 memoryana kernel: RIP []=20 > pin_pagetable_pfn+0x31/0x60 > Mar 9 00:24:10 memoryana kernel: RSP > Mar 9 00:24:10 memoryana kernel: ---[ end trace f4eae184c1a9f532 ]--- > Mar 9 00:24:11 memoryana multipathd: --------shut down------- >=20 > --=20 > Andreas Olsowski > Leuphana Universit=C3=A4t L=C3=BCneburg > Rechen- und Medienzentrum > Scharnhorststra=C3=9Fe 1, C7.015 > 21335 L=C3=BCneburg >=20 > Tel: ++49 4131 677 1309 >=20 >=20 > _______________________________________________ > Xen-devel mailing list > Xen-devel@lists.xensource.com > http://lists.xensource.com/xen-devel --=20 ---------------------------------------------------------------------- ,''`. [benco] | mailto: benco@acid.sk | silc: /msg benco : :' : ------------------------------------------------------------- `. `' GPG publickey: http://www.acid.sk/pubkey.asc `- KF =3D 0DF6 0592 74D2 F17A DACF A5C3 1720 CB7C F54C F429