2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops

All of lore.kernel.org
 help / color / mirror / Atom feed

* 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
       [not found]   ` <1115392141.12197.3.camel@cobra>
@ 2005-05-07 16:31     ` Antoine Martin
  2005-05-07 15:57       ` Alexander Nyberg
  2005-05-08 14:12       ` Andi Kleen
  0 siblings, 2 replies; 26+ messages in thread
From: Antoine Martin @ 2005-05-07 16:31 UTC (permalink / raw)
  To: linux-kernel

I can crash a host repeatedly just by running a UML instance
(and causing some load on that instance - like running gcc)
I captured these messages from the serial console after it crashed:

Host: Linux x86_64 2.6.11.8 (built with selinux but not enforcing)
Guest: 2.6.12-rc3-uml
The UML patch applied on top of rc3 is available here:
http://213.228.237.37/uml/2.6.12-rc3/uml-x86_64-2.6.12-rc3.patch.bz2
It was built using the instructions from Jeff Dike at:
http://user-mode-linux.sourceforge.net/patches.html
The thread 'kernel-4' is the UML instance, running as a nobody user.
This was reported to the UML ML and Jeff Dike who advised me to
post it here too.
Let me know if you need anything else / testing / etc.
I am not subscribed to LKML, so please CC me on all emails.

general protection fault: 0000 [1]
CPU 0
Pid: 26926, comm: kernel-4 Not tainted 2.6.11.8
RIP: 0010:[<ffffffff8010ca47>] <ffffffff8010ca47>{__switch_to+311}
RSP: 0018:ffff8100a7635d48  EFLAGS: 00010016
RAX: 0000c8e816000002 RBX: ffff8100b895f320 RCX: 00000000c0000102
RDX: 000000000000c8e8 RSI: 0000000000000000 RDI: 0000000000000000
RBP: 0000000000000000 R08: ffff810090db3a00 R09: 0000000000006933
R10: 0000000000000000 R11: 0000000000000202 R12: ffff8100a827b890
R13: ffff8100b895f010 R14: ffff8100a827b580 R15: ffff8100a827b7f8
FS:  000000006025212c(0000) GS:ffffffff80785a00(0000)
knlGS:0000000000000d7e
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 000000006880d010 CR3: 00000000a2321000 CR4: 00000000000006e0
Process kernel-4 (pid: 26926, threadinfo ffff810060884000, task
ffff8100a827b580)
Stack: ffff8100dc37e180 ffff8100b895f010 ffffffff806b6d50
ffff8100b895f010
       000003595b049ed6 ffffffff804f4de4 ffff8100a7635de8
0000000000000086
       0000007500000020 ffff8100b895f010
Call Trace:<ffffffff804f4de4>{thread_return+0}
<ffffffff8013cb08>{ptrace_stop+280}
       <ffffffff8013cde6>{get_signal_to_deliver+358}
<ffffffff8010d4e3>{do_signal+163}
       <ffffffff8010e905>{error_exit+0}
<ffffffff8010de67>{sys_rt_sigreturn+535}
       <ffffffff8010dee9>{sys_rt_sigreturn+665}
<ffffffff8010e2b6>{int_signal+18}


Code: 0f 30 66 41 89 6c 24 2e 65 48 8b 04 25 20 00 00 00 49 89 44
RIP <ffffffff8010ca47>{__switch_to+311} RSP <ffff8100a7635d48>
 <0>general protection fault: 0000 [2]
CPU 0
Pid: 3, comm: events/0 Not tainted 2.6.11.8
RIP: 0010:[<ffffffff8010ca47>] <ffffffff8010ca47>{__switch_to+311}
RSP: 0018:ffff8100a7635d48  EFLAGS: 00010016
RAX: 0000c8e816000002 RBX: ffff8100b895f320 RCX: 00000000c0000102
RDX: 000000000000c8e8 RSI: 0000000000000000 RDI: 0000000000000000
RBP: 0000000000000000 R08: 0000000000000000 R09: 000000000000000d
R10: 0000000000000000 R11: 0000000000000000 R12: ffff8100fbf909f0
R13: ffff8100b895f010 R14: ffff8100fbf906e0 R15: ffff8100fbf90950
FS:  000000006025212c(0000) GS:ffffffff80785a00(0000)
knlGS:0000000000000d7e
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 000000006880d010 CR3: 00000000a2321000 CR4: 00000000000006e0
Process events/0 (pid: 3, threadinfo ffff8100fbfaa000, task
ffff8100fbf906e0)
Stack: ffff8100dc37e180 ffff8100b895f010 ffffffff806b6d50
ffff8100b895f010
       00000359c1aeb8da ffffffff804f4de4 ffff8100a7635de8
0000000000000086
       0000007500000020 ffff8100b895f010
Call Trace:<ffffffff804f4de4>{thread_return+0}
<ffffffff8013cb08>{ptrace_stop+280}
       <ffffffff8013cde6>{get_signal_to_deliver+358}
<ffffffff8010d4e3>{do_signal+163}
       <ffffffff8010e905>{error_exit+0}
<ffffffff8010de67>{sys_rt_sigreturn+535}
       <ffffffff8010dee9>{sys_rt_sigreturn+665}
<ffffffff8010e2b6>{int_signal+18}


Code: 0f 30 66 41 89 6c 24 2e 65 48 8b 04 25 20 00 00 00 49 89 44
RIP <ffffffff8010ca47>{__switch_to+311} RSP <ffff8100a7635d48>
 <1>Unable to handle kernel NULL pointer dereference at 0000000000000278
RIP:
<ffffffff804f4c99>{schedule+873}
PGD c8175067 PUD 31aea067 PMD 0
Oops: 0002 [3]
CPU 0
Pid: 3, comm: events/0 Not tainted 2.6.11.8
RIP: 0010:[<ffffffff804f4c99>] <ffffffff804f4c99>{schedule+873}
RSP: 0018:ffff8100a7635b68  EFLAGS: 00010007
RAX: 0000000002d3ca44 RBX: 0000000000000000 RCX: ffff810090db3a00
RDX: 000000003b9ac880 RSI: ffff8100b895f010 RDI: ffff8100fbf906e0
RBP: ffff8100a7635bc8 R08: 0000000000000000 R09: 000000000000000d
R10: 0000000000000000 R11: 0000000000000000 R12: ffffffff806b6d50
R13: ffff8100b895f010 R14: 0000035a27fb7d77 R15: ffff8100fbf90950
FS:  000000006025212c(0000) GS:ffffffff80785a00(0000)
knlGS:0000000000000d7e
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000000278 CR3: 00000000a2321000 CR4: 00000000000006e0
Process events/0 (pid: 3, threadinfo ffff8100fbfaa000, task
ffff8100fbf906e0)
Stack: 0000007300000000 ffff8100fbf906e0 0000000000000180
ffff8100b895f010
       ffffffff806a6fc0 0000000000000000 ffff8100fbf906e0
ffff8100a7635be8
       ffff8100fbf906e0 ffff8100fbf91440
Call Trace:<ffffffff801347a7>{do_exit+2535} <ffffffff8010f5ac>{oops_end
+28}
       <ffffffff8010f6e5>{die+69}
<ffffffff801100fe>{do_general_protection+270}
       <ffffffff8010e905>{error_exit+0} <ffffffff8010ca47>{__switch_to
+311}
       <ffffffff804f4de4>{thread_return+0}
<ffffffff8013cb08>{ptrace_stop+280}
       <ffffffff8013cde6>{get_signal_to_deliver+358}
<ffffffff8010d4e3>{do_signal+163}
       <ffffffff8010e905>{error_exit+0}
<ffffffff8010de67>{sys_rt_sigreturn+535}
       <ffffffff8010dee9>{sys_rt_sigreturn+665}
<ffffffff8010e2b6>{int_signal+18}


Code: 0f ba b3 78 02 00 00 00 0f ba a9 78 02 00 00 00 48 8b 51 38
RIP <ffffffff804f4c99>{schedule+873} RSP <ffff8100a7635b68>
CR2: 0000000000000278
 <1>Unable to handle kernel NULL pointer dereference at 0000000000000000
RIP:
<ffffffff8012e134>{dequeue_task+4}
PGD c8175067 PUD 31aea067 PMD 0
Oops: 0002 [4]
CPU 0
Pid: 3, comm: events/0 Not tainted 2.6.11.8
RIP: 0010:[<ffffffff8012e134>] <ffffffff8012e134>{dequeue_task+4}
RSP: 0018:ffff8100a76358c8  EFLAGS: 00010006
RAX: 0000000000000020 RBX: ffff8100fbf906e0 RCX: ffff8100fbf90718
RDX: ffff8100fbf906e0 RSI: 0000000000000000 RDI: ffff8100fbf906e0
RBP: ffff8100a76358d8 R08: 0000000000000000 R09: 000000000000000d
R10: 0000000000000000 R11: 0000000000000000 R12: ffff8100fbf906e0
R13: ffff8100fbf91440 R14: 0000035aa7f554c4 R15: ffff8100fbf90950
FS:  000000006025212c(0000) GS:ffffffff80785a00(0000)
knlGS:0000000000000d7e
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000000000 CR3: 00000000a2321000 CR4: 00000000000006e0
Process events/0 (pid: 3, threadinfo ffff8100fbfaa000, task
ffff8100fbf906e0)
Stack: ffffffff8012e3d4 ffff8100a7635968 ffff8100a7635948
ffffffff804f4ae2
       0000000000000000 ffff8100fbf906e0 00000000069f6bc7
ffffffff801301bf
       00000000ffffff01 0000000000000000
Call Trace:<ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8010e905>{error_exit+0} <ffffffff804f4c99>{schedule+873}
       <ffffffff804f4ae2>{schedule+434} <ffffffff801347a7>{do_exit+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8010f6e5>{die+69}
       <ffffffff801100fe>{do_general_protection+270}
<ffffffff8010e905>{error_exit+0}
       <ffffffff8010ca47>{__switch_to+311}
<ffffffff804f4de4>{thread_return+0}
       <ffffffff8013cb08>{ptrace_stop+280}
<ffffffff8013cde6>{get_signal_to_deliver+358}
       <ffffffff8010d4e3>{do_signal+163} <ffffffff8010e905>{error_exit
+0}
       <ffffffff8010de67>{sys_rt_sigreturn+535}
<ffffffff8010dee9>{sys_rt_sigreturn+665}
       <ffffffff8010e2b6>{int_signal+18}

Code: ff 0e 55 48 8b 47 38 48 8b 51 08 48 89 e5 48 89 02 48 89 50
RIP <ffffffff8012e134>{dequeue_task+4} RSP <ffff8100a76358c8>
CR2: 0000000000000000
 <1>Unable to handle kernel NULL pointer dereference at 0000000000000000
RIP:
<ffffffff8012e134>{dequeue_task+4}
PGD c8175067 PUD 31aea067 PMD 0
Oops: 0002 [5]
CPU 0
Pid: 3, comm: events/0 Not tainted 2.6.11.8
RIP: 0010:[<ffffffff8012e134>] <ffffffff8012e134>{dequeue_task+4}
RSP: 0018:ffff8100a7635628  EFLAGS: 00010002
RAX: 0000000000000020 RBX: ffff8100fbf906e0 RCX: ffff8100fbf90718
RDX: ffff8100fbf906e0 RSI: 0000000000000000 RDI: ffff8100fbf906e0
RBP: ffff8100a7635638 R08: 0000000000000000 R09: 000000000000000d
R10: 0000000000000000 R11: 0000000000000000 R12: ffff8100fbf906e0
R13: ffff8100fbf91440 R14: 0000035b3fc7d274 R15: ffff8100fbf90950
FS:  000000006025212c(0000) GS:ffffffff80785a00(0000)
knlGS:0000000000000d7e
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000000000 CR3: 00000000a2321000 CR4: 00000000000006e0
Process events/0 (pid: 3, threadinfo ffff8100fbfaa000, task
ffff8100fbf906e0)
Stack: ffffffff8012e3d4 ffff8100a76356c8 ffff8100a76356a8
ffffffff804f4ae2
       0000000000000000 ffff8100fbf906e0 00000000069f6bc7
ffffffff801301bf
       00000000ffffff01 0000000000000000
Call Trace:<ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8010e905>{error_exit+0} <ffffffff804f4c99>{schedule+873}
       <ffffffff804f4ae2>{schedule+434} <ffffffff801347a7>{do_exit+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8010f6e5>{die+69}
       <ffffffff801100fe>{do_general_protection+270}
<ffffffff8010e905>{error_exit+0}
       <ffffffff8010ca47>{__switch_to+311}
<ffffffff804f4de4>{thread_return+0}
       <ffffffff8013cb08>{ptrace_stop+280}
<ffffffff8013cde6>{get_signal_to_deliver+358}
       <ffffffff8010d4e3>{do_signal+163} <ffffffff8010e905>{error_exit
+0}
       <ffffffff8010de67>{sys_rt_sigreturn+535}
<ffffffff8010dee9>{sys_rt_sigreturn+665}
       <ffffffff8010e2b6>{int_signal+18}

Code: ff 0e 55 48 8b 47 38 48 8b 51 08 48 89 e5 48 89 02 48 89 50
RIP <ffffffff8012e134>{dequeue_task+4} RSP <ffff8100a7635628>
CR2: 0000000000000000
 <1>Unable to handle kernel NULL pointer dereference at 0000000000000000
RIP:
<ffffffff8012e134>{dequeue_task+4}
PGD c8175067 PUD 31aea067 PMD 0
Oops: 0002 [6]
CPU 0
Pid: 3, comm: events/0 Not tainted 2.6.11.8
RIP: 0010:[<ffffffff8012e134>] <ffffffff8012e134>{dequeue_task+4}
RSP: 0018:ffff8100a7635388  EFLAGS: 00010006
RAX: 0000000000000020 RBX: ffff8100fbf906e0 RCX: ffff8100fbf90718
RDX: ffff8100fbf906e0 RSI: 0000000000000000 RDI: ffff8100fbf906e0
RBP: ffff8100a7635398 R08: 0000000000000000 R09: 000000000000000d
R10: 0000000000000000 R11: 0000000000000000 R12: ffff8100fbf906e0
R13: ffff8100fbf91440 R14: 0000035bf2d22de7 R15: ffff8100fbf90950
FS:  000000006025212c(0000) GS:ffffffff80785a00(0000)
knlGS:0000000000000d7e
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000000000 CR3: 00000000a2321000 CR4: 00000000000006e0
Process events/0 (pid: 3, threadinfo ffff8100fbfaa000, task
ffff8100fbf906e0)
Stack: ffffffff8012e3d4 ffff8100a7635428 ffff8100a7635408
ffffffff804f4ae2
       0000000000000000 ffff8100fbf906e0 00000000069f6bc7
ffffffff801301bf
       00000000ffffff01 0000000000000000
Call Trace:<ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8010e905>{error_exit+0} <ffffffff804f4c99>{schedule+873}
       <ffffffff804f4ae2>{schedule+434} <ffffffff801347a7>{do_exit+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8010f6e5>{die+69}
       <ffffffff801100fe>{do_general_protection+270}
<ffffffff8010e905>{error_exit+0}
       <ffffffff8010ca47>{__switch_to+311}
<ffffffff804f4de4>{thread_return+0}
       <ffffffff8013cb08>{ptrace_stop+280}
<ffffffff8013cde6>{get_signal_to_deliver+358}
       <ffffffff8010d4e3>{do_signal+163} <ffffffff8010e905>{error_exit
+0}
       <ffffffff8010de67>{sys_rt_sigreturn+535}
<ffffffff8010dee9>{sys_rt_sigreturn+665}
       <ffffffff8010e2b6>{int_signal+18}

Code: ff 0e 55 48 8b 47 38 48 8b 51 08 48 89 e5 48 89 02 48 89 50
RIP <ffffffff8012e134>{dequeue_task+4} RSP <ffff8100a7635388>
CR2: 0000000000000000
 <1>Unable to handle kernel NULL pointer dereference at 0000000000000000
RIP:
<ffffffff8012e134>{dequeue_task+4}
PGD c8175067 PUD 31aea067 PMD 0
Oops: 0002 [7]
CPU 0
Pid: 3, comm: events/0 Not tainted 2.6.11.8
RIP: 0010:[<ffffffff8012e134>] <ffffffff8012e134>{dequeue_task+4}
RSP: 0018:ffff8100a76350e8  EFLAGS: 00010006
RAX: 0000000000000020 RBX: ffff8100fbf906e0 RCX: ffff8100fbf90718
RDX: ffff8100fbf906e0 RSI: 0000000000000000 RDI: ffff8100fbf906e0
RBP: ffff8100a76350f8 R08: 0000000000000000 R09: 000000000000000d
R10: 0000000000000000 R11: 0000000000000000 R12: ffff8100fbf906e0
R13: ffff8100fbf91440 R14: 0000035cc114f6cf R15: ffff8100fbf90950
FS:  000000006025212c(0000) GS:ffffffff80785a00(0000)
knlGS:0000000000000d7e
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000000000 CR3: 00000000a2321000 CR4: 00000000000006e0
Process events/0 (pid: 3, threadinfo ffff8100fbfaa000, task
ffff8100fbf906e0)
Stack: ffffffff8012e3d4 ffff8100a7635188 ffff8100a7635168
ffffffff804f4ae2
       0000000000000000 ffff8100fbf906e0 00000000069f6bc7
ffffffff801301bf
       00000000ffffff01 0000000000000000
Call Trace:<ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8010e905>{error_exit+0} <ffffffff804f4c99>{schedule+873}
       <ffffffff804f4ae2>{schedule+434} <ffffffff801347a7>{do_exit+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8010f6e5>{die+69}
       <ffffffff801100fe>{do_general_protection+270}
<ffffffff8010e905>{error_exit+0}
       <ffffffff8010ca47>{__switch_to+311}
<ffffffff804f4de4>{thread_return+0}
       <ffffffff8013cb08>{ptrace_stop+280}
<ffffffff8013cde6>{get_signal_to_deliver+358}
       <ffffffff8010d4e3>{do_signal+163} <ffffffff8010e905>{error_exit
+0}
       <ffffffff8010de67>{sys_rt_sigreturn+535}
<ffffffff8010dee9>{sys_rt_sigreturn+665}
       <ffffffff8010e2b6>{int_signal+18}

Code: ff 0e 55 48 8b 47 38 48 8b 51 08 48 89 e5 48 89 02 48 89 50
RIP <ffffffff8012e134>{dequeue_task+4} RSP <ffff8100a76350e8>
CR2: 0000000000000000
 <1>Unable to handle kernel NULL pointer dereference at 0000000000000000
RIP:
<ffffffff8012e134>{dequeue_task+4}
PGD c8175067 PUD 31aea067 PMD 0
Oops: 0002 [8]
CPU 0
Pid: 3, comm: events/0 Not tainted 2.6.11.8
RIP: 0010:[<ffffffff8012e134>] <ffffffff8012e134>{dequeue_task+4}
RSP: 0018:ffff8100a7634e48  EFLAGS: 00010002
RAX: 0000000000000020 RBX: ffff8100fbf906e0 RCX: ffff8100fbf90718
RDX: ffff8100fbf906e0 RSI: 0000000000000000 RDI: ffff8100fbf906e0
RBP: ffff8100a7634e58 R08: 0000000000000000 R09: 000000000000000d
R10: 0000000000000000 R11: 0000000000000000 R12: ffff8100fbf906e0
R13: ffff8100fbf91440 R14: 0000035daaa6e647 R15: ffff8100fbf90950
FS:  000000006025212c(0000) GS:ffffffff80785a00(0000)
knlGS:0000000000000d7e
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000000000 CR3: 00000000a2321000 CR4: 00000000000006e0
Process events/0 (pid: 3, threadinfo ffff8100fbfaa000, task
ffff8100fbf906e0)
Stack: ffffffff8012e3d4 ffff8100a7634ee8 ffff8100a7634ec8
ffffffff804f4ae2
       0000000000000000 ffff8100fbf906e0 00000000069f6bc7
ffffffff801301bf
       00000000ffffff01 0000000000000000
Call Trace:<ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8010e905>{error_exit+0} <ffffffff804f4c99>{schedule+873}
       <ffffffff804f4ae2>{schedule+434} <ffffffff801347a7>{do_exit+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8010f6e5>{die+69}
       <ffffffff801100fe>{do_general_protection+270}
<ffffffff8010e905>{error_exit+0}
       <ffffffff8010ca47>{__switch_to+311}
<ffffffff804f4de4>{thread_return+0}
       <ffffffff8013cb08>{ptrace_stop+280}
<ffffffff8013cde6>{get_signal_to_deliver+358}
       <ffffffff8010d4e3>{do_signal+163} <ffffffff8010e905>{error_exit
+0}
       <ffffffff8010de67>{sys_rt_sigreturn+535}
<ffffffff8010dee9>{sys_rt_sigreturn+665}
       <ffffffff8010e2b6>{int_signal+18}

Code: ff 0e 55 48 8b 47 38 48 8b 51 08 48 89 e5 48 89 02 48 89 50
RIP <ffffffff8012e134>{dequeue_task+4} RSP <ffff8100a7634e48>
CR2: 0000000000000000
 <1>Unable to handle kernel NULL pointer dereference at 0000000000000000
RIP:
<ffffffff8012e134>{dequeue_task+4}
PGD c8175067 PUD 31aea067 PMD 0
Oops: 0002 [9]
CPU 0
Pid: 3, comm: events/0 Not tainted 2.6.11.8
RIP: 0010:[<ffffffff8012e134>] <ffffffff8012e134>{dequeue_task+4}
RSP: 0018:ffff8100a7634ba8  EFLAGS: 00010002
RAX: 0000000000000020 RBX: ffff8100fbf906e0 RCX: ffff8100fbf90718
RDX: ffff8100fbf906e0 RSI: 0000000000000000 RDI: ffff8100fbf906e0
RBP: ffff8100a7634bb8 R08: 0000000000000000 R09: 000000000000000d
R10: 0000000000000000 R11: 0000000000000000 R12: ffff8100fbf906e0
R13: ffff8100fbf91440 R14: 0000035eaf57d638 R15: ffff8100fbf90950
FS:  000000006025212c(0000) GS:ffffffff80785a00(0000)
knlGS:0000000000000d7e
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000000000 CR3: 00000000a2321000 CR4: 00000000000006e0
Process events/0 (pid: 3, threadinfo ffff8100fbfaa000, task
ffff8100fbf906e0)
Stack: ffffffff8012e3d4 ffff8100a7634c48 ffff8100a7634c28
ffffffff804f4ae2
       0000000000000000 ffff8100fbf906e0 00000000069f6bc7
ffffffff801301bf
       00000000ffffff01 0000000000000000
Call Trace:<ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8010e905>{error_exit+0} <ffffffff804f4c99>{schedule+873}
       <ffffffff804f4ae2>{schedule+434} <ffffffff801347a7>{do_exit+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8010f6e5>{die+69}
       <ffffffff801100fe>{do_general_protection+270}
<ffffffff8010e905>{error_exit+0}
       <ffffffff8010ca47>{__switch_to+311}
<ffffffff804f4de4>{thread_return+0}
       <ffffffff8013cb08>{ptrace_stop+280}
<ffffffff8013cde6>{get_signal_to_deliver+358}
       <ffffffff8010d4e3>{do_signal+163} <ffffffff8010e905>{error_exit
+0}
       <ffffffff8010de67>{sys_rt_sigreturn+535}
<ffffffff8010dee9>{sys_rt_sigreturn+665}
       <ffffffff8010e2b6>{int_signal+18}

Code: ff 0e 55 48 8b 47 38 48 8b 51 08 48 89 e5 48 89 02 48 89 50
RIP <ffffffff8012e134>{dequeue_task+4} RSP <ffff8100a7634ba8>
CR2: 0000000000000000
 <1>Unable to handle kernel NULL pointer dereference at 0000000000000000
RIP:
<ffffffff8012e134>{dequeue_task+4}
PGD c8175067 PUD 31aea067 PMD 0
Oops: 0002 [10]
CPU 0
Pid: 3, comm: events/0 Not tainted 2.6.11.8
RIP: 0010:[<ffffffff8012e134>] <ffffffff8012e134>{dequeue_task+4}
RSP: 0018:ffff8100a7634908  EFLAGS: 00010002
RAX: 0000000000000020 RBX: ffff8100fbf906e0 RCX: ffff8100fbf90718
RDX: ffff8100fbf906e0 RSI: 0000000000000000 RDI: ffff8100fbf906e0
RBP: ffff8100a7634918 R08: 0000000000000000 R09: 000000000000000d
R10: 0000000000000000 R11: 0000000000000000 R12: ffff8100fbf906e0
R13: ffff8100fbf91440 R14: 0000035fcf41c4f5 R15: ffff8100fbf90950
FS:  000000006025212c(0000) GS:ffffffff80785a00(0000)
knlGS:0000000000000d7e
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000000000 CR3: 00000000a2321000 CR4: 00000000000006e0
Process events/0 (pid: 3, threadinfo ffff8100fbfaa000, task
ffff8100fbf906e0)
Stack: ffffffff8012e3d4 ffff8100a76349a8 ffff8100a7634988
ffffffff804f4ae2
       0000000000000000 ffff8100fbf906e0 00000000069f6bc7
ffffffff801301bf
       00000000ffffff01 0000000000000000
Call Trace:<ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8010e905>{error_exit+0} <ffffffff804f4c99>{schedule+873}
       <ffffffff804f4ae2>{schedule+434} <ffffffff801347a7>{do_exit+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8010f6e5>{die+69}
       <ffffffff801100fe>{do_general_protection+270}
<ffffffff8010e905>{error_exit+0}
       <ffffffff8010ca47>{__switch_to+311}
<ffffffff804f4de4>{thread_return+0}
       <ffffffff8013cb08>{ptrace_stop+280}
<ffffffff8013cde6>{get_signal_to_deliver+358}
       <ffffffff8010d4e3>{do_signal+163} <ffffffff8010e905>{error_exit
+0}
       <ffffffff8010de67>{sys_rt_sigreturn+535}
<ffffffff8010dee9>{sys_rt_sigreturn+665}
       <ffffffff8010e2b6>{int_signal+18}

Code: ff 0e 55 48 8b 47 38 48 8b 51 08 48 89 e5 48 89 02 48 89 50
RIP <ffffffff8012e134>{dequeue_task+4} RSP <ffff8100a7634908>
CR2: 0000000000000000
 <1>Unable to handle kernel NULL pointer dereference at 0000000000000000
RIP:
<ffffffff8012e134>{dequeue_task+4}
PGD c8175067 PUD 31aea067 PMD 0
Oops: 0002 [11]
CPU 0
Pid: 3, comm: events/0 Not tainted 2.6.11.8
RIP: 0010:[<ffffffff8012e134>] <ffffffff8012e134>{dequeue_task+4}
RSP: 0018:ffff8100a7634668  EFLAGS: 00010002
RAX: 0000000000000020 RBX: ffff8100fbf906e0 RCX: ffff8100fbf90718
RDX: ffff8100fbf906e0 RSI: 0000000000000000 RDI: ffff8100fbf906e0
RBP: ffff8100a7634678 R08: 0000000000000000 R09: 000000000000000d
R10: 0000000000000000 R11: 0000000000000000 R12: ffff8100fbf906e0
R13: ffff8100fbf91440 R14: 000003610a89cc6d R15: ffff8100fbf90950
FS:  000000006025212c(0000) GS:ffffffff80785a00(0000)
knlGS:0000000000000d7e
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000000000 CR3: 00000000a2321000 CR4: 00000000000006e0
Process events/0 (pid: 3, threadinfo ffff8100fbfaa000, task
ffff8100fbf906e0)
Stack: ffffffff8012e3d4 ffff8100a7634708 ffff8100a76346e8
ffffffff804f4ae2
       0000000000000000 ffff8100fbf906e0 00000000069f6bc7
ffffffff801301bf
       00000000ffffff01 0000000000000000
Call Trace:<ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8013218c>{__call_console_drivers+76}
<ffffffff8032d55b>{scrup+123}
       <ffffffff8010e905>{error_exit+0} <ffffffff8012e134>{dequeue_task
+4}
       <ffffffff8012e3d4>{deactivate_task+20}
<ffffffff804f4ae2>{schedule+434}
       <ffffffff801301bf>{mm_release+47} <ffffffff801347a7>{do_exit
+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8011f4cf>{do_page_fault
+1855}
       <ffffffff8010e905>{error_exit+0} <ffffffff804f4c99>{schedule+873}
       <ffffffff804f4ae2>{schedule+434} <ffffffff801347a7>{do_exit+2535}
       <ffffffff8010f5ac>{oops_end+28} <ffffffff8010f6e5>{die+69}
       <ffffffff801100fe>{do_general_protection+270}
<ffffffff8010e905>{error_exit+0}
       <ffffffff8010ca47>{__switch_to+311}
<ffffffff804f4de4>{thread_return+0}
       <ff


^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-07 16:31     ` 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops Antoine Martin
@ 2005-05-07 15:57       ` Alexander Nyberg
  2005-05-07 18:03         ` Jeff Dike
  2005-05-07 18:06         ` Antoine Martin
  2005-05-08 14:12       ` Andi Kleen
  1 sibling, 2 replies; 26+ messages in thread
From: Alexander Nyberg @ 2005-05-07 15:57 UTC (permalink / raw)
  To: Antoine Martin; +Cc: linux-kernel

> I can crash a host repeatedly just by running a UML instance
> (and causing some load on that instance - like running gcc)
> I captured these messages from the serial console after it crashed:
> 
> Host: Linux x86_64 2.6.11.8 (built with selinux but not enforcing)
> Guest: 2.6.12-rc3-uml
> The UML patch applied on top of rc3 is available here:
> http://213.228.237.37/uml/2.6.12-rc3/uml-x86_64-2.6.12-rc3.patch.bz2
> It was built using the instructions from Jeff Dike at:
> http://user-mode-linux.sourceforge.net/patches.html
> The thread 'kernel-4' is the UML instance, running as a nobody user.
> This was reported to the UML ML and Jeff Dike who advised me to
> post it here too.
> Let me know if you need anything else / testing / etc.
> I am not subscribed to LKML, so please CC me on all emails.


I never get uml to compile around here, 2.6.12-rc3 + that patchkit from
the link you sent blows up with defconfig any my minimal config. Please
attach your guest .config and if you can you might aswell put your guest
vmlinux somewhere where i can download it too.

> general protection fault: 0000 [1]
> CPU 0
> Pid: 26926, comm: kernel-4 Not tainted 2.6.11.8
> RIP: 0010:[<ffffffff8010ca47>] <ffffffff8010ca47>{__switch_to+311}
> RSP: 0018:ffff8100a7635d48  EFLAGS: 00010016
> RAX: 0000c8e816000002 RBX: ffff8100b895f320 RCX: 00000000c0000102
> RDX: 000000000000c8e8 RSI: 0000000000000000 RDI: 0000000000000000
> RBP: 0000000000000000 R08: ffff810090db3a00 R09: 0000000000006933
> R10: 0000000000000000 R11: 0000000000000202 R12: ffff8100a827b890
> R13: ffff8100b895f010 R14: ffff8100a827b580 R15: ffff8100a827b7f8
> FS:  000000006025212c(0000) GS:ffffffff80785a00(0000)
> knlGS:0000000000000d7e
> CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> CR2: 000000006880d010 CR3: 00000000a2321000 CR4: 00000000000006e0
> Process kernel-4 (pid: 26926, threadinfo ffff810060884000, task
> ffff8100a827b580)
> Stack: ffff8100dc37e180 ffff8100b895f010 ffffffff806b6d50
> ffff8100b895f010
>        000003595b049ed6 ffffffff804f4de4 ffff8100a7635de8
> 0000000000000086
>        0000007500000020 ffff8100b895f010
> Call Trace:<ffffffff804f4de4>{thread_return+0}
> <ffffffff8013cb08>{ptrace_stop+280}
>        <ffffffff8013cde6>{get_signal_to_deliver+358}
> <ffffffff8010d4e3>{do_signal+163}
>        <ffffffff8010e905>{error_exit+0}
> <ffffffff8010de67>{sys_rt_sigreturn+535}
>        <ffffffff8010dee9>{sys_rt_sigreturn+665}
> <ffffffff8010e2b6>{int_signal+18}
> 



^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-07 15:57       ` Alexander Nyberg
@ 2005-05-07 18:03         ` Jeff Dike
  2005-05-08  0:18           ` Al Viro
       [not found]           ` <1115573839.10373.42.camel@cobra>
  2005-05-07 18:06         ` Antoine Martin
  1 sibling, 2 replies; 26+ messages in thread
From: Jeff Dike @ 2005-05-07 18:03 UTC (permalink / raw)
  To: Alexander Nyberg; +Cc: Antoine Martin, linux-kernel

On Sat, May 07, 2005 at 05:57:48PM +0200, Alexander Nyberg wrote:
> I never get uml to compile around here, 2.6.12-rc3 + that patchkit from
> the link you sent blows up with defconfig any my minimal config. Please
> attach your guest .config and if you can you might aswell put your guest
> vmlinux somewhere where i can download it too.

Start with -rc3, and all the patches from
	http://user-mode-linux.sf.net/patches.html
up to and including skas0.  You'll see a note to x86_64 users on that patch.

				Jeff

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-07 18:03         ` Jeff Dike
@ 2005-05-08  0:18           ` Al Viro
  2005-05-08  6:10             ` Al Viro
  2005-05-08 16:28             ` Jeff Dike
       [not found]           ` <1115573839.10373.42.camel@cobra>
  1 sibling, 2 replies; 26+ messages in thread
From: Al Viro @ 2005-05-08  0:18 UTC (permalink / raw)
  To: Jeff Dike; +Cc: Alexander Nyberg, Antoine Martin, linux-kernel

On Sat, May 07, 2005 at 02:03:56PM -0400, Jeff Dike wrote:
> On Sat, May 07, 2005 at 05:57:48PM +0200, Alexander Nyberg wrote:
> > I never get uml to compile around here, 2.6.12-rc3 + that patchkit from
> > the link you sent blows up with defconfig any my minimal config. Please
> > attach your guest .config and if you can you might aswell put your guest
> > vmlinux somewhere where i can download it too.
> 
> Start with -rc3, and all the patches from
> 	http://user-mode-linux.sf.net/patches.html
> up to and including skas0.  You'll see a note to x86_64 users on that patch.

Hrm...
	a) stub.S handling breaks on O= builds.   Actually, your unprofile
breaks there - it's bypassing the machinery that deals with include path.
	b) stub_segv.c on amd64 includes <signal.h>.  Not a good idea...
	c) sysdep-x86_64/checksum.h in -rc4 has csum_partial_copy_from_user()
that needs updating (AFAICS, you have that in your patchset, but it hadn't
reached Linus)
	d) ip_compute_csum() prototype is missing (same file)
	e) #define UPT_SYSCALL_RET(r) UPT_RAX(r) is needed in amd64 ptrace.h
	f) take a good look at UPT_SET() in the same file ;-)
	g) CFLAGS_csum-partial.o := -Dcsum_partial=arch_csum_partial in
sys-x86_64/Makefile needs to be removed.
	h) Makefile.rules should be included _after_ SYMLINKS = in the same
file.
	i) sys-x86_64/delay.c needs exports of __udelay() and __const_udelay(),
include of linux/module.h and barriers in your delay loop bodies (or games
with volatile - anything that would guarantee that gcc won't decide to optimize
the entire loop away).  The last part applies to i386 as well.
	j) ip_compute_csum should be exported on amd64.
	k) sys-x86_64/syscalls.c needs include "kern.h"
	l) elf-i386.h should include <asm/user.h>, not "user.h"
	m) elf-x86_64.h lacks R_X86_64_... definitions
	n) WTF _is_ that #ifdef TIF_IA32 in there?  Aside of the trailing \,
we could as well put #error there - free-floating clear_thread_flag(TIF_IA32);
outside of any function body will have that effect anyway.
	o) in drivers/chan_kern.c we have several printf(KERN_ERR "...");
these should become printk, as they clearly had been intended.  As it is,
they give instant panic if we ever call them.
	p) TOP_ADDR in Kconfig_x86_64 got lost in transmission - your patchset
has it, but same patch in Linus' tree does not.

I've got patches for everything except (a); that one is really nasty.  I hope
to sort it out by tonight; if not, I'll just send what I've got by now.

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-08  0:18           ` Al Viro
@ 2005-05-08  6:10             ` Al Viro
  2005-05-09 21:07               ` Al Viro
  2005-05-08 16:28             ` Jeff Dike
  1 sibling, 1 reply; 26+ messages in thread
From: Al Viro @ 2005-05-08  6:10 UTC (permalink / raw)
  To: Jeff Dike; +Cc: Alexander Nyberg, Antoine Martin, linux-kernel

On Sun, May 08, 2005 at 01:18:32AM +0100, Al Viro wrote:
> 	a) stub.S handling breaks on O= builds.   Actually, your unprofile
> breaks there - it's bypassing the machinery that deals with include path.

Solved.

> 	p) TOP_ADDR in Kconfig_x86_64 got lost in transmission - your patchset
> has it, but same patch in Linus' tree does not.

	q) skas/mmu.c is calling pte_alloc_map() without ->page_table_lock.
Trivially fixed, needed if you want spinlock debugging to produce something
useful.
	r) when built static, kernel dies ugly death with
#0  0x00000000601e4178 in ptmalloc_init () at swab.h:134
#1  0x00000000601e4034 in malloc_hook_ini () at swab.h:134
#2  0x00000000601e1698 in malloc () at swab.h:134
#3  0x00000000602068ee in _dl_init_paths () at swab.h:134
#4  0x00000000601eba45 in _dl_non_dynamic_init () at swab.h:134
#5  0x00000000601ebc60 in __libc_init_first () at swab.h:134
#6  0x00000000601cfa4f in __libc_start_main () at swab.h:134
#7  0x000000006001202a in _start () at proc_fs.h:183
as stack trace.  Buggered offsets in uml.lds, perhaps?

Dynamically built it works; for i386 the same tree works both with both
static and dynamic.  It _might_ be libc difference, in theory (i386 libc
is 2.3.2.ds1-18, amd64 - 2.3.2.ds1-21, both from sarge), but I wouldn't
bet on it.  Anyway, I'm going down right now; carving the fixes into
sane patch series + experimenting with static/amd64 breakage will have
to wait until the morning...

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-08  6:10             ` Al Viro
@ 2005-05-09 21:07               ` Al Viro
  2005-05-10  2:26                 ` Al Viro
  0 siblings, 1 reply; 26+ messages in thread
From: Al Viro @ 2005-05-09 21:07 UTC (permalink / raw)
  To: Jeff Dike; +Cc: Alexander Nyberg, Antoine Martin, linux-kernel

On Sun, May 08, 2005 at 07:10:44AM +0100, Al Viro wrote:
> 	r) when built static, kernel dies ugly death with
> #0  0x00000000601e4178 in ptmalloc_init () at swab.h:134
> #1  0x00000000601e4034 in malloc_hook_ini () at swab.h:134
> #2  0x00000000601e1698 in malloc () at swab.h:134
> #3  0x00000000602068ee in _dl_init_paths () at swab.h:134
> #4  0x00000000601eba45 in _dl_non_dynamic_init () at swab.h:134
> #5  0x00000000601ebc60 in __libc_init_first () at swab.h:134
> #6  0x00000000601cfa4f in __libc_start_main () at swab.h:134
> #7  0x000000006001202a in _start () at proc_fs.h:183
> as stack trace.  Buggered offsets in uml.lds, perhaps?

Apparently solved by adding .tdata and .tbss to uml.lds.S.  That change does
not give any visible regression on i386.

	s) i386 TT-only won't compile, due to mispaced include in
sysdep/ptrace.h (under ifdef for skas).  Trivial fix apparently gets it
to work correctly.  Which is surprising - when running that sucker on
amd64 we get zero from
        *host_size_out = ROUND_4M((unsigned long) &arg);
Of course, the real size rounded up to 4M is 4Gb there - 32bit tasks do not
have to share the lower 4Gb of address space with the kernel.  Looks like
we survive, though - it boots and apparently works both on i386 and (as
32bit process) on amd64.
	t) amd64 TT-only builds just fine, but gets buggered due to
mismatch between CONFIG_TOP_ADDR and start address - we get *both* set
to 0x60000000, which is obviously b0rken and doesn't match the old
code, while we are at it.  We want TOP_ADDR at 0x80000000 to match start
address.
	u) amd64 TT is _still_ buggered due to unmap_fin.o attempts at
magic.  errno sits in TLS for amd64, so unmap_fin.o gets very interesting
stuff leaking from libc and messing the link.  IMO that should be dealt
with by brute force; namely, unmap-$(SUBARCH).S instead of trying to
play games with pulling stuff from libc.a.  For fsck sake, we are just
making 3 syscalls there and switcheroo() is as low-level as it gets...
Will post once that's done...

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-09 21:07               ` Al Viro
@ 2005-05-10  2:26                 ` Al Viro
  2005-05-10  3:50                   ` Jeff Dike
  0 siblings, 1 reply; 26+ messages in thread
From: Al Viro @ 2005-05-10  2:26 UTC (permalink / raw)
  To: Jeff Dike; +Cc: Alexander Nyberg, Antoine Martin, linux-kernel

On Mon, May 09, 2005 at 10:07:53PM +0100, Al Viro wrote:
> 	u) amd64 TT is _still_ buggered due to unmap_fin.o attempts at
> magic.  errno sits in TLS for amd64, so unmap_fin.o gets very interesting
> stuff leaking from libc and messing the link.  IMO that should be dealt
> with by brute force; namely, unmap-$(SUBARCH).S instead of trying to
> play games with pulling stuff from libc.a.  For fsck sake, we are just
> making 3 syscalls there and switcheroo() is as low-level as it gets...
> Will post once that's done...
	OK, actually - C with use of _syscall(); still, per-architecture
due to different calling conventions (mmap() has enough arguments to
trigger irregularities).  That deals with errno / __libc_errno getting
screwed, but there's more...

	v) phys_mappings rbtree gets screwed in fixrange_init() - no
surprise, seeing what it does in
        for ( ; (i < PTRS_PER_PGD) && (vaddr < end); pgd++, i++) {
                pmd = (pmd_t *)pgd;
                for (; (j < PTRS_PER_PMD) && (vaddr != end); pmd++, j++) {
Note that here PTR_PER_PGD and PTRS_PER_PMD are both 512.  Fun...  Liberal
stealing from arch/i386/mm/init.c deals with that one, AFAICS.  Now we have
the following:
	uml/i386 - all variants work
	uml/amd64 TT-only - panics in execve() on /sbin/init (hey, a progress)
	uml/amd64 other variants - work
Now to figure out WTF is happening in that execve()...

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-10  2:26                 ` Al Viro
@ 2005-05-10  3:50                   ` Jeff Dike
  2005-05-10 10:02                     ` Al Viro
  0 siblings, 1 reply; 26+ messages in thread
From: Jeff Dike @ 2005-05-10  3:50 UTC (permalink / raw)
  To: Al Viro; +Cc: Alexander Nyberg, Antoine Martin, linux-kernel

On Tue, May 10, 2005 at 03:26:31AM +0100, Al Viro wrote:
> Now we have
> the following:
> 	uml/i386 - all variants work
> 	uml/amd64 TT-only - panics in execve() on /sbin/init (hey, a progress)
> 	uml/amd64 other variants - work

Nice, send patches when you get a chance?

				Jeff

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-10  3:50                   ` Jeff Dike
@ 2005-05-10 10:02                     ` Al Viro
  0 siblings, 0 replies; 26+ messages in thread
From: Al Viro @ 2005-05-10 10:02 UTC (permalink / raw)
  To: Jeff Dike; +Cc: Alexander Nyberg, Antoine Martin, linux-kernel

On Mon, May 09, 2005 at 11:50:52PM -0400, Jeff Dike wrote:
> On Tue, May 10, 2005 at 03:26:31AM +0100, Al Viro wrote:
> > Now we have
> > the following:
> > 	uml/i386 - all variants work
> > 	uml/amd64 TT-only - panics in execve() on /sbin/init (hey, a progress)
> > 	uml/amd64 other variants - work
> 
> Nice, send patches when you get a chance?

ftp.linux.org.uk/pub/people/viro/UM*-RC12-rc4.  The first one is your skas0
rediffed to -rc4, the rest - fixes (matching the bug list upthread; the
last two are actually older than the rest, missed them in the original list
of bugs; one is obvious missing include, another is dereferencing userland
pointer instead of dealing with copied value + missing chunk in amd64/tt
sc_copy_...).

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-08  0:18           ` Al Viro
  2005-05-08  6:10             ` Al Viro
@ 2005-05-08 16:28             ` Jeff Dike
  1 sibling, 0 replies; 26+ messages in thread
From: Jeff Dike @ 2005-05-08 16:28 UTC (permalink / raw)
  To: Al Viro; +Cc: Alexander Nyberg, Antoine Martin, linux-kernel

On Sun, May 08, 2005 at 01:18:32AM +0100, Al Viro wrote:
> Hrm...

I had a lot of these fixed already.  These will be mostly in the fixlets
patch.

> 	a) stub.S handling breaks on O= builds.   Actually, your unprofile
> breaks there - it's bypassing the machinery that deals with include path.

This?
	$(patsubst -pg,,$(patsubst -fprofile-arcs -ftest-coverage,,$(1)))

I don't see any connection to include paths there.


> 	b) stub_segv.c on amd64 includes <signal.h>.  Not a good idea...

x86_64 doesn't, but i386 does.  Fixed now.

> 	c) sysdep-x86_64/checksum.h in -rc4 has csum_partial_copy_from_user()
> that needs updating (AFAICS, you have that in your patchset, but it hadn't
> reached Linus)

Fixed.

> 	d) ip_compute_csum() prototype is missing (same file)
> 	j) ip_compute_csum should be exported on amd64.

OK, I need to look at this a bit more.

> 	e) #define UPT_SYSCALL_RET(r) UPT_RAX(r) is needed in amd64 ptrace.h

Fixed.

> 	f) take a good look at UPT_SET() in the same file ;-)

Sigh.  Fixed now.

> 	g) CFLAGS_csum-partial.o := -Dcsum_partial=arch_csum_partial in
> sys-x86_64/Makefile needs to be removed.

Fixed.

> 	h) Makefile.rules should be included _after_ SYMLINKS = in the same
> file.

Fixed.

> 	i) sys-x86_64/delay.c needs exports of __udelay() and __const_udelay(),
> include of linux/module.h and barriers in your delay loop bodies (or games
> with volatile - anything that would guarantee that gcc won't decide to optimize
> the entire loop away).  The last part applies to i386 as well.

Looks to me like it all applies to i386 too, except that __delay looks 
unoptimizable.

It also looks to me like I could implement __udelay as n=... ; __delay(n);

And also never mind the fact that __udelay and __const_udelay are identical.

> 	k) sys-x86_64/syscalls.c needs include "kern.h"

Fixed now.

> 	l) elf-i386.h should include <asm/user.h>, not "user.h"

Fixed now.

> 	m) elf-x86_64.h lacks R_X86_64_... definitions

Fixed now.

> 	n) WTF _is_ that #ifdef TIF_IA32 in there?  Aside of the trailing \,
> we could as well put #error there - free-floating clear_thread_flag(TIF_IA32);> outside of any function body will have that effect anyway.

The trailing \ aside, which is fixed, that's a reminder for me when I add
the 32-bit compatibility code.

> 	o) in drivers/chan_kern.c we have several printf(KERN_ERR "...");
> these should become printk, as they clearly had been intended.  As it is,
> they give instant panic if we ever call them.

Oops.  This requires a bit of thought.  Offhand, I think they need to be 
printf, because that early in boot, printk'd stuff may not reach the screen
if UML exits then.

> 	p) TOP_ADDR in Kconfig_x86_64 got lost in transmission - your patchset
> has it, but same patch in Linus' tree does not.

Fixed

> 
> I've got patches for everything except (a); that one is really nasty.  I hope
> to sort it out by tonight; if not, I'll just send what I've got by now.

OK, send me what you have, and if we've fixed the same thing differently,
I choose one or the other.

				Jeff

^ permalink raw reply	[flat|nested] 26+ messages in thread

[parent not found: <1115573839.10373.42.camel@cobra>]

[parent not found: <20050508183533.GA27251@ccure.user-mode-linux.org>]

[parent not found: <1115590823.10373.68.camel@cobra>]

[parent not found: <1115600253.10373.74.camel@cobra>]

[parent not found: <1117307236.10041.4.camel@localhost>]

* [uml-devel] Re: Problems applying patches to 2.6.12-rc5-mm1
       [not found]                   ` <1117307236.10041.4.camel@localhost>
@ 2005-06-02 15:52                     ` antoine
       [not found]                     ` <1117691482.10122.107.camel@localhost>
  1 sibling, 0 replies; 26+ messages in thread
From: antoine @ 2005-06-02 15:52 UTC (permalink / raw)
  To: user-mode-linux-devel

Note: this is an attempt at running it on x86_64 host, so the errors
may or may not be the same...

I applied the patches on the website to 2.6.12-rc5 (mm1 and mm2: most
patches apply cleanly to both, some have been merged in mm2 obviously),
I got an error here:

gcc   -Wall -Wstrict-prototypes -Wno-trigraphs -fno-strict-aliasing
-fno-common -ffreestanding -O2 -fno-omit-frame-pointer
-fno-optimize-sibling-calls -g -U__x86_64__ -fno-builtin -D__arch_um__
-DSUBARCH=\"x86_64\" -Iarch/um/include
-I/usr/src/linux-2.6.12-rc5-mm2-uml/arch/um/kernel/tt/include
-D__x86_64__ -D_GNU_SOURCE -D_LARGEFILE64_SOURCE -c -o
arch/um/sys-x86_64/stub_segv.o arch/um/sys-x86_64/stub_segv.c
In file included from /usr/include/asm/ucontext.h:2,
                 from arch/um/sys-x86_64/stub_segv.c:8:
/usr/include/asm/../asm-x86_64/ucontext.h:4: error: redefinition of
`struct ucontext'

I removed the #include "ucontext", and added #include "uml-config.h" to
stub_segv.c. Then it got stuck on:
fixdep: arch/um/sys-x86_64/.stub_segv.o.d: No such file or directory
Re-running the build as suggested by Jeff did not help, but
creating an empty file to make it proceed seems to work...
`touch arch/um/sys-x86_64/.stub_segv.o.d`
(I haven't tried removing the av-skas patch yet)

Then it builds and almost runs:
(...)
[42949373.260000] kjournald starting.  Commit interval 5 seconds
[42949373.260000] EXT3-fs: mounted filesystem with ordered data mode.
[42949373.260000] VFS: Mounted root (ext3 filesystem) readonly.
[42949373.260000] Kernel panic - not syncing: Kernel mode fault at addr
0x0, ip 0x0
[42949373.260000]
[42949373.260000] Pid: 1, comm: bash Not tainted 2.6.12-rc5-mm2
[42949373.260000] RIP: 0000:[<0000000000000000>]
[42949373.260000] RSP: 0000000060453c30  EFLAGS: 00010202
[42949373.260000] RAX: 0000000000000001 RBX: 0000000000000000 RCX:
0000000060194ec1
[42949373.260000] RDX: 000000006002840d RSI: 0000000000000001 RDI:
0000000060453ad0
[42949373.260000] RBP: 0000000000000000 R08: 0000000000000000 R09:
0000000000000000
[42949373.260000] R10: 0000000000000008 R11: 0000000000000246 R12:
0000000000000000
[42949373.260000] R13: 0000000000000001 R14: 000000000000000a R15:
0000000000000000
[42949373.260000] Call Trace:
[42949373.260000] 604536d8:  [<6001575f>] panic_exit+0x2f/0x50
[42949373.260000] 604536f8:  [<60040c2b>] notifier_call_chain+0x2b/0x50
[42949373.260000] 60453728:  [<6002fe24>] panic+0xe4/0x180
[42949373.260000] 60453768:  [<60194ec1>] __sigprocmask+0x11/0x40
[42949373.260000] 60453788:  [<6010e0b3>] __up_read+0xb3/0xf0
[42949373.260000] 604537a8:  [<60014908>] handle_page_fault+0xc8/0x280
[42949373.260000] 604537d8:  [<6002a1ef>] __do_user_copy+0x4f/0xd0
[42949373.260000] 60453818:  [<60014caa>] segv+0x1ea/0x2b0
[42949373.260000] 60453828:  [<60194f85>] sigemptyset+0x15/0x40
[42949373.260000] 60453838:  [<60028d71>] change_sig+0x61/0x80
[42949373.260000] 604538a8:  [<60194f85>] sigemptyset+0x15/0x40
[42949373.260000] 604538b8:  [<60028de1>] change_signals+0x51/0x80
[42949373.260000] 60453928:  [<60015056>] segv_handler+0x106/0x120
[42949373.260000] 60453958:  [<60014f50>] segv_handler+0x0/0x120
[42949373.260000] 60453968:  [<60017fc9>] sig_handler_common_tt
+0x119/0x1b0
[42949373.260000] 604539d8:  [<60028a80>] sig_handler+0x10/0x20
[42949373.260000] 604539e8:  [<60194c40>] __restore_rt+0x0/0x10
[42949373.260000] 60453a78:  [<6002840d>] run_kernel_thread+0x2d/0x50
[42949373.260000] 60453a88:  [<60194ec1>] __sigprocmask+0x11/0x40
[42949373.260000] 60453ae0:  [<6000f1b0>] init+0x0/0x100
[42949373.260000] 60453b08:  [<6002840d>] run_kernel_thread+0x2d/0x50
[42949373.260000] 60453b88:  [<6000f1b0>] init+0x0/0x100
[42949373.260000] 60453b98:  [<60028e2e>] unblock_signals+0xe/0x10
[42949373.260000] 60453ba8:  [<60016456>] new_thread_handler+0x166/0x1a0
[42949373.260000] 60453cd8:  [<60194ec1>] __sigprocmask+0x11/0x40
[42949373.260000]
[42949373.260000]

Now, I am really stuck.
Any ideas?

Cheers
Antoine


On Sat, 2005-05-28 at 20:07 +0100, antoine wrote:
> Hi Jeff,
> 
> Now that the ptrace host bug has been fixed in 2.6.11.11, I can run
> tests on shared systems without taking too many risks, but I am a little
> bit lost as to which patches I would need to apply, between your
> patches, Blaisorblade's, al's,.. Which all seem to contain x86_64
> patches. Let me know what I can do to help testing. I am sure I can find
> some new bugs, in fact I am working on a Java VM bug atm.
> 
> Cheers
> Antoine
> 
> 
> On Mon, 2005-05-09 at 01:57 +0100, Antoine Martin wrote:
> > Crashing it is quite easy in fact, reiserfs as root_fs just won't work.
> > I don't think this trace is going to help, but here it is anyway:
> > 
> > [42949381.990000] REISERFS: panic (device Null superblock):
> > reiserfs[175]: assertion !( comp_keys( &MAX_KEY, p_s_key ) && !
> > key_in_buffer(p_s_search_path, p_s_key, p_s_sb) ) failed at
> > fs/reiserfs/stree.c:685:search_by_key: PAP-5130: key is not in the
> > buffer
> > [42949381.990000] Kernel panic - not syncing: BUG!
> > [42949381.990000]  [42949381.990000]
> > 
> > 
> > On Sun, 2005-05-08 at 23:20 +0100, Antoine Martin wrote:
> > > All good, builds fine and hasn't crashed yet!
> > > I'll get back to you with (hopefully) more useful debugging information
> > > as soon as Andi Kleen comes up with a patch for the host bug.
> > > 
> > > Antoine
> > > 
> > > On Sun, 2005-05-08 at 14:35 -0400, Jeff Dike wrote:
> > > > I uploaded a new set of patches which patches, builds, and runs on both
> > > > i386 and x86_64.
> > > > 
> > > > On Sun, May 08, 2005 at 06:37:19PM +0100, Antoine Martin wrote:
> > > > > I noticed that you updated the website with patches against -rc4 so I
> > > > > built the beast, As before, I had to do this to fix it:
> > > > > cd include/asm
> > > > > ln -sf elf-x86_64.h elf.h
> > > > 
> > > > I get an empty elf.h which obviously causes build problems.  Removing it
> > > > and rebuilding fixes it.  I haven't looked at this yet.
> > > > 
> > > > > I also had a reject on:
> > > > > arch/um/sys-x86_64/Makefile
> > > > 
> > > > This should be OK now.
> > > > 
> > > > 				Jeff



-------------------------------------------------------
This SF.Net email is sponsored by Yahoo.
Introducing Yahoo! Search Developer Network - Create apps using Yahoo!
Search APIs Find out how you can build Yahoo! directly into your own
Applications - visit http://developer.yahoo.net/?fr=offad-ysdn-ostg-q22005
_______________________________________________
User-mode-linux-devel mailing list
User-mode-linux-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/user-mode-linux-devel

^ permalink raw reply	[flat|nested] 26+ messages in thread

[parent not found: <1117691482.10122.107.camel@localhost>]

* [uml-devel] Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
       [not found]                     ` <1117691482.10122.107.camel@localhost>
@ 2005-06-02 21:06                       ` Jeff Dike
  2005-06-03  2:33                         ` antoine
  0 siblings, 1 reply; 26+ messages in thread
From: Jeff Dike @ 2005-06-02 21:06 UTC (permalink / raw)
  To: antoine; +Cc: user-mode-linux-devel

On Thu, Jun 02, 2005 at 06:51:22AM +0100, antoine wrote:
> I applied the patches on the website to 2.6.12-rc5 (mm1 and mm2: most
> patches apply cleanly to both, some have been merged in mm2 obviously),
> I got an error here:

I pushed out a fixed set of patches (against -mm2) that work for me.

There were some include changes in stub_segv.c.

The .d file thing is still there.  That seems to have been caused by my 
changing stub_segv.c to a userspace file.

				Jeff


-------------------------------------------------------
This SF.Net email is sponsored by Yahoo.
Introducing Yahoo! Search Developer Network - Create apps using Yahoo!
Search APIs Find out how you can build Yahoo! directly into your own
Applications - visit http://developer.yahoo.net/?fr=offad-ysdn-ostg-q22005
_______________________________________________
User-mode-linux-devel mailing list
User-mode-linux-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/user-mode-linux-devel

^ permalink raw reply	[flat|nested] 26+ messages in thread

* [uml-devel] Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-06-02 21:06                       ` [uml-devel] Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops Jeff Dike
@ 2005-06-03  2:33                         ` antoine
  2005-06-03  2:38                           ` Jeff Dike
  0 siblings, 1 reply; 26+ messages in thread
From: antoine @ 2005-06-03  2:33 UTC (permalink / raw)
  To: Jeff Dike; +Cc: user-mode-linux-devel

Thanks, almost all applied cleanly - I skipped the s390 ones for now.
Compiling still requires a `touch arch/um/sys-x86_64/.stub_segv.o.d` to
make it proceed. Works for me! (tm)
But I can't get it to boot into init properly, I can only boot with
init=/bin/bash. (then remount root rw, mount proc)

Every time I run a command, I get something like:
bash: child setpgid (15484 to 15484): No such process
(I remember seeing this one before - but I can't remember the
solution :-(

I can bring the network up manually (ifconfig) and ping (in and out),
but running any of the /etc/init.d/ scripts locks up, same goes for
starting sshd or even "Hello World" in hello.sh

Sometimes when it goes into a spin, it loos on this (to infinity):
"[42949450.500000] fix_range_common: failed, killing current process"

Not sure where do go from here... Maybe patching with just the minimum?
It needs roughly at least up to fix-tt-USR1-handlers to be able to boot
(otherwise you get the error I posted yesterday).

Antoine




On Thu, 2005-06-02 at 17:06 -0400, Jeff Dike wrote:
> On Thu, Jun 02, 2005 at 06:51:22AM +0100, antoine wrote:
> > I applied the patches on the website to 2.6.12-rc5 (mm1 and mm2: most
> > patches apply cleanly to both, some have been merged in mm2 obviously),
> > I got an error here:
> 
> I pushed out a fixed set of patches (against -mm2) that work for me.
> 
> There were some include changes in stub_segv.c.
> 
> The .d file thing is still there.  That seems to have been caused by my 
> changing stub_segv.c to a userspace file.
> 
> 				Jeff



-------------------------------------------------------
This SF.Net email is sponsored by Yahoo.
Introducing Yahoo! Search Developer Network - Create apps using Yahoo!
Search APIs Find out how you can build Yahoo! directly into your own
Applications - visit http://developer.yahoo.net/?fr=offad-ysdn-ostg-q22005
_______________________________________________
User-mode-linux-devel mailing list
User-mode-linux-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/user-mode-linux-devel

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: [uml-devel] Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-06-03  2:33                         ` antoine
@ 2005-06-03  2:38                           ` Jeff Dike
  2005-06-03 19:28                             ` antoine
  0 siblings, 1 reply; 26+ messages in thread
From: Jeff Dike @ 2005-06-03  2:38 UTC (permalink / raw)
  To: antoine; +Cc: user-mode-linux-devel

On Fri, Jun 03, 2005 at 03:33:36AM +0100, antoine wrote:
> Thanks, almost all applied cleanly - I skipped the s390 ones for now.

I wouldn't cherry-pick patches.  You may be right, but it also means you're
running different code than me.

> I can bring the network up manually (ifconfig) and ping (in and out),
> but running any of the /etc/init.d/ scripts locks up, same goes for
> starting sshd or even "Hello World" in hello.sh

Dunno, it works here, and I've beat on it a bunch.  If you can get stack
traces or anything from the hangs, that might help.

				Jeff


-------------------------------------------------------
This SF.Net email is sponsored by Yahoo.
Introducing Yahoo! Search Developer Network - Create apps using Yahoo!
Search APIs Find out how you can build Yahoo! directly into your own
Applications - visit http://developer.yahoo.net/?fr=offad-ysdn-ostg-q22005
_______________________________________________
User-mode-linux-devel mailing list
User-mode-linux-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/user-mode-linux-devel

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: [uml-devel] Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-06-03  2:38                           ` Jeff Dike
@ 2005-06-03 19:28                             ` antoine
  2005-06-03 16:49                               ` Jeff Dike
  0 siblings, 1 reply; 26+ messages in thread
From: antoine @ 2005-06-03 19:28 UTC (permalink / raw)
  To: Jeff Dike; +Cc: user-mode-linux-devel

On Thu, 2005-06-02 at 22:38 -0400, Jeff Dike wrote:
> On Fri, Jun 03, 2005 at 03:33:36AM +0100, antoine wrote:
> > Thanks, almost all applied cleanly - I skipped the s390 ones for now.
> 
> I wouldn't cherry-pick patches.  You may be right, but it also means you're
> running different code than me.
I wouldn't normally do that, but there is a patch that does not apply
cleanly: fp-state (can't find file to patch at input line 28 ...)
So I started thinking that there were things I did not necessarily need
or want (s390, x11fb, audio, etc) - I guess I was wrong...

I started again, applying everything else, some files complained about
missing skas.h or proc_mm.h, another case of empty file, easy fix:
rm ./arch/um/kernel/skas/include/skas.h

I got tired of doing the same steps over and over again (to be sure I
wasn't reporting my own mistakes) so I made this very simple script to
untar kernel source + apply MM + apply UML patches:
http://213.228.237.37/uml/2.6.12-rc5-mm2/make-uml64.sh
I've also dumped the compile errors, kernel binary, .config in the same
directory.

Everything kinda works now! And it is *very* fast (compared to TT x86
guest) And this time it is a bit more stable.
I think there is still a memory leak lurking, OOM seems to kick in (see
bottom of this email)

Very minor thing: I noticed that the TT threads generally do not show
which process they are running inside the UML (as it used to), any way
to restore this?

Thanks
Antoine


> 
> > I can bring the network up manually (ifconfig) and ping (in and out),
> > but running any of the /etc/init.d/ scripts locks up, same goes for
> > starting sshd or even "Hello World" in hello.sh
> 
> Dunno, it works here, and I've beat on it a bunch.  If you can get stack
> traces or anything from the hangs, that might help.
> 
> 				Jeff
*********
* OOM
* 128RAM + 128swap should be enough to emerge things...
*********

[42951416.180000] oom-killer: gfp_mask=0xd2
[42951416.180000] DMA per-cpu:
[42951416.180000] cpu 0 hot: low 30, high 90, batch 15
[42951416.180000] cpu 0 cold: low 0, high 30, batch 15
[42951416.180000] Normal per-cpu: empty
[42951416.180000] HighMem per-cpu: empty
[42951416.180000]
[42951416.180000] Free pages:        1432kB (0kB HighMem)
[42951416.180000] Active:166 inactive:333 dirty:0 writeback:0 unstable:0
free:358 slab:999 mapped:82 pagetables:9547
[42951416.180000] DMA free:1432kB min:1448kB low:1808kB high:2172kB
active:664kB inactive:1332kB present:131072kB pages_scanned:1301
[42951416.180000] lowmem_reserve[]: 0 0 0
[42951416.180000] Normal free:0kB min:0kB low:0kB high:0kB active:0kB
inactive:0kB present:0kB pages_scanned:0
[42951416.180000] lowmem_reserve[]: 0 0 0
[42951416.180000] HighMem free:0kB min:128kB low:160kB high:192kB
active:0kB inactive:0kB present:0kB pages_scanned:0
[42951416.180000] lowmem_reserve[]: 0 0 0
[42951416.180000] DMA: 0*4kB 5*8kB 1*16kB 1*32kB 1*64kB 0*128kB 1*256kB
0*512kB 1*1024kB 0*2048kB 0*4096kB = 1432kB
[42951416.180000] Normal: empty
[42951416.180000] HighMem: empty
[42951416.180000] Swap cache: add 314384, delete 314009, find
48066/104534, race 0+0
[42951416.180000] Free swap  = 112048kB
[42951416.180000] Total swap = 131064kB
[42951416.180000] Out of Memory: Killed process 978 (emerge).
[42951416.180000] oom-killer: gfp_mask=0xd2
[42951416.180000] DMA per-cpu:
[42951416.180000] cpu 0 hot: low 30, high 90, batch 15
[42951416.180000] cpu 0 cold: low 0, high 30, batch 15
[42951416.180000] Normal per-cpu: empty
[42951416.180000] HighMem per-cpu: empty
[42951416.180000]
[42951416.180000] Free pages:        1432kB (0kB HighMem)
[42951416.180000] Active:166 inactive:333 dirty:0 writeback:0 unstable:0
free:358 slab:1000 mapped:82 pagetables:9519
[42951416.180000] DMA free:1432kB min:1448kB low:1808kB high:2172kB
active:664kB inactive:1332kB present:131072kB pages_scanned:1301
[42951416.180000] lowmem_reserve[]: 0 0 0
[42951416.180000] Normal free:0kB min:0kB low:0kB high:0kB active:0kB
inactive:0kB present:0kB pages_scanned:0
[42951416.180000] lowmem_reserve[]: 0 0 0
[42951416.180000] HighMem free:0kB min:128kB low:160kB high:192kB
active:0kB inactive:0kB present:0kB pages_scanned:0
[42951416.180000] lowmem_reserve[]: 0 0 0
[42951416.180000] DMA: 0*4kB 5*8kB 1*16kB 1*32kB 1*64kB 0*128kB 1*256kB
0*512kB 1*1024kB 0*2048kB 0*4096kB = 1432kB
[42951416.180000] Normal: empty
[42951416.180000] HighMem: empty
[42951416.180000] Swap cache: add 314384, delete 314009, find
48066/104535, race 0+0
[42951416.180000] Free swap  = 123236kB
[42951416.180000] Total swap = 131064kB
[42951416.180000] Out of Memory: Killed process 5713 (portageq).




-------------------------------------------------------
This SF.Net email is sponsored by Yahoo.
Introducing Yahoo! Search Developer Network - Create apps using Yahoo!
Search APIs Find out how you can build Yahoo! directly into your own
Applications - visit http://developer.yahoo.net/?fr=offad-ysdn-ostg-q22005
_______________________________________________
User-mode-linux-devel mailing list
User-mode-linux-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/user-mode-linux-devel

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: [uml-devel] Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-06-03 19:28                             ` antoine
@ 2005-06-03 16:49                               ` Jeff Dike
  2005-06-03 17:58                                 ` Bodo Stroesser
  0 siblings, 1 reply; 26+ messages in thread
From: Jeff Dike @ 2005-06-03 16:49 UTC (permalink / raw)
  To: antoine; +Cc: user-mode-linux-devel

On Fri, Jun 03, 2005 at 08:28:10PM +0100, antoine wrote:
> I think there is still a memory leak lurking, OOM seems to kick in (see
> bottom of this email)

Yeah, it's a page leak, I think from page tables, which I haven't tracked
down yet.

> Very minor thing: I noticed that the TT threads generally do not show
> which process they are running inside the UML (as it used to), any way
> to restore this?

Those aren't tt threads, they are skas0 threads, and we could probably restore 
the command names, but I haven't bothered doing so.

				Jeff


-------------------------------------------------------
This SF.Net email is sponsored by: NEC IT Guy Games.  How far can you shotput
a projector? How fast can you ride your desk chair down the office luge track?
If you want to score the big prize, get to know the little guy.  
Play to win an NEC 61" plasma display: http://www.necitguy.com/?r=20
_______________________________________________
User-mode-linux-devel mailing list
User-mode-linux-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/user-mode-linux-devel

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: [uml-devel] Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-06-03 16:49                               ` Jeff Dike
@ 2005-06-03 17:58                                 ` Bodo Stroesser
  0 siblings, 0 replies; 26+ messages in thread
From: Bodo Stroesser @ 2005-06-03 17:58 UTC (permalink / raw)
  To: Jeff Dike; +Cc: antoine, user-mode-linux-devel

Jeff Dike wrote:
> On Fri, Jun 03, 2005 at 08:28:10PM +0100, antoine wrote:
> 
>>I think there is still a memory leak lurking, OOM seems to kick in (see
>>bottom of this email)
> 
> 
> Yeah, it's a page leak, I think from page tables, which I haven't tracked
> down yet.
> 
> 
>>Very minor thing: I noticed that the TT threads generally do not show
>>which process they are running inside the UML (as it used to), any way
>>to restore this?
> 
> 
> Those aren't tt threads, they are skas0 threads, and we could probably restore 
> the command names, but I haven't bothered doing so.
I thought about doing so, as the output of ps really looks bad.
Unfortunately, I found only very nasty ways.
On older host's, we could try to expand or strip down the environment in a similar
way as is used it tt. Then, names could be placed into stub-(stack?)-page.
Nasty, but it could work.
An even worse case is a host using stack-randomization. We need a pointer in the
host poining to stub-pages. If the host has left a "hole" above the stack, would
we have to "exec" until accidentally we get a stack starting at upper address-range
limit?
		Bodo
> 
> 				Jeff
> 
> 
> -------------------------------------------------------
> This SF.Net email is sponsored by: NEC IT Guy Games.  How far can you shotput
> a projector? How fast can you ride your desk chair down the office luge track?
> If you want to score the big prize, get to know the little guy.  
> Play to win an NEC 61" plasma display: http://www.necitguy.com/?r=20
> _______________________________________________
> User-mode-linux-devel mailing list
> User-mode-linux-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/user-mode-linux-devel


-------------------------------------------------------
This SF.Net email is sponsored by: NEC IT Guy Games.  How far can you shotput
a projector? How fast can you ride your desk chair down the office luge track?
If you want to score the big prize, get to know the little guy.  
Play to win an NEC 61" plasma display: http://www.necitguy.com/?r=20
_______________________________________________
User-mode-linux-devel mailing list
User-mode-linux-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/user-mode-linux-devel

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-07 15:57       ` Alexander Nyberg
  2005-05-07 18:03         ` Jeff Dike
@ 2005-05-07 18:06         ` Antoine Martin
  1 sibling, 0 replies; 26+ messages in thread
From: Antoine Martin @ 2005-05-07 18:06 UTC (permalink / raw)
  To: Alexander Nyberg; +Cc: linux-kernel

> I never get uml to compile around here, 2.6.12-rc3 + that patchkit from
> the link you sent blows up with defconfig any my minimal config. Please
> attach your guest .config and if you can you might aswell put your guest
> vmlinux somewhere where i can download it too.
ok, here it is:
http://213.228.237.37/uml/2.6.12-rc3/
kernel and config.

Note: you will need a pure 64bit root fs, ie: gentoo 2005 works fine.


^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-07 16:31     ` 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops Antoine Martin
  2005-05-07 15:57       ` Alexander Nyberg
@ 2005-05-08 14:12       ` Andi Kleen
  2005-05-08 16:35         ` Antoine Martin
  2005-05-08 16:38         ` Jeff Dike
  1 sibling, 2 replies; 26+ messages in thread
From: Andi Kleen @ 2005-05-08 14:12 UTC (permalink / raw)
  To: Antoine Martin; +Cc: linux-kernel, jdike

Antoine Martin <antoine@nagafix.co.uk> writes:
>
> general protection fault: 0000 [1]
> CPU 0
> Pid: 26926, comm: kernel-4 Not tainted 2.6.11.8
> RIP: 0010:[<ffffffff8010ca47>] <ffffffff8010ca47>{__switch_to+311}
> RSP: 0018:ffff8100a7635d48  EFLAGS: 00010016
> RAX: 0000c8e816000002 RBX: ffff8100b895f320 RCX: 00000000c0000102
> RDX: 000000000000c8e8 RSI: 0000000000000000 RDI: 0000000000000000
> RBP: 0000000000000000 R08: ffff810090db3a00 R09: 0000000000006933
> R10: 0000000000000000 R11: 0000000000000202 R12: ffff8100a827b890
> R13: ffff8100b895f010 R14: ffff8100a827b580 R15: ffff8100a827b7f8
> FS:  000000006025212c(0000) GS:ffffffff80785a00(0000)
> knlGS:0000000000000d7e
> CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> CR2: 000000006880d010 CR3: 00000000a2321000 CR4: 00000000000006e0
> Process kernel-4 (pid: 26926, threadinfo ffff810060884000, task
> ffff8100a827b580)
> Stack: ffff8100dc37e180 ffff8100b895f010 ffffffff806b6d50
> ffff8100b895f010
>        000003595b049ed6 ffffffff804f4de4 ffff8100a7635de8
> 0000000000000086
>        0000007500000020 ffff8100b895f010
> Call Trace:<ffffffff804f4de4>{thread_return+0}
> <ffffffff8013cb08>{ptrace_stop+280}
>        <ffffffff8013cde6>{get_signal_to_deliver+358}
> <ffffffff8010d4e3>{do_signal+163}
>        <ffffffff8010e905>{error_exit+0}
> <ffffffff8010de67>{sys_rt_sigreturn+535}
>        <ffffffff8010dee9>{sys_rt_sigreturn+665}
> <ffffffff8010e2b6>{int_signal+18}
>
>
> Code: 0f 30 66 41 89 6c 24 2e 65 48 8b 04 25 20 00 00 00 49 89 44

That is a wrmsr to 0x00000000c0000102 (KERNEL_GS_BASE), the code 
is trying to write 0x0000c8e816000002 into it. That is a non canonical
address, which causes the GPF.

The strange thing is that the kernel should have rejected it in
the first place. The code to allow user space to set kernel gs 
checks for the address being > TASK_SIZE and TASK_SIZE is 0x800000000000.
It should have rejected it in the first place.

Are you sure you did not apply any strange UML related patches
to the host kernel? Maybe those are buggy.

-Andi


^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-08 14:12       ` Andi Kleen
@ 2005-05-08 16:35         ` Antoine Martin
  2005-05-08 15:15           ` Andi Kleen
  2005-05-08 16:45           ` Jeff Dike
  2005-05-08 16:38         ` Jeff Dike
  1 sibling, 2 replies; 26+ messages in thread
From: Antoine Martin @ 2005-05-08 16:35 UTC (permalink / raw)
  To: Andi Kleen; +Cc: linux-kernel, jdike

> (..)
> That is a wrmsr to 0x00000000c0000102 (KERNEL_GS_BASE), the code 
> is trying to write 0x0000c8e816000002 into it. That is a non canonical
> address, which causes the GPF.
> 
> The strange thing is that the kernel should have rejected it in
> the first place. The code to allow user space to set kernel gs 
> checks for the address being > TASK_SIZE and TASK_SIZE is 0x800000000000.
> It should have rejected it in the first place.
> 
> Are you sure you did not apply any strange UML related patches
> to the host kernel? Maybe those are buggy.
The only extra patch applied on top of what is on the web page (as per
Jeff's instructions) is the mconsole-exec patch, and AFAIK it wouldn't
affect the code above.

Alexander Nyberg is also experiencing crashes, aren't you?
Just un-compressing portage (20MB .tar.bz2) on the root_fs image posted
earlier caused a different kind of mis-behaviour, the guest lost network
connectivity, cpu usage shot up on the host (load > 10 now), and I found
this in the host log:
kernel: segfault at 00000000df2948d0 rip 00000000601e5beb
rsp00000000df2948d0 error 4
(same kernel as earlier crashes...)
That's on a different box, running a different host kernel (FC3 2.6.9-?)

The really weird thing is that the processes are still running, but ps
-ef shows an empty string in place of the process name:
(and the terminal which launched the instance got control back)
I am now rebuilding a new kernel on another test box, let me know what
to do to provide better debug information.

# ps -ef
UID        PID  PPID  C STIME TTY          TIME CMD
root         1     0  0 13:27 ?        00:00:00 init [3]
root         2     1  0 13:27 ?        00:00:00 [ksoftirqd/0]
root         3     1  0 13:27 ?        00:00:00 [events/0]
root         4     3  0 13:27 ?        00:00:00 [khelper]
root         5     3  0 13:27 ?        00:00:00 [kacpid]
root        27     3  0 13:27 ?        00:00:00 [kblockd/0]
root        28     1  0 13:27 ?        00:00:00 [khubd]
root        40     3  0 13:27 ?        00:00:00 [pdflush]
root        41     3  0 13:27 ?        00:00:00 [pdflush]
root        43     3  0 13:27 ?        00:00:00 [aio/0]
root        42     1  0 13:27 ?        00:00:00 [kswapd0]
root       115     1  0 13:27 ?        00:00:00 [kseriod]
root       183     3  0 13:27 ?        00:00:00 [ata/0]
root       185     1  0 13:27 ?        00:00:00 [scsi_eh_0]
root       186     1  0 13:27 ?        00:00:00 [scsi_eh_1]
root       202     1  0 13:27 ?        00:00:00 [kjournald]
root      1025     1  0 13:27 ?        00:00:00 udevd
root      1261     1  0 13:27 ?        00:00:00 [khpsbpkt]
root      1290     1  0 13:27 ?        00:00:00 [knodemgrd_0]
root      1453     1  0 13:27 ?        00:00:00 [kjournald]
root      1454     1  0 13:27 ?        00:00:00 [kjournald]
root      1693     1  0 13:28 ?        00:00:00 syslogd -m 0
root      1697     1  0 13:28 ?        00:00:00 klogd -x
nobody    1737     1  0 13:28 ?        00:00:00 mDNSResponder
root      1755     1  0 13:28 ?        00:00:00 /usr/sbin/acpid
root      1770     1  0 13:28 ?        00:00:00 /usr/sbin/sshd
root      1779     1  0 13:28 ?        00:00:00 gpm -m /dev/input/mice
-t imps2
root      1809     1  0 13:28 ?        00:00:00 crond
dbus      1833     1  0 13:28 ?        00:00:00 dbus-daemon-1 --system
root      1842     1  0 13:28 ?        00:00:00 hald
root      1849     1  0 13:28 ?        00:00:00 login -- root
root      1850     1  0 13:28 tty2     00:00:00 /sbin/mingetty tty2
root      1851     1  0 13:28 tty3     00:00:00 /sbin/mingetty tty3
root      1852     1  0 13:28 tty4     00:00:00 /sbin/mingetty tty4
root      1853     1  0 13:28 tty5     00:00:00 /sbin/mingetty tty5
root      1854     1  0 13:28 tty6     00:00:00 /sbin/mingetty tty6
root      2270  1849  0 13:31 tty1     00:00:00 -bash
root      2312  1770  0 13:31 ?        00:00:00 sshd: root@pts/0
root      2314  2312  0 13:32 pts/0    00:00:00 -bash
root      2363  2314 19 13:36 pts/0    00:30:19 ./setiathome
root      4346  1770  0 15:03 ?        00:00:00 sshd: root@pts/1
root      4350  4346  0 15:03 pts/1    00:00:00 -bash
root      5291  4350  0 15:14 pts/1    00:00:00 tail
-f /var/log/messages /var/log/secure
root      6179     1  0 15:16 pts/0    00:00:35
root      7231     1  0 15:16 pts/0    00:00:35
root      7252     1  0 15:16 pts/0    00:00:35
root      7345     1  0 15:17 ?        00:00:00 /usr/sbin/dhcpd
root      7352     1  0 15:17 pts/0    00:00:34
root      7485     1  0 15:17 pts/0    00:00:34
root      7509     1  0 15:17 pts/0    00:00:34
root      7520  1770  0 15:18 ?        00:00:00 sshd: root@pts/2
root      7522  7520  0 15:18 pts/2    00:00:00 -bash
root      7561  7522  0 15:18 pts/2    00:00:00 ssh -v root@10.0.0.250
root      7563     1  1 15:18 pts/0    00:00:35
root      7571     1  1 15:18 pts/0    00:00:35
root     10302  1770  0 15:57 ?        00:00:00 sshd: root@pts/3
root     10304 10302  0 15:57 pts/3    00:00:00 -bash
root     10422     1  8 16:08 pts/0    00:00:38
root     10424     1  9 16:08 pts/0    00:00:42
root     10445  2314  0 16:16 pts/0    00:00:00 ps -ef



^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-08 16:35         ` Antoine Martin
@ 2005-05-08 15:15           ` Andi Kleen
  2005-05-08 16:42             ` Jeff Dike
  2005-05-08 17:38             ` Antoine Martin
  2005-05-08 16:45           ` Jeff Dike
  1 sibling, 2 replies; 26+ messages in thread
From: Andi Kleen @ 2005-05-08 15:15 UTC (permalink / raw)
  To: Antoine Martin; +Cc: linux-kernel, jdike

Antoine Martin <antoine@nagafix.co.uk> writes:

>> (..)
>> That is a wrmsr to 0x00000000c0000102 (KERNEL_GS_BASE), the code 
>> is trying to write 0x0000c8e816000002 into it. That is a non canonical
>> address, which causes the GPF.
>> 
>> The strange thing is that the kernel should have rejected it in
>> the first place. The code to allow user space to set kernel gs 
>> checks for the address being > TASK_SIZE and TASK_SIZE is 0x800000000000.
>> It should have rejected it in the first place.
>> 
>> Are you sure you did not apply any strange UML related patches
>> to the host kernel? Maybe those are buggy.
> The only extra patch applied on top of what is on the web page (as per
> Jeff's instructions) is the mconsole-exec patch, and AFAIK it wouldn't
> affect the code above.
>
> Alexander Nyberg is also experiencing crashes, aren't you?


Ok, the bug is found now. It is a kernel bug that it allows to set
non canonical addresses in 64bit segment registers through ptrace.

But even if I fixed that then it will not help you run UML, because
UML needs to set correct addresses of course, not illegal ones.

I will submit a patch later for the crash problem.

-Andi

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-08 15:15           ` Andi Kleen
@ 2005-05-08 16:42             ` Jeff Dike
  2005-05-08 17:38             ` Antoine Martin
  1 sibling, 0 replies; 26+ messages in thread
From: Jeff Dike @ 2005-05-08 16:42 UTC (permalink / raw)
  To: Andi Kleen; +Cc: Antoine Martin, linux-kernel

On Sun, May 08, 2005 at 05:15:36PM +0200, Andi Kleen wrote:
> Antoine Martin <antoine@nagafix.co.uk> writes:
> Ok, the bug is found now. It is a kernel bug that it allows to set
> non canonical addresses in 64bit segment registers through ptrace.
> 
> But even if I fixed that then it will not help you run UML, because
> UML needs to set correct addresses of course, not illegal ones.

True, but if the host stays up, and maybe printks something (or even returns
 -EIO), that would help track down the UML problem.

				Jeff

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-08 15:15           ` Andi Kleen
  2005-05-08 16:42             ` Jeff Dike
@ 2005-05-08 17:38             ` Antoine Martin
  1 sibling, 0 replies; 26+ messages in thread
From: Antoine Martin @ 2005-05-08 17:38 UTC (permalink / raw)
  To: Andi Kleen; +Cc: linux-kernel, jdike

> Ok, the bug is found now. It is a kernel bug that it allows to set
> non canonical addresses in 64bit segment registers through ptrace.
Is this going to be part of 2.6.11.9 or just 2.6.12?
I've got a good test environment now if you need testers.

Antoine


^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-08 16:35         ` Antoine Martin
  2005-05-08 15:15           ` Andi Kleen
@ 2005-05-08 16:45           ` Jeff Dike
  2005-05-08 19:51             ` Antoine Martin
  1 sibling, 1 reply; 26+ messages in thread
From: Jeff Dike @ 2005-05-08 16:45 UTC (permalink / raw)
  To: Antoine Martin; +Cc: Andi Kleen, linux-kernel

On Sun, May 08, 2005 at 05:35:02PM +0100, Antoine Martin wrote:
> The only extra patch applied on top of what is on the web page (as per
> Jeff's instructions) is the mconsole-exec patch, and AFAIK it wouldn't
> affect the code above.

mconsole-exec, if it's the patch I'm thinking of, is a patch to the UML
kernel, not to the host.

> The really weird thing is that the processes are still running, but ps
> -ef shows an empty string in place of the process name:
> (and the terminal which launched the instance got control back)
> I am now rebuilding a new kernel on another test box, let me know what
> to do to provide better debug information.

It's not unusual for UML processes to have strange names (including empty
ones) on the host.

				Jeff

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-08 16:45           ` Jeff Dike
@ 2005-05-08 19:51             ` Antoine Martin
  0 siblings, 0 replies; 26+ messages in thread
From: Antoine Martin @ 2005-05-08 19:51 UTC (permalink / raw)
  To: Jeff Dike; +Cc: Andi Kleen, linux-kernel

On Sun, 2005-05-08 at 12:45 -0400, Jeff Dike wrote:
> On Sun, May 08, 2005 at 05:35:02PM +0100, Antoine Martin wrote:
> > The only extra patch applied on top of what is on the web page (as per
> > Jeff's instructions) is the mconsole-exec patch, and AFAIK it wouldn't
> > affect the code above.
> 
> mconsole-exec, if it's the patch I'm thinking of, is a patch to the UML
> kernel, not to the host.
Yep, that's the one, I thought the question was about the guest.
The host is running 2.6.11.8 - no extra patches at all.

> > The really weird thing is that the processes are still running, but ps
> > -ef shows an empty string in place of the process name:
> > (and the terminal which launched the instance got control back)
> > I am now rebuilding a new kernel on another test box, let me know what
> > to do to provide better debug information.
> 
> It's not unusual for UML processes to have strange names (including empty
> ones) on the host.
Strange thing is, they had names up to the point where I got the
segfault.

Antoine


^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops
  2005-05-08 14:12       ` Andi Kleen
  2005-05-08 16:35         ` Antoine Martin
@ 2005-05-08 16:38         ` Jeff Dike
  1 sibling, 0 replies; 26+ messages in thread
From: Jeff Dike @ 2005-05-08 16:38 UTC (permalink / raw)
  To: Andi Kleen; +Cc: Antoine Martin, linux-kernel

> Are you sure you did not apply any strange UML related patches
> to the host kernel? Maybe those are buggy.

No, stock x86_64 kernel is horribly unstable running UML.  I haven't seen
anything but output-free hangs, so I haven't had much information to 
contribute.  Antoine is actually getting capturable oopses.

I've tried every recent FC3 kernel, plus stock 2.6.10, and none of them
survive very long with UML running.  Haven't tried stock 2.6.11 or anything
later yet.

				Jeff

^ permalink raw reply	[flat|nested] 26+ messages in thread

end of thread, other threads:[~2005-06-03 17:59 UTC | newest]

Thread overview: 26+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <20050504191828.620C812EE7@sc8-sf-spam2.sourceforge.net>
     [not found] ` <1115248927.12088.52.camel@cobra>
     [not found]   ` <1115392141.12197.3.camel@cobra>
2005-05-07 16:31     ` 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops Antoine Martin
2005-05-07 15:57       ` Alexander Nyberg
2005-05-07 18:03         ` Jeff Dike
2005-05-08  0:18           ` Al Viro
2005-05-08  6:10             ` Al Viro
2005-05-09 21:07               ` Al Viro
2005-05-10  2:26                 ` Al Viro
2005-05-10  3:50                   ` Jeff Dike
2005-05-10 10:02                     ` Al Viro
2005-05-08 16:28             ` Jeff Dike
     [not found]           ` <1115573839.10373.42.camel@cobra>
     [not found]             ` <20050508183533.GA27251@ccure.user-mode-linux.org>
     [not found]               ` <1115590823.10373.68.camel@cobra>
     [not found]                 ` <1115600253.10373.74.camel@cobra>
     [not found]                   ` <1117307236.10041.4.camel@localhost>
2005-06-02 15:52                     ` [uml-devel] Re: Problems applying patches to 2.6.12-rc5-mm1 antoine
     [not found]                     ` <1117691482.10122.107.camel@localhost>
2005-06-02 21:06                       ` [uml-devel] Re: 2.6.11.8 + UML/x86_64 (2.6.12-rc3+) = oops Jeff Dike
2005-06-03  2:33                         ` antoine
2005-06-03  2:38                           ` Jeff Dike
2005-06-03 19:28                             ` antoine
2005-06-03 16:49                               ` Jeff Dike
2005-06-03 17:58                                 ` Bodo Stroesser
2005-05-07 18:06         ` Antoine Martin
2005-05-08 14:12       ` Andi Kleen
2005-05-08 16:35         ` Antoine Martin
2005-05-08 15:15           ` Andi Kleen
2005-05-08 16:42             ` Jeff Dike
2005-05-08 17:38             ` Antoine Martin
2005-05-08 16:45           ` Jeff Dike
2005-05-08 19:51             ` Antoine Martin
2005-05-08 16:38         ` Jeff Dike

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.