* [uml-devel] Possible SKAS3 >= v8 bug
From: Christopher S. Aker @ 2005-08-11 14:13 UTC
To: user-mode-linux-devel; +Cc: Blaisorblade, Jeff Dike
For many months now, I've been dealing with host machine instability.
Essentially, I'm rebooting at least one host a week (see
http://www.linode.com/forums/viewforum.php?f=25 for details). The
affected machines are all running a kernel patched with skas3 version 8
or later. Only my more recent boxes seem to be affected by this bug,
but the only difference between the boxes that crash and the ones
that don't is processor speed (the faster ones crash). The hardware is
otherwise identical. Slower boxes running the identical kernel have great
uptimes (> 120 days). Boxes running skas3 v7 or earlier have uptimes of
over 400 days (!).
Yesterday another box crashed. Due to my crappy remote console unit
requiring me to be connected at the time of a panic to actually capture
it, I had my datacenter plug in a monitor and write down by hand some of
the panic output. There's not much here, but this is what they provided me:
<quote>
(hoangvo-08/10/2005 10:28:22):
Your server has been rebooted and verified to respond to SSH requests.
The error messages I recorded from the console are as follows:
========================================
EFLAGS: 00010292 (2.6.11.11-1-bigmem)
EIP is at 0x0
========================================
I skipped over the information here as I did not feel it would be useful
to you. This is more useful, however:
========================================
Call Trace
[<c0106fd7>] do_syscall_trace+0x97/0x10e
[<c0104934>] math_state_restore+0x24/0x40
[<c0102639>] syscall_trace_entry_+0x11/0x2a
Code: Bad EIP value.
</quote>
Source tree and vmlinux file:
http://www.theshore.net/~caker/uml/2.6.11.11-1-bigmem.tar.bz2 (38M)
http://www.theshore.net/~caker/uml/vmlinux.bz2 (2M)
Built source tree:
http://www.theshore.net/~caker/uml/2.6.11.11-1-bigmem.tar.gz (77M)
Jeff took a look at this yesterday, but I didn't really expect him to
get very far with so little information. I just wanted to get this
out into the open in case anyone else has experienced something similar.
Next host that panics, I'll make sure that the datacenter copies down
the entire panic output.
-Chris
-------------------------------------------------------
SF.Net email is Sponsored by the Better Software Conference & EXPO
September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf
_______________________________________________
User-mode-linux-devel mailing list
User-mode-linux-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/user-mode-linux-devel
* Re: [uml-devel] Possible SKAS3 >= v8 bug
From: Christopher S. Aker @ 2005-08-11 21:07 UTC
To: user-mode-linux-devel; +Cc: Blaisorblade, Jeff Dike
Found this in my remote syslog:
Unable to handle kernel paging request at virtual address 00002000
printing eip:
00000000
*pde = 2a771001
Oops: 0000 [#1]
SMP
Modules linked in:
CPU: 2
EIP: 0200:[<00000000>] Not tainted VLI
EFLAGS: 00010292 (2.6.11.11-1-bigmem)
EIP is at 0x0
eax: ef66b784 ebx: ef7ba000 ecx: ef60f540 edx: 00000000
esi: 00000001 edi: 00000000 ebp: ef7bbf14 esp: ef7bbf9c
ds: 007b es: 007b ss: 0068
Process 2.4.29-linode39 (pid: 11195, threadinfo=ef7ba000 task=ef60f540)
Stack: ef7bbfc4 ef7ba000 c0106fd7 00000005 be7ff890 c0104934 be7ff768 be7ff760
       00000000 c0102639 be7ff768 00000000 40029d00 be7ff760 00000000 be7ff928
       00000000 0000007b 0000007b 0000004e 40100fb1 00000073 00000246 be7ff74c
Call Trace:
[<c0106fd7>] do_syscall_trace+0x97/0x10e
[<c0104934>] math_state_restore+0x24/0x40
[<c0102639>] syscall_trace_entry+0x11/0x2a
Code: Bad EIP value.
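For anyone who wants to look these frames up in the vmlinux file linked
earlier, here is a minimal sketch (not part of the original report) that
pulls the address, symbol, offset and function size out of each Call Trace
line and derives the function start address. The names TRACE_RE and
parse_trace are made up for this illustration, and the regex assumes the
2.6-era i386 "[<addr>] symbol+0xoff/0xsize" layout shown above.

```python
import re

# Matches lines such as "[<c0106fd7>] do_syscall_trace+0x97/0x10e".
# Assumption: 8-digit hex addresses, as printed by a 32-bit kernel.
TRACE_RE = re.compile(
    r"\[<([0-9a-f]{8})>\]\s+(\w+)\+0x([0-9a-f]+)/0x([0-9a-f]+)"
)


def parse_trace(text):
    """Return one dict per call-trace frame found in text."""
    frames = []
    for match in TRACE_RE.finditer(text):
        addr = int(match.group(1), 16)
        offset = int(match.group(3), 16)
        frames.append({
            "addr": addr,
            "symbol": match.group(2),
            "offset": offset,
            "size": int(match.group(4), 16),
            # Start of the function = return address minus offset into it.
            "start": addr - offset,
        })
    return frames


# The three frames from the oops above, copied verbatim.
TRACE = """
[<c0106fd7>] do_syscall_trace+0x97/0x10e
[<c0104934>] math_state_restore+0x24/0x40
[<c0102639>] syscall_trace_entry+0x11/0x2a
"""

frames = parse_trace(TRACE)
for frame in frames:
    print("%-22s addr=%08x start=%08x" % (
        frame["symbol"], frame["addr"], frame["start"]))
```

The resulting addresses can then be fed to whatever symbol or disassembly
tool you normally use against vmlinux.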
-Chris
* Re: [uml-devel] Possible SKAS3 >= v8 bug
From: Christopher S. Aker @ 2005-08-12 0:58 UTC
To: user-mode-linux-devel; +Cc: Blaisorblade, Jeff Dike
Perhaps this is related. I've upgraded one of the affected machines to
2.6.13-rc6. We'll see how it goes...
http://marc.theaimsgroup.com/?t=111720640400002&r=1&w=2
-Chris
* Re: [uml-devel] Possible SKAS3 >= v8 bug
From: Blaisorblade @ 2005-08-21 12:29 UTC
To: Christopher S. Aker; +Cc: user-mode-linux-devel, Jeff Dike
On Friday 12 August 2005 02:58, Christopher S. Aker wrote:
> Perhaps this is related. I've upgraded one of the affected machines to
> 2.6.13-rc6. We'll see how it goes...
>
> http://marc.theaimsgroup.com/?t=111720640400002&r=1&w=2
>
> -Chris
Ok, I'm particularly suspicious of one patch, ptrace_ldt-reload-smp-fix; try
unapplying that one. I don't know exactly *how* it might trigger that crash,
but it looks like data corruption, and that code is doing exactly that.
In addition, the segment selector in the line below is totally fucked up. It is
a ring 0 GDT segment, which however does not exist. It would exist only if
there were a third CPU, and would be a NULL descriptor anyway.
EIP: 0200:[<00000000>] Not tainted VLI
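To make the selector reading concrete, here is a small sketch of how an x86
segment selector such as 0200 splits into its fields (bits 0-1 are the
requested privilege level, bit 2 picks GDT or LDT, the remaining bits are
the descriptor index). The function name decode_selector is invented for
this example, and GDT_ENTRIES = 32 together with the idea that the per-CPU
GDTs sit back to back in memory are assumptions about this kernel series,
not something stated in the oops.

```python
def decode_selector(selector):
    """Split an x86 segment selector into index, table and RPL."""
    return {
        "index": selector >> 3,                       # descriptor index
        "table": "LDT" if selector & 0x4 else "GDT",  # table indicator bit
        "rpl": selector & 0x3,                        # requested privilege level
    }


# Assumption: each CPU's GDT holds 32 descriptors and the per-CPU tables
# are laid out contiguously, so an out-of-range index lands in a later
# CPU's table.
GDT_ENTRIES = 32

info = decode_selector(0x0200)
cpu_slot, entry = divmod(info["index"], GDT_ENTRIES)

print(info)
print("table slot %d (0-based), entry %d" % (cpu_slot, entry))
```

Under those assumptions the selector is index 64 in the GDT at ring 0, which
is entry 0 of the third table slot. Entry 0 of a GDT is always the NULL
descriptor, which agrees with the reading above.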
--
Inform me of my mistakes, so I can keep imitating Homer Simpson's "Doh!".
Paolo Giarrusso, aka Blaisorblade (Skype ID "PaoloGiarrusso", ICQ 215621894)
http://www.user-mode-linux.org/~blaisorblade
* Re: [uml-devel] Possible SKAS3 >= v8 bug
From: Christopher S. Aker @ 2005-08-29 22:12 UTC
To: Blaisorblade; +Cc: user-mode-linux-devel, Jeff Dike
Blaisorblade wrote:
> Ok, I'm particularly suspicious of one patch, ptrace_ldt-reload-smp-fix; try
> unapplying that one. I don't know exactly *how* it might trigger that crash,
> but it looks like data corruption, and that code is doing exactly that.
I recompiled 2.6.12.5 with skas-v8 minus the ptrace_ldt-reload-smp-fix
patch, and experienced another crash. No console output was captured by
my remote logging or console unit.
-Chris