* [uml-devel] SKAS race condition status?
@ 2005-10-17 22:50 Christopher S. Aker
2005-10-25 16:44 ` Christopher S. Aker
0 siblings, 1 reply; 8+ messages in thread
From: Christopher S. Aker @ 2005-10-17 22:50 UTC (permalink / raw)
To: user-mode-linux-devel
Has there been any progress on the race in SKAS? I've been dealing with
2-4 hosts crashing a week. Same thing happens on my old SKAS kernel,
2.6.7 with skas3-v4 or so.
From:
http://marc.theaimsgroup.com/?t=112376981300002&r=1&w=2
Unable to handle kernel paging request at virtual address 00002000
printing eip:
00000000
*pde = 2a771001
Oops: 0000 [#1]
SMP
Modules linked in:
CPU: 2
EIP: 0200:[<00000000>] Not tainted VLI
EFLAGS: 00010292 (2.6.11.11-1-bigmem)
EIP is at 0x0
eax: ef66b784 ebx: ef7ba000 ecx: ef60f540 edx: 00000000
esi: 00000001 edi: 00000000 ebp: ef7bbf14 esp: ef7bbf9c
ds: 007b es: 007b ss: 0068
Process 2.4.29-linode39 (pid: 11195, threadinfo=ef7ba000 task=ef60f540)
Stack: ef7bbfc4 ef7ba000 c0106fd7 00000005 be7ff890 c0104934 be7ff768 be7ff760
       00000000 c0102639 be7ff768 00000000 40029d00 be7ff760 00000000 be7ff928
       00000000 0000007b 0000007b 0000004e 40100fb1 00000073 00000246 be7ff74c
Call Trace:
[<c0106fd7>] do_syscall_trace+0x97/0x10e
[<c0104934>] math_state_restore+0x24/0x40
[<c0102639>] syscall_trace_entry+0x11/0x2a
Code: Bad EIP value.
Is there code I could add there to make it oops rather than panic? I'd prefer
to have just one UML go down rather than the entire host.
Thanks!
-Chris
-------------------------------------------------------
This SF.Net email is sponsored by:
Power Architecture Resource Center: Free content, downloads, discussions,
and more. http://solutions.newsforge.com/ibmarch.tmpl
_______________________________________________
User-mode-linux-devel mailing list
User-mode-linux-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/user-mode-linux-devel
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [uml-devel] SKAS race condition status?
2005-10-17 22:50 [uml-devel] SKAS race condition status? Christopher S. Aker
@ 2005-10-25 16:44 ` Christopher S. Aker
2005-10-25 16:57 ` Christopher S. Aker
2005-10-25 21:58 ` Peter
0 siblings, 2 replies; 8+ messages in thread
From: Christopher S. Aker @ 2005-10-25 16:44 UTC (permalink / raw)
To: user-mode-linux-devel; +Cc: peter.spamcatcher, Blaisorblade
Christopher S. Aker wrote:
> Has there been any progress on the race in SKAS? I've been dealing with
> 2-4 hosts crashing a week. Same thing happens on my old SKAS kernel,
> 2.6.7 with skas3-v4 or so.
Jeff and I realized that this only happens on Xeon processors where the
model number is > 2. None of my model 2 Xeon hosts crash, but my model
3 and model 4 Xeons do. Run "grep model /proc/cpuinfo" to check. I've had
skas3-v4 and skas3-v8.2 generate the same panic, so it's something
specific to the processor model.
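A minimal sketch of the model check described above, for anyone who wants to script it across hosts. It assumes the /proc/cpuinfo layout shown elsewhere in this thread. The function names and the "family 15, model > 2" rule are illustrative assumptions drawn from the observation above, not a confirmed diagnosis.

```python
def parse_cpuinfo(text):
    """Split /proc/cpuinfo text into one dict per logical processor."""
    cpus, current = [], {}
    for line in text.splitlines():
        if not line.strip():
            # A blank line ends one processor stanza.
            if current:
                cpus.append(current)
                current = {}
            continue
        key, sep, value = line.partition(":")
        if sep:
            current[key.strip()] = value.strip()
    if current:
        cpus.append(current)
    return cpus


def is_suspect(cpu):
    """Hypothetical rule: Intel family 15 with model number > 2."""
    return (cpu.get("vendor_id") == "GenuineIntel"
            and cpu.get("cpu family") == "15"
            and int(cpu.get("model", "0")) > 2)


# Abbreviated sample in the same shape as the dumps in this thread.
SAMPLE = """processor : 0
vendor_id : GenuineIntel
cpu family : 15
model : 3
model name : Intel(R) Xeon(TM) CPU 2.80GHz
"""

for cpu in parse_cpuinfo(SAMPLE):
    print(cpu["processor"], cpu["model"], is_suspect(cpu))
```

On a real host, the text would come from reading /proc/cpuinfo instead of SAMPLE.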
Peter, can you verify the problems you've had are on hosts with a model
3 or 4 Xeon processor? Also, with regard to 2.6.12, I'm not sure we're
looking at the same issue, as I had problems with that kernel too but
wasn't able to verify it was the same panic. I have had 2.6.13 working,
and it does generate the same SKAS panic eventually.
Thanks,
-Chris
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [uml-devel] SKAS race condition status?
2005-10-25 16:44 ` Christopher S. Aker
@ 2005-10-25 16:57 ` Christopher S. Aker
2005-10-25 17:58 ` Blaisorblade
2005-10-25 21:58 ` Peter
1 sibling, 1 reply; 8+ messages in thread
From: Christopher S. Aker @ 2005-10-25 16:57 UTC (permalink / raw)
To: caker; +Cc: user-mode-linux-devel, peter.spamcatcher, Blaisorblade
/proc/cpuinfo -- model 3
processor : 0
vendor_id : GenuineIntel
cpu family : 15
model : 3
model name : Intel(R) Xeon(TM) CPU 2.80GHz
stepping : 4
cpu MHz : 2801.996
cache size : 1024 KB
physical id : 0
siblings : 2
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 5
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe lm pni monitor ds_cpl cid xtpr
bogomips : 5521.40
/proc/cpuinfo -- model 4
processor : 0
vendor_id : GenuineIntel
cpu family : 15
model : 4
model name : Intel(R) Xeon(TM) CPU 3.20GHz
stepping : 3
cpu MHz : 3200.832
cache size : 2048 KB
physical id : 0
siblings : 2
core id : 0
cpu cores : 1
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 5
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx lm pni monitor ds_cpl cid cx16 xtpr
bogomips : 6409.33
-Chris
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [uml-devel] SKAS race condition status?
2005-10-25 16:57 ` Christopher S. Aker
@ 2005-10-25 17:58 ` Blaisorblade
2005-10-25 21:37 ` Christopher S. Aker
0 siblings, 1 reply; 8+ messages in thread
From: Blaisorblade @ 2005-10-25 17:58 UTC (permalink / raw)
To: user-mode-linux-devel; +Cc: Christopher S. Aker, peter.spamcatcher
On Tuesday 25 October 2005 18:57, Christopher S. Aker wrote:
> /proc/cpuinfo -- model 3
After some research, it seems that the matching errata document is this one:
http://www.intel.com/design/xeon/specupdt/302402.htm
which would imply you have a 64-bit enabled processor...
Note 1: that's not the only document matching, but the other is for 64-bit
processors too
Note 2: I couldn't match steppings between those docs and cpuinfo...
--
Inform me of my mistakes, so I can keep imitating Homer Simpson's "Doh!".
Paolo Giarrusso, aka Blaisorblade (Skype ID "PaoloGiarrusso", ICQ 215621894)
http://www.user-mode-linux.org/~blaisorblade
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [uml-devel] SKAS race condition status?
2005-10-25 17:58 ` Blaisorblade
@ 2005-10-25 21:37 ` Christopher S. Aker
0 siblings, 0 replies; 8+ messages in thread
From: Christopher S. Aker @ 2005-10-25 21:37 UTC (permalink / raw)
To: Blaisorblade; +Cc: user-mode-linux-devel, peter.spamcatcher
Blaisorblade wrote:
> After some research, it seems that the matching errata document is this one:
>
> http://www.intel.com/design/xeon/specupdt/302402.htm
>
> which would imply you have a 64-bit enabled processor...
>
> Note 1: that's not the only document matching, but the other is for 64-bit
> processors too
>
> Note 2: I couldn't match steppings between those docs and cpuinfo...
OK. I matched up some purchase orders with Intel part numbers:
BX80546KG3200FU <-- model 4
BX80546KG3200EU <-- model 3 (I think)
I'll start disabling HyperThreading on hosts that crash next.
-Chris
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [uml-devel] SKAS race condition status?
2005-10-25 16:44 ` Christopher S. Aker
2005-10-25 16:57 ` Christopher S. Aker
@ 2005-10-25 21:58 ` Peter
2005-10-25 22:10 ` Christopher S. Aker
1 sibling, 1 reply; 8+ messages in thread
From: Peter @ 2005-10-25 21:58 UTC (permalink / raw)
To: Christopher S. Aker; +Cc: user-mode-linux-devel, Blaisorblade
I have skas3-v8 host servers with model 4 processors and good (100+ day)
uptimes. The host server I had issues with (host38) got converted to
Xen and is still having problems. I am seeing issues with host34 that
require the odd restart. That server runs the same CPU model and kernel
code as other servers of mine with good uptime.
My conclusion is that I must be seeing hardware issues rather than a
skas bug.
- Peter
# for ((i=34;i<41;i++)); do echo host$i $(ssh host$i 'echo $(cat /proc/cpuinfo | grep model | grep -v name | head -n 1) $(uname -r) $(uptime)'); done
host34 model : 2 2.6.11.10-skas3-v8 21:52:37 up 15:19, 0 users, load average: 1.64, 1.60, 1.32
host35 model : 4 2.6.11.7-skas3-v8 21:49:21 up 42 days, 12:15, 0 users, load average: 13.72, 9.66, 5.21
host36 model : 4 2.6.11.12-skas3-v8.2 21:49:23 up 61 days, 0 min, 0 users, load average: 1.26, 1.44, 1.78
host37 model : 4 2.6.11.7-skas3-v8 21:49:25 up 121 days, 9:48, 0 users, load average: 0.91, 1.04, 1.14
host38 model : 2 2.6.11.12-xen0 21:49:26 up 7 days, 13:28, 0 users, load average: 0.02, 0.05, 0.01
host39 model : 4 2.6.11.7-skas3-v8 21:49:28 up 106 days, 2:46, 0 users, load average: 0.59, 0.65, 0.70
host40 model : 4 2.6.11.7-skas3-v8 21:49:39 up 121 days, 9:47, 0 users, load average: 0.27, 0.43, 0.47
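For comparing hosts at a glance, the per-host report above can be reduced to (host, model, kernel, uptime in days) tuples. This is only a sketch: it assumes each host's report sits on a single line in the shape printed by the loop, and the regular expression and function name are illustrative, not part of any tool mentioned in this thread.

```python
import re

# One report line: "<host> model : <n> <kernel> <clock> up [<d> day(s), ]..."
LINE = re.compile(
    r"^(?P<host>\S+) model\s*: (?P<model>\d+) (?P<kernel>\S+) "
    r"\S+ up (?:(?P<days>\d+) days?, )?"
)


def parse_report(text):
    """Return (host, model, kernel, uptime_days) for each matching line."""
    rows = []
    for line in text.splitlines():
        m = LINE.match(line)
        if m:
            # Uptimes under one day carry no "N days," part; count them as 0.
            days = int(m.group("days") or 0)
            rows.append((m.group("host"), int(m.group("model")),
                         m.group("kernel"), days))
    return rows


# Two lines copied from the report above, joined onto single lines.
SAMPLE = (
    "host34 model : 2 2.6.11.10-skas3-v8 21:52:37 up 15:19, 0 users, "
    "load average: 1.64, 1.60, 1.32\n"
    "host37 model : 4 2.6.11.7-skas3-v8 21:49:25 up 121 days, 9:48, 0 users, "
    "load average: 0.91, 1.04, 1.14\n"
)

for row in parse_report(SAMPLE):
    print(row)
```

Sorting the tuples by model and uptime makes it easier to see whether short uptimes cluster on one CPU model.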
Christopher S. Aker wrote:
> Christopher S. Aker wrote:
>
>> Has there been any progress on the race in SKAS? I've been dealing
>> with 2-4 hosts crashing a week. Same thing happens on my old SKAS
>> kernel, 2.6.7 with skas3-v4 or so.
>
>
> Jeff and I realized that this only happens on Xeon processors where the
> model number is > 2. None of my model 2 Xeon hosts crash, but my model
> 3 and model 4 xeons do. grep model /proc/cpuinfo for info. I've had
> skas3-v4 and skas3-v8.2 generate the same panic, so it's something
> processor model specific.
>
> Peter, can you verify the problems you've had are on hosts with a model
> 3 or 4 Xeon processor? Also, with regard to 2.6.12, I'm not sure we're
> looking at the same issue, as I had problems with that kernel too but
> wasn't able to verify it was the same panic. I have had 2.6.13 working,
> and it does generate the same SKAS panic eventually.
>
> Thanks,
> -Chris
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [uml-devel] SKAS race condition status?
2005-10-25 21:58 ` Peter
@ 2005-10-25 22:10 ` Christopher S. Aker
2005-10-25 22:25 ` Peter
0 siblings, 1 reply; 8+ messages in thread
From: Christopher S. Aker @ 2005-10-25 22:10 UTC (permalink / raw)
To: Peter; +Cc: user-mode-linux-devel, Blaisorblade
Peter wrote:
> I have skas3-v8 host servers with model 4 processors and good (100+ day)
> uptimes. The host server I had issues with (host38) got converted to
> Xen and is still having problems. I am seeing issues with host34 that
> require the odd restart. That server runs the same CPU model and kernel
> code as other servers of mine with good uptime.
>
> My conclusion is that I must be seeing hardware issues rather than a
> skas bug.
By chance, do you have Hyperthreading disabled on your model 4 Xeon boxes?
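One way to answer this from the host itself, sketched under assumptions: in the cpuinfo dumps earlier in this thread, "siblings" is 2 while "cpu cores" is 1, which suggests two logical CPUs per core, meaning HyperThreading is active. The helper names below are hypothetical, and treating a missing "cpu cores" field as 1 is an assumption that only fits single-core parts like these.

```python
def cpu_fields(stanza):
    """Parse one processor stanza of /proc/cpuinfo into a dict."""
    out = {}
    for line in stanza.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            out[key.strip()] = value.strip()
    return out


def ht_active(cpu, default_cores=1):
    """True if siblings exceed cores, False if not, None if unknown."""
    if "siblings" not in cpu:
        return None
    siblings = int(cpu["siblings"])
    # Older kernels omit "cpu cores"; fall back to the assumed default.
    cores = int(cpu.get("cpu cores", default_cores))
    return siblings > cores


# Relevant fields from the model 4 and model 3 dumps in this thread.
MODEL4 = "siblings : 2\ncpu cores : 1\n"
MODEL3 = "siblings : 2\n"

print(ht_active(cpu_fields(MODEL4)))
print(ht_active(cpu_fields(MODEL3)))
```

With HyperThreading switched off in the BIOS, "siblings" would be expected to equal "cpu cores", so the function would return False.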
-Chris
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [uml-devel] SKAS race condition status?
2005-10-25 22:10 ` Christopher S. Aker
@ 2005-10-25 22:25 ` Peter
0 siblings, 0 replies; 8+ messages in thread
From: Peter @ 2005-10-25 22:25 UTC (permalink / raw)
To: Christopher S. Aker; +Cc: user-mode-linux-devel, Blaisorblade
HT is not disabled on any of my boxes.
Christopher S. Aker wrote:
> Peter wrote:
>
>> I have skas3-v8 host servers with model 4 processors and good (100+
>> day) uptimes. The host server I had issues with (host38) got
>> converted to Xen and is still having problems. I am seeing issues
>> with host34 that require the odd restart. That server runs the same
>> CPU model and kernel code as other servers of mine with good uptime.
>>
>> My conclusion is that I must be seeing hardware issues rather than a
>> skas bug.
>
>
> By chance, do you have Hyperthreading disabled on your model 4 Xeon boxes?
>
> -Chris
^ permalink raw reply [flat|nested] 8+ messages in thread
end of thread, other threads:[~2005-10-25 22:25 UTC | newest]