* mm kernel oops with r8169 & named, PREEMPT
@ 2004-10-13 6:00 Danny
2004-10-13 6:32 ` Jeff Garzik
2004-10-13 7:28 ` Francois Romieu
0 siblings, 2 replies; 17+ messages in thread
From: Danny @ 2004-10-13 6:00 UTC (permalink / raw)
To: linux-kernel
This is with the network driver r8169 and linux-2.6.9-rc4-mm1. Same
thing happened with linux-2.6.9-rc3-mm3 (but also locked up).
linux-2.6.8.1-mm4 didn't seem to have this problem. This is very
repeatable, if this is an unknown issue let me know (CC please, not on
the list) and I will jump through the hoops to get a useful oops.
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: mm kernel oops with r8169 & named, PREEMPT
2004-10-13 6:00 mm kernel oops with r8169 & named, PREEMPT Danny
@ 2004-10-13 6:32 ` Jeff Garzik
2004-10-13 7:28 ` Francois Romieu
1 sibling, 0 replies; 17+ messages in thread
From: Jeff Garzik @ 2004-10-13 6:32 UTC (permalink / raw)
To: Danny; +Cc: linux-kernel
Danny wrote:
> This is with the network driver r8169 and linux-2.6.9-rc4-mm1. Same
> thing happened with linux-2.6.9-rc3-mm3 (but also locked up).
> linux-2.6.8.1-mm4 didn't seem to have this problem. This is very
> repeatable, if this is an unknown issue let me know (CC please, not on
> the list) and I will jump through the hoops to get a useful oops.
What happens if you disable preempt?
lspci? config? Any of the other useful info mentioned in the
REPORTING-BUGS file in the kernel tree?
Jeff
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: mm kernel oops with r8169 & named, PREEMPT
2004-10-13 6:00 mm kernel oops with r8169 & named, PREEMPT Danny
2004-10-13 6:32 ` Jeff Garzik
@ 2004-10-13 7:28 ` Francois Romieu
2004-10-13 16:05 ` Danny
1 sibling, 1 reply; 17+ messages in thread
From: Francois Romieu @ 2004-10-13 7:28 UTC (permalink / raw)
To: Danny; +Cc: linux-kernel
Danny <dannydaemonic@gmail.com> :
> This is with the network driver r8169 and linux-2.6.9-rc4-mm1. Same
> thing happened with linux-2.6.9-rc3-mm3 (but also locked up).
> linux-2.6.8.1-mm4 didn't seem to have this problem. This is very
> repeatable, if this is an unknown issue let me know (CC please, not on
> the list) and I will jump through the hoops to get a useful oops.
Try the patch below (courtesy of Jon Mason, whitespaces may be wrong) and
see 1) if things perform better 2) if "timeout" messages appear in the
kernel log.
Oops as well as Cc: netdev@oss.sgi.com are welcome.
--- linux-2.6.9-rc4-mm1/drivers/net/r8169.c 2004-10-12 13:59:57.000000000 -0500
+++ linux-2.6.9-rc4-mm1/drivers/net/r8169.c 2004-10-12 10:51:21.000000000 -0500
@@ -1680,6 +1680,7 @@ static void rtl8169_unmap_tx_skb(struct
pci_unmap_single(pdev, le64_to_cpu(desc->addr), len, PCI_DMA_TODEVICE);
desc->opts2 = 0x00;
+ desc->opts1 = 0x00;
desc->addr = 0x00;
tx_skb->len = 0;
}
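
For readers following the one-line change above, here is a minimal standalone sketch of the idea, not the driver's actual code. The struct layout, the `demo_` names and the ownership-flag value are assumptions made for illustration; only the field names `opts1`, `opts2` and `addr` come from the patch. One plausible reading is that clearing `opts1` along with the other fields keeps a reclaimed Tx descriptor from carrying stale status bits.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical, simplified Tx descriptor. Field names follow the patch;
 * the layout and the flag value below are assumptions for illustration. */
#define DEMO_DESC_OWN 0x80000000u  /* assumed "NIC owns this slot" flag */

struct demo_tx_desc {
	uint32_t opts1;  /* status/command word, assumed to hold the flag */
	uint32_t opts2;
	uint64_t addr;   /* DMA address of the mapped buffer */
};

/* Reset a descriptor once its buffer has been unmapped. Clearing opts1
 * together with opts2 and addr leaves no stale bits in the slot. */
static void demo_reset_tx_desc(struct demo_tx_desc *desc)
{
	desc->opts2 = 0;
	desc->opts1 = 0;  /* corresponds to the line the patch adds */
	desc->addr = 0;
}

/* Returns 1 if the (assumed) ownership flag is still set, 0 otherwise. */
static int demo_desc_owned_by_nic(const struct demo_tx_desc *desc)
{
	return (desc->opts1 & DEMO_DESC_OWN) != 0;
}
```

Without the `opts1` reset, a descriptor initialised with the flag set would still report as owned by the NIC after `demo_reset_tx_desc()`; with it, the slot reads as fully cleared.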
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: mm kernel oops with r8169 & named, PREEMPT
2004-10-13 7:28 ` Francois Romieu
@ 2004-10-13 16:05 ` Danny
2004-10-13 16:17 ` Danny
0 siblings, 1 reply; 17+ messages in thread
From: Danny @ 2004-10-13 16:05 UTC (permalink / raw)
To: Francois Romieu, linux-kernel; +Cc: netdev
On Wed, 13 Oct 2004 09:28:14 +0200, Francois Romieu wrote:
> Try the patch below (courtesy of Jon Mason, whitespaces may be wrong) and
> see 1) if things perform better 2) if "timeout" messages appear in the
> kernel log.
The patch doesn't fix or prevent the oops. Performance might have
been better but I did no formal tests. There were no "timeout"
messages in the kernel log, however I only ran it with this change for
35-45 minutes.
I should mention that in the kernel log, with linux-2.6.8.1-mm4, it
complains "process `named' is using obsolete setsockopt SO_BSDCOMPAT".
However, with the most recent, 2.6.9-rc4-mm1, it doesn't get that
far. An "Unable to handle kernel paging request at virtual address
00017f8c" happens instead. I'm guessing the oops is just killing
named before it gets that far.
I enabled some more debug options in the kernel and I'm getting a 2nd
oops following the first.
I wasn't sure if I should paste the huge oops here, and since the raw
dmesg also shows spin lock errors, I thought I'd just post both on the
web:
http://members.cox.net/valenzdu/oops-raw
http://members.cox.net/valenzdu/oops-processed
I ran it through ksymoops but I don't have a /proc/ksyms and when I
tried using /proc/kallsyms it gave me a format error. I hope this is
helpful, let me know if there is anything else I can do. (CC me
please.)
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: mm kernel oops with r8169 & named, PREEMPT
2004-10-13 16:05 ` Danny
@ 2004-10-13 16:17 ` Danny
2004-10-13 16:18 ` Danny
0 siblings, 1 reply; 17+ messages in thread
From: Danny @ 2004-10-13 16:17 UTC (permalink / raw)
To: Francois Romieu, linux-kernel
I should mention that the oops (there were actually 3 of them not just
2) was without the patch suggested by Francois Romieu. I assumed that
would be more useful as I still get the oops with it. Let me know if
it would help and I'll repatch it and send another oops.
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: mm kernel oops with r8169 & named, PREEMPT
2004-10-13 16:17 ` Danny
@ 2004-10-13 16:18 ` Danny
2004-10-13 16:29 ` Jon Mason
0 siblings, 1 reply; 17+ messages in thread
From: Danny @ 2004-10-13 16:18 UTC (permalink / raw)
To: netdev
I should mention that the oops (there were actually 3 of them not just
2) was without the patch suggested by Francois Romieu. I assumed that
would be more useful as I still get the oops with it. Let me know if
it would help and I'll repatch it and send another oops.
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: mm kernel oops with r8169 & named, PREEMPT
2004-10-13 16:18 ` Danny
@ 2004-10-13 16:29 ` Jon Mason
2004-10-13 18:18 ` Francois Romieu
2004-10-13 20:28 ` mm kernel oops with r8169 & named, PREEMPT Danny
0 siblings, 2 replies; 17+ messages in thread
From: Jon Mason @ 2004-10-13 16:29 UTC (permalink / raw)
To: Danny; +Cc: netdev
Can you confirm that you are running r8169 driver with NAPI and TSO turned on,
along with Preemptable Kernel? Also, I didn't see anything in the Oops
specific to the r8169 driver, do you have another adapter available to run
the same test on? Finally, what is your setup (arch, # of cpus, etc)?
--
Jon Mason
jdmason@us.ibm.com
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: mm kernel oops with r8169 & named, PREEMPT
2004-10-13 16:29 ` Jon Mason
@ 2004-10-13 18:18 ` Francois Romieu
2004-10-13 20:44 ` Danny
[not found] ` <9625752b04101313417be4cf90@mail.gmail.com>
2004-10-13 20:28 ` mm kernel oops with r8169 & named, PREEMPT Danny
1 sibling, 2 replies; 17+ messages in thread
From: Francois Romieu @ 2004-10-13 18:18 UTC (permalink / raw)
To: Jon Mason; +Cc: Danny, netdev, jgarzik
Jon Mason <jdmason@us.ibm.com> :
> Can you confirm that you are running r8169 driver with NAPI and TSO turned on,
> along with Preemptable Kernel? Also, I didn't see anything in the Oops
> specific to the r8169 driver, do you have another adapter available to run
> the same test on? Finally, what is your setup (arch, # of cpus, etc)?
Yep, it does not look closely related to the network driver (there are both
eth1 and unknown eth0 btw).
Danny, can you drop the r8169 driver from 2.6.9-rc4-mm1 into vanilla
2.6.9-rc4 and confirm that it works (preempt should not matter) ?
If it does not, disable ACPI, preempt, profiling and publish the new oops.
--
Ueimor
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: mm kernel oops with r8169 & named, PREEMPT
2004-10-13 16:29 ` Jon Mason
2004-10-13 18:18 ` Francois Romieu
@ 2004-10-13 20:28 ` Danny
[not found] ` <200410131703.21726.jdmason@us.ltcfwd.linux.ibm.com>
1 sibling, 1 reply; 17+ messages in thread
From: Danny @ 2004-10-13 20:28 UTC (permalink / raw)
To: linux-kernel; +Cc: netdev
On Wed, 13 Oct 2004 11:29:05 -0500, Jon Mason wrote:
> Can you confirm that you are running r8169 driver with NAPI and TSO turned on,
> along with Preemptable Kernel? Also, I didn't see anything in the Oops
> specific to the r8169 driver, do you have another adapter available to run
> the same test on? Finally, what is your setup (arch, # of cpus, etc)?
I am using the r8169 driver with NAPI and preemptable kernel. I don't
remember or see any option for offloading but it sounds like something
I'd turn on. Let me know what I should look for in my config, or
check here:
http://members.cox.net/valenzdu/.config
I no longer have a 2nd nic adapter, but I did first notice the problem
when I installed this one. However, if I don't load that module I'm
not able to duplicate the problem by running named on the other
interface. Running named triggers it 100% of the time on the r8169
based nic.
The arch is x86 (athlon) with just 1 cpu. Since I'm not sure what's
wrong here I'm not sure what's relevant. Here is more info about my
setup:
http://members.cox.net/valenzdu/proc-cpuinfo
http://members.cox.net/valenzdu/proc-iomem
http://members.cox.net/valenzdu/proc-ioports
http://members.cox.net/valenzdu/proc-modules
http://members.cox.net/valenzdu/proc-version
http://members.cox.net/valenzdu/stdout-lspci
http://members.cox.net/valenzdu/stdout-ver_linux
^ permalink raw reply [flat|nested] 17+ messages in thread
end of thread, other threads:[~2004-10-16 15:52 UTC | newest]
Thread overview: 17+ messages
2004-10-13 6:00 mm kernel oops with r8169 & named, PREEMPT Danny
2004-10-13 6:32 ` Jeff Garzik
2004-10-13 7:28 ` Francois Romieu
2004-10-13 16:05 ` Danny
2004-10-13 16:17 ` Danny
2004-10-13 16:18 ` Danny
2004-10-13 16:29 ` Jon Mason
2004-10-13 18:18 ` Francois Romieu
2004-10-13 20:44 ` Danny
[not found] ` <9625752b04101313417be4cf90@mail.gmail.com>
[not found] ` <20041013205433.GC30761@electric-eye.fr.zoreil.com>
2004-10-13 21:59 ` Danny
2004-10-14 22:04 ` Danny
2004-10-15 16:18 ` Francois Romieu
2004-10-15 21:07 ` Danny
2004-10-16 10:01 ` profile path bug again (was Re: mm kernel oops with r8169 & named, PREEMPT) Francois Romieu
2004-10-16 15:52 ` Danny
2004-10-13 20:28 ` mm kernel oops with r8169 & named, PREEMPT Danny
[not found] ` <200410131703.21726.jdmason@us.ltcfwd.linux.ibm.com>
2004-10-14 21:54 ` Danny