* Re: All Applied micro boards are failing with current mainline kernel [not found] ` <6631e123845856ac20423c25e07844c8@mail.gmail.com> @ 2010-11-01 13:22 ` Josh Boyer 2010-11-01 15:05 ` Josh Boyer 0 siblings, 1 reply; 7+ messages in thread From: Josh Boyer @ 2010-11-01 13:22 UTC (permalink / raw) To: Rupjyoti Sarmah; +Cc: linuxppc-dev, linux-kernel On Fri, Oct 29, 2010 at 11:06 PM, Rupjyoti Sarmah <rsarmah@apm.com> wrote: > Hi , > > APM boards Canyonlands/Kilauea/Glacier/Katmai/Sequoia are all failing > during booting. What kernel version? What config? Have you tried a git bisect to see when it broke? Etc, etc. Also, CC'ing linuxppc-dev would have been a good idea. Not many on lkml are even going to know what you're talking about with such sparse details. josh > > Call trace is same for all > > > Call Trace: > > [df835d70] [c02d27cc] emac_probe+0xf28/0x12a8 (unreliable) > > [df835e50] [c023c7cc] platform_driver_probe_shim+0x40/0x54 > > [df835e60] [c01bf354] platform_drv_probe+0x20/0x30 > > [df835e70] [c01bde68] driver_probe_device+0x148/0x1ac > > [df835e90] [c01be17c] __driver_attach+0xa4/0xa8 > > [df835eb0] [c01bcfa4] bus_for_each_dev+0x60/0x9c > > [df835ee0] [c01bdbbc] driver_attach+0x24/0x34 > > [df835ef0] [c01bd94c] bus_add_driver+0x1b8/0x274 > > [df835f20] [c01be3d8] driver_register+0x6c/0x160 > > [df835f40] [c01bf6c4] platform_driver_register+0x68/0x78 > > [df835f50] [c023c998] of_register_platform_driver+0xa8/0xc4 > > [df835f60] [c0393e88] emac_init+0x1ac/0x1dc > > [df835fa0] [c0001574] do_one_initcall+0x160/0x1a8 > > [df835fd0] [c037a1e8] kernel_init+0xcc/0x174 > > [df835ff0] [c000c5b0] kernel_thread+0x4c/0x68 > > Instruction dump: > > 419e016c 2f800007 419e0164 38130774 901a00d0 381306b0 901a00d4 7f43d378 > > 4bf91d89 817a01a0 39200001 380b0008 <7d400028> 7d4a4b78 7d40012d 40a2fff4 > > ---[ end trace dac0cf4779f83901 ]--- > > > Regards, > Rup > -- > To unsubscribe from this list: send the line "unsubscribe linux-kernel" i= n > the body of a message to majordomo@vger.kernel.org > More majordomo info at =A0http://vger.kernel.org/majordomo-info.html > Please read the FAQ at =A0http://www.tux.org/lkml/ > ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: All Applied micro boards are failing with current mainline kernel 2010-11-01 13:22 ` All Applied micro boards are failing with current mainline kernel Josh Boyer @ 2010-11-01 15:05 ` Josh Boyer 2010-11-01 15:36 ` Stephen Rothwell 0 siblings, 1 reply; 7+ messages in thread From: Josh Boyer @ 2010-11-01 15:05 UTC (permalink / raw) To: Rupjyoti Sarmah, Tom Herbert, Eric Dumazet, David S. Miller Cc: linuxppc-dev, linux-kernel On Mon, Nov 1, 2010 at 9:22 AM, Josh Boyer <jwboyer@gmail.com> wrote: > On Fri, Oct 29, 2010 at 11:06 PM, Rupjyoti Sarmah <rsarmah@apm.com> wrote= : >> Hi , >> >> APM boards Canyonlands/Kilauea/Glacier/Katmai/Sequoia are all failing >> during booting. > > What kernel version? =A0What config? =A0Have you tried a git bisect to se= e > when it broke? =A0Etc, etc. >> >> Call trace is same for all >> >> >> Call Trace: >> >> [df835d70] [c02d27cc] emac_probe+0xf28/0x12a8 (unreliable) >> >> [df835e50] [c023c7cc] platform_driver_probe_shim+0x40/0x54 >> >> [df835e60] [c01bf354] platform_drv_probe+0x20/0x30 >> >> [df835e70] [c01bde68] driver_probe_device+0x148/0x1ac >> >> [df835e90] [c01be17c] __driver_attach+0xa4/0xa8 >> >> [df835eb0] [c01bcfa4] bus_for_each_dev+0x60/0x9c >> >> [df835ee0] [c01bdbbc] driver_attach+0x24/0x34 >> >> [df835ef0] [c01bd94c] bus_add_driver+0x1b8/0x274 >> >> [df835f20] [c01be3d8] driver_register+0x6c/0x160 >> >> [df835f40] [c01bf6c4] platform_driver_register+0x68/0x78 >> >> [df835f50] [c023c998] of_register_platform_driver+0xa8/0xc4 >> >> [df835f60] [c0393e88] emac_init+0x1ac/0x1dc >> >> [df835fa0] [c0001574] do_one_initcall+0x160/0x1a8 >> >> [df835fd0] [c037a1e8] kernel_init+0xcc/0x174 >> >> [df835ff0] [c000c5b0] kernel_thread+0x4c/0x68 >> >> Instruction dump: >> >> 419e016c 2f800007 419e0164 38130774 901a00d0 381306b0 901a00d4 7f43d378 >> >> 4bf91d89 817a01a0 39200001 380b0008 <7d400028> 7d4a4b78 7d40012d 40a2fff= 4 >> >> ---[ end trace dac0cf4779f83901 ]--- A git bisect between 2.6.36 (working) and Linus tip (traceback) points to: e6484930d7c73d324bccda7d43d131088da697b9 net: allocate tx queues in register_netdevice as causing this. I'm not entirely sure why yet, but the commit message seems slightly off to me. It claims to make TX queue allocation identical to RX, but from what I can tell, most of the RX queue logic is hidden behind CONFIG_RPS, which is not set in my config at all (and can't be due to a dep on CONFIG_SMP). This change doesn't guard anything behind that. A few hints would be appreciated. josh ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: All Applied micro boards are failing with current mainline kernel 2010-11-01 15:05 ` Josh Boyer @ 2010-11-01 15:36 ` Stephen Rothwell 2010-11-01 15:39 ` Josh Boyer 2010-11-01 15:50 ` David Miller 0 siblings, 2 replies; 7+ messages in thread From: Stephen Rothwell @ 2010-11-01 15:36 UTC (permalink / raw) To: Josh Boyer Cc: Eric Dumazet, Rupjyoti Sarmah, linux-kernel, linuxppc-dev, David S. Miller, Tom Herbert [-- Attachment #1: Type: text/plain, Size: 379 bytes --] Hi Josh, On Mon, 1 Nov 2010 11:05:53 -0400 Josh Boyer <jwboyer@gmail.com> wrote: > > A few hints would be appreciated. Remove the call to netif_stop_queue() from emac_probe(). Apparently, calling this before register_netdev() is now wrong (maybe always was). -- Cheers, Stephen Rothwell sfr@canb.auug.org.au http://www.canb.auug.org.au/~sfr/ [-- Attachment #2: Type: application/pgp-signature, Size: 490 bytes --] ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: All Applied micro boards are failing with current mainline kernel 2010-11-01 15:36 ` Stephen Rothwell @ 2010-11-01 15:39 ` Josh Boyer 2010-11-01 15:51 ` David Miller 2010-11-01 15:50 ` David Miller 1 sibling, 1 reply; 7+ messages in thread From: Josh Boyer @ 2010-11-01 15:39 UTC (permalink / raw) To: Stephen Rothwell Cc: Eric Dumazet, Rupjyoti Sarmah, linux-kernel, linuxppc-dev, David S. Miller, Tom Herbert On Mon, Nov 1, 2010 at 11:36 AM, Stephen Rothwell <sfr@canb.auug.org.au> wr= ote: > Hi Josh, > > On Mon, 1 Nov 2010 11:05:53 -0400 Josh Boyer <jwboyer@gmail.com> wrote: >> >> A few hints would be appreciated. > > Remove the call to netif_stop_queue() from emac_probe(). =A0Apparently, > calling this before register_netdev() is now wrong (maybe always was). Yeah, I just discovered that myself. I'm wondering 1) why we do that in that function? 2) If it needs to be removed entirely, or moved to after the register_netdev call 3) If the call to netif_carrier_off also needs similar attention. I can whip up a patch to remove those calls or move them after the register, but I don't want to do that without knowing which one is "right". josh ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: All Applied micro boards are failing with current mainline kernel 2010-11-01 15:39 ` Josh Boyer @ 2010-11-01 15:51 ` David Miller 2010-11-01 16:14 ` Josh Boyer 0 siblings, 1 reply; 7+ messages in thread From: David Miller @ 2010-11-01 15:51 UTC (permalink / raw) To: jwboyer; +Cc: sfr, eric.dumazet, rsarmah, linux-kernel, linuxppc-dev, therbert From: Josh Boyer <jwboyer@gmail.com> Date: Mon, 1 Nov 2010 11:39:47 -0400 > On Mon, Nov 1, 2010 at 11:36 AM, Stephen Rothwell <sfr@canb.auug.org.= au> wrote: >> Hi Josh, >> >> On Mon, 1 Nov 2010 11:05:53 -0400 Josh Boyer <jwboyer@gmail.com> wro= te: >>> >>> A few hints would be appreciated. >> >> Remove the call to netif_stop_queue() from emac_probe(). =A0Apparent= ly, >> calling this before register_netdev() is now wrong (maybe always was= ). > = > Yeah, I just discovered that myself. I'm wondering > = > 1) why we do that in that function? Because likely it was blindly copied from some other driver. > 2) If it needs to be removed entirely, or moved to after the > register_netdev call Removed entirely. > 3) If the call to netif_carrier_off also needs similar attention. Not really. > I can whip up a patch to remove those calls or move them after the > register, but I don't want to do that without knowing which one is > "right". I've already taken care of this. ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: All Applied micro boards are failing with current mainline kernel 2010-11-01 15:51 ` David Miller @ 2010-11-01 16:14 ` Josh Boyer 0 siblings, 0 replies; 7+ messages in thread From: Josh Boyer @ 2010-11-01 16:14 UTC (permalink / raw) To: David Miller Cc: sfr, eric.dumazet, rsarmah, linux-kernel, linuxppc-dev, therbert On Mon, Nov 1, 2010 at 11:51 AM, David Miller <davem@davemloft.net> wrote: >> I can whip up a patch to remove those calls or move them after the >> register, but I don't want to do that without knowing which one is >> "right". > > I've already taken care of this. Thanks! josh ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: All Applied micro boards are failing with current mainline kernel 2010-11-01 15:36 ` Stephen Rothwell 2010-11-01 15:39 ` Josh Boyer @ 2010-11-01 15:50 ` David Miller 1 sibling, 0 replies; 7+ messages in thread From: David Miller @ 2010-11-01 15:50 UTC (permalink / raw) To: sfr; +Cc: eric.dumazet, rsarmah, linux-kernel, linuxppc-dev, therbert From: Stephen Rothwell <sfr@canb.auug.org.au> Date: Tue, 2 Nov 2010 02:36:50 +1100 > Hi Josh, > > On Mon, 1 Nov 2010 11:05:53 -0400 Josh Boyer <jwboyer@gmail.com> wrote: >> >> A few hints would be appreciated. > > Remove the call to netif_stop_queue() from emac_probe(). Apparently, > calling this before register_netdev() is now wrong (maybe always was). Right. I'll add this to net-2.6 -------------------- ibm_newemac: Remove netif_stop_queue() in emac_probe(). Touching the queue state before register_netdev is not allowed, and besides the queue state before ->open() is "don't care" Reported-by: Josh Boyer <jwboyer@gmail.com> Reported-by: Stephen Rothwell <sfr@canb.auug.org.au> Signed-off-by: David S. Miller <davem@davemloft.net> --- drivers/net/ibm_newemac/core.c | 1 - 1 files changed, 0 insertions(+), 1 deletions(-) diff --git a/drivers/net/ibm_newemac/core.c b/drivers/net/ibm_newemac/core.c index 385dc32..06bb9b7 100644 --- a/drivers/net/ibm_newemac/core.c +++ b/drivers/net/ibm_newemac/core.c @@ -2871,7 +2871,6 @@ static int __devinit emac_probe(struct platform_device *ofdev, SET_ETHTOOL_OPS(ndev, &emac_ethtool_ops); netif_carrier_off(ndev); - netif_stop_queue(ndev); err = register_netdev(ndev); if (err) { -- 1.7.3.2 ^ permalink raw reply related [flat|nested] 7+ messages in thread
end of thread, other threads:[~2010-11-01 16:14 UTC | newest] Thread overview: 7+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- [not found] <Act3331keWjsN/sHTZGScnUum3AwDw==> [not found] ` <6631e123845856ac20423c25e07844c8@mail.gmail.com> 2010-11-01 13:22 ` All Applied micro boards are failing with current mainline kernel Josh Boyer 2010-11-01 15:05 ` Josh Boyer 2010-11-01 15:36 ` Stephen Rothwell 2010-11-01 15:39 ` Josh Boyer 2010-11-01 15:51 ` David Miller 2010-11-01 16:14 ` Josh Boyer 2010-11-01 15:50 ` David Miller
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).