From mboxrd@z Thu Jan 1 00:00:00 1970 From: Gregoire Gentil Subject: Re: TI wl1271 wireless bug with 3.4-rt17 Date: Tue, 04 Sep 2012 18:22:33 -0700 Message-ID: <5046A959.3080904@gentil.com> References: <1310590111-11484-1-git-send-email-u.kleine-koenig@pengutronix.de> <20120902194911.GT28643@pengutronix.de> <50444D2F.3040607@gentil.com> <20120904141138.GA14942@beefymiracle.amer.corp.natinst.com> Reply-To: gregoire@gentil.com Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: QUOTED-PRINTABLE Cc: linux-rt-users@vger.kernel.org To: Josh Cartwright Return-path: Received: from mx1.polytechnique.org ([129.104.30.34]:47054 "EHLO mx1.polytechnique.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751560Ab2IEBbc (ORCPT ); Tue, 4 Sep 2012 21:31:32 -0400 In-Reply-To: <20120904141138.GA14942@beefymiracle.amer.corp.natinst.com> Sender: linux-rt-users-owner@vger.kernel.org List-ID: On 09/04/2012 07:11 AM, Josh Cartwright wrote: > On Sun, Sep 02, 2012 at 11:24:47PM -0700, Gregoire Gentil wrote: >> Hello, >> >> I'm trying to debug a wifi bug with 3.4-rt17 applied, running on an >> OMAP4 ARM board such as Pandaboard. >> >> Wi-Fi works perfectly well without rt patches. It also works quite >> well with rt patches AND without wifi module loaded. But with both >> rt patches and wifi module, the system is very flaky and even if I >> manage to launch a big download, I get a kernel hang. I managed to >> get a trace: >> >> BUG: scheduling while atomic: irq/213-wl12xx/1588/0x00010002 >> Modules linked in: omapdce(C) wl12xx wlcore omaprpc(C) mac80211 d >> [] (unwind_backtrace+0x0/0xf0) from [] (dump) >> [] (dump_stack+0x20/0x24) from [] (__schedul) >> [] (__schedule_bug+0x54/0x60) from [] (__sch) >> [] (__schedule+0x74/0x6c0) from [] (schedule) >> [] (schedule+0xa0/0xb8) from [] (rt_spin_loc) >> [] (rt_spin_lock_slowlock+0x198/0x288) from [> [] (rt_spin_lock+0x18/0x1c) from [] (wl12xx_) >> [] (wl12xx_hardirq+0x2c/0xa4 [wlcore]) from [> [] (handle_irq_event_percpu+0xac/0x24c) from [> [] (handle_irq_event+0x7c/0x9c) from [] (han) >> [] (handle_level_irq+0xe4/0x134) from [] (ge) >> [] (generic_handle_irq+0x34/0x3c) from [] (g) >> [] (gpio_irq_handler+0x160/0x1a4) from [] (g) >> [] (generic_handle_irq+0x34/0x3c) from [] (h) >> [] (handle_IRQ+0x88/0xc8) >> >> Source code including the function wl12xx_hardirq is here: >> http://dev.omapzoom.org/?p=3Dintegration/kernel-ubuntu.git;a=3Dblob;= f=3Ddrivers/net/wireless/ti/wlcore/main.c;h=3D45fe911a6504f92dddff5a941= 5bb77a643b3c4a9;hb=3Df84c72f6b36418ff11d16808c16a7c3216730bb0 >> >> Any idea what could be wrong and how I could debug and fix this situ= ation? > > On first glance, it looks like the driver uses request_threaded_irq()= , > to register its handlers, but is trying to acquire a regular spin_loc= k > in its primary handler. That's bad news, since spin_locks' can > schedule() when contended with CONFIG_PREEMPT_RT. > > And it's not just that, unfortunately, since the primary handler also > complete()s a completion, which also can schedule(). > > It looks like the overall interrupt handling strategy of this driver > probably needs to be revisited. :(. Josh, I really appreciate the answer. Thank you. Though I'm definitely not a=20 RT expert, I really would like to make this work... Could you provide a= =20 little bit more guideline what I should patch to fix this situation?=20 Could I change request_threaded_irq to something else? Would raw_spin* = help? Many thanks in advance for any explanation, Gr=E9goire -- To unsubscribe from this list: send the line "unsubscribe linux-rt-user= s" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html