From: Greg KH <gregkh@suse.de>
To: Daniel Mack <daniel@caiaq.de>
Cc: LKML <linux-kernel@vger.kernel.org>,
AKML <linux-arm-kernel@lists.infradead.org>,
linux-wireless@vger.kernel.org,
Sven Neumann <s.neumann@raumfeld.com>
Subject: Re: Memory corruption with 2.6.32.10, but not with 2.6.34-rc3
Date: Thu, 1 Apr 2010 09:50:56 -0700 [thread overview]
Message-ID: <20100401165056.GA1138@suse.de> (raw)
In-Reply-To: <20100401132156.GJ30807@buzzloop.caiaq.de>
On Thu, Apr 01, 2010 at 03:21:56PM +0200, Daniel Mack wrote:
> Hi,
>
> we observed repeated occurances of memory corruptions (Ooopes somewhere
> deep down in the memory mangement code) on ARM PXA300 based boards.
>
> The systems we see this on (arch/arm/mach-pxa/raumfeld.c) feature a
> libertas chipset for WiFi, an ethernet controller (smsc9220), a USB
> fullspeed host, and NAND flash which is used as UBIFS storage.
>
> Currently, these boards run a 2.6.32.10 kernel. After collecting
> evidences for a week or so about when and how and why the memory
> corruptions happen, I tried a 2.6.34-rc3 today and the issue seems fixed
> there. So - appearantly some important fix since 2.6.32 didn't get
> enough care to be backported to the stable branch.
>
> The bug is rather hard to trigger. What I currently do is: after the
> system booted from NAND (UBIFS root partition), I wait for the WPA2
> secured WiFi link to get active and then download a file (~8MB) over
> WiFi to local storage. This download is done in an endless loop. Once in
> a while this crashes the 2.6.32.10 kernel instantly, sometimes it takes
> up to ~5hrs to happen.
>
> Some findings I collected over the last weeks:
>
> - when calling wget with '-O /dev/null' to not write any file
> -> does NOT crash
>
> - downloading via Ethernet instead of WiFi
> -> does NOT crash
>
> - writing the file to either a tmpfs parition or a fatfs (on USB
> connected external media)
> -> DOES still crash (so it is most likely not an UBIFS issue)
>
> - passing --download-rate=50000 to wget (to limit the traffic
> thruput to 50kb/s) _in_creases the probability of the crash
>
> - running userspace applications which heavily allocate and
> deallocate memory doesn't seem to make the bug more likely or
> unlikely
>
> So my current summary is that this is related to WiFi, but OTOH it still
> only happens when file system traffic is issued.
>
> We would like to have a fix for this annoying bug in the stable series
> (especially 2.6.32.x) as well, but I don't have much ideas about where
> to search for it. Hence, I would appreciate if maintainers could think
> about any possible commits in the described time window which haven't
> reached stable. Does the description ring anyone's bell?
I can't think of any USB specific patches that would be related to this,
sorry.
good luck,
greg k-h
WARNING: multiple messages have this Message-ID (diff)
From: gregkh@suse.de (Greg KH)
To: linux-arm-kernel@lists.infradead.org
Subject: Memory corruption with 2.6.32.10, but not with 2.6.34-rc3
Date: Thu, 1 Apr 2010 09:50:56 -0700 [thread overview]
Message-ID: <20100401165056.GA1138@suse.de> (raw)
In-Reply-To: <20100401132156.GJ30807@buzzloop.caiaq.de>
On Thu, Apr 01, 2010 at 03:21:56PM +0200, Daniel Mack wrote:
> Hi,
>
> we observed repeated occurances of memory corruptions (Ooopes somewhere
> deep down in the memory mangement code) on ARM PXA300 based boards.
>
> The systems we see this on (arch/arm/mach-pxa/raumfeld.c) feature a
> libertas chipset for WiFi, an ethernet controller (smsc9220), a USB
> fullspeed host, and NAND flash which is used as UBIFS storage.
>
> Currently, these boards run a 2.6.32.10 kernel. After collecting
> evidences for a week or so about when and how and why the memory
> corruptions happen, I tried a 2.6.34-rc3 today and the issue seems fixed
> there. So - appearantly some important fix since 2.6.32 didn't get
> enough care to be backported to the stable branch.
>
> The bug is rather hard to trigger. What I currently do is: after the
> system booted from NAND (UBIFS root partition), I wait for the WPA2
> secured WiFi link to get active and then download a file (~8MB) over
> WiFi to local storage. This download is done in an endless loop. Once in
> a while this crashes the 2.6.32.10 kernel instantly, sometimes it takes
> up to ~5hrs to happen.
>
> Some findings I collected over the last weeks:
>
> - when calling wget with '-O /dev/null' to not write any file
> -> does NOT crash
>
> - downloading via Ethernet instead of WiFi
> -> does NOT crash
>
> - writing the file to either a tmpfs parition or a fatfs (on USB
> connected external media)
> -> DOES still crash (so it is most likely not an UBIFS issue)
>
> - passing --download-rate=50000 to wget (to limit the traffic
> thruput to 50kb/s) _in_creases the probability of the crash
>
> - running userspace applications which heavily allocate and
> deallocate memory doesn't seem to make the bug more likely or
> unlikely
>
> So my current summary is that this is related to WiFi, but OTOH it still
> only happens when file system traffic is issued.
>
> We would like to have a fix for this annoying bug in the stable series
> (especially 2.6.32.x) as well, but I don't have much ideas about where
> to search for it. Hence, I would appreciate if maintainers could think
> about any possible commits in the described time window which haven't
> reached stable. Does the description ring anyone's bell?
I can't think of any USB specific patches that would be related to this,
sorry.
good luck,
greg k-h
next prev parent reply other threads:[~2010-04-01 16:52 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-04-01 13:21 Memory corruption with 2.6.32.10, but not with 2.6.34-rc3 Daniel Mack
2010-04-01 13:21 ` Daniel Mack
2010-04-01 16:50 ` Greg KH [this message]
2010-04-01 16:50 ` Greg KH
2010-04-01 17:00 ` Daniel Mack
2010-04-01 17:00 ` Daniel Mack
2010-04-01 16:51 ` Greg KH
2010-04-01 16:51 ` Greg KH
2010-04-01 16:58 ` Daniel Mack
2010-04-01 16:58 ` Daniel Mack
2010-04-05 10:59 ` Uwe Kleine-König
2010-04-05 10:59 ` Uwe Kleine-König
2010-04-01 17:29 ` Anders Grafström
2010-04-01 17:29 ` Anders Grafström
2010-04-01 17:57 ` Daniel Mack
2010-04-01 17:57 ` Daniel Mack
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20100401165056.GA1138@suse.de \
--to=gregkh@suse.de \
--cc=daniel@caiaq.de \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-wireless@vger.kernel.org \
--cc=s.neumann@raumfeld.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.