From: Stefan Richter <stefanr@s5r6.in-berlin.de>
To: linux-kernel@vger.kernel.org
Cc: linux1394-user@lists.sourceforge.net,
linux1394-devel@lists.sourceforge.net
Subject: Silent data corruption with kernel 3.4 and FireWire disks
Date: Sat, 2 Jun 2012 11:55:11 +0200 [thread overview]
Message-ID: <20120602115511.5ad979db@stein> (raw)
In-Reply-To: <20120524224447.57a636f7@stein>
About a week ago I noticed silent data corruptions of files on FireWire
disks: Mount disk, read lots of data and e.g. compute their md5sum,
unmount disk, mount disk again, read and md5sum the same files again ->
MD5s may differ.
Defects in files that were written in May hint that not only reading from
but also writing to FireWire disks resulted in corrupt data. This was
silent corruption without any error messages from the PCI, firewire, SCSI,
block, or filesystem subsystems.
Affected:
- kernel 3.4
- kernel 3.4-rc5
Not affected:
- kernel 3.3.1 (which I have been running now for the last 6 days)
I used these three kernels with the same patchlevel of FireWire drivers,
namely circa those which are about to be released in 3.5-rc1. FireWire
disks with different 1394-to-SATA or -IDE bridge chips are affected. I
noticed the problem at first on an Agere FW643e PCIe 1394 controller which
sits behind a PLX PEX 8505 PCIe switch.
MPEG2TS video reception through the same 1394 controller and PCIe switch
did never show a noticable sign of corruption.
I did not have time yet to systematically test
- whether all of my FireWire controllers are affected,
- whether SATA or USB disks are affected (SATA probably not, USB not
used yet),
- whether my secondary Linux PC is affected.
Kernel 3.4 and 3.4-rc5 exhibited another (seemingly harmless but
suspicious) issue on my primary PC: frequent transmit queue time-outs of
an RTL8111/8168B Ethernet interface,
http://www.spinics.net/lists/netdev/msg197032.html
Being busy at work lately and not having Linux available at work, I will
be slow to look further into it. With enough spare time, it should be
possible to identify the regression by bisection between kernel 3.3 and
3.4-rc but I have no estimate when I will be able to spend that time.
--
Stefan Richter
-=====-===-- -==- ---=-
http://arcgraph.de/sr/
next parent reply other threads:[~2012-06-02 9:55 UTC|newest]
Thread overview: 2+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <20120524224447.57a636f7@stein>
2012-06-02 9:55 ` Stefan Richter [this message]
[not found] <mailman.408825.1338838130.11409.linux1394-devel@lists.sourceforge.net>
2012-06-05 0:02 ` Silent data corruption with kernel 3.4 and FireWire disks Jonathan Woithe
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20120602115511.5ad979db@stein \
--to=stefanr@s5r6.in-berlin.de \
--cc=linux-kernel@vger.kernel.org \
--cc=linux1394-devel@lists.sourceforge.net \
--cc=linux1394-user@lists.sourceforge.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox