Re: Strange read data corruption on ext4/LVM/md

linux-ide.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* Re: Strange read data corruption on ext4/LVM/md
       [not found]     ` <20100519233408.7436bd9b@mjolnir.ossman.eu>
@ 2010-05-20  7:14       ` Pierre Ossman
  2010-05-20  8:57         ` Tejun Heo
  0 siblings, 1 reply; 9+ messages in thread
From: Pierre Ossman @ 2010-05-20  7:14 UTC (permalink / raw)
  To: linux-ide, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 1589 bytes --]

(adding linux-ide)

On Wed, 19 May 2010 23:34:08 +0200
Pierre Ossman <pierre@ossman.eu> wrote:

> 
> I'm mostly talking to myself at this point, but one thing that occurs
> to me here is 4096 sectors line up decently with the numbers above.
> 0x380 is just at the end of a 512 byte sector, and 0xf80 is just at the
> end of a 4096 byte one. Not sure it's relevant, but then again I've
> stayed blissfully unaware of how this sector size transformation is
> going to happen. :)
> 

Ignore the above. Math is hard.

I did some more testing though, and this might be a low level issue. I
did the following multiple times:

# dd if=/dev/sde skip=4k bs=4M count=500 | md5sum

And the results were:

13aa29adcd16f8d0faf3cb5c39f43826
d1e3df33c0b0d03c61f880a8f2bb6cfb
13aa29adcd16f8d0faf3cb5c39f43826
13aa29adcd16f8d0faf3cb5c39f43826
13aa29adcd16f8d0faf3cb5c39f43826
13aa29adcd16f8d0faf3cb5c39f43826
7a746328b60a63b76847c3e1319a8534
13aa29adcd16f8d0faf3cb5c39f43826

Note that this is a live system, so there is some chance that something
wrote to than area, then restored it to the previous state. I'm not
sure how likely that is.

If not, then it would seem that this is a problem in either the disks,
the controller or the controller driver. The components are WD
WD1002FAEX, sil3132 and sata_sil24 respectively.

Rgds
-- 
     -- Pierre Ossman

  WARNING: This correspondence is being monitored by FRA, a
  Swedish intelligence agency. Make sure your server uses
  encryption for SMTP traffic and consider using PGP for
  end-to-end encryption.

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 198 bytes --]

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Strange read data corruption on ext4/LVM/md
  2010-05-20  7:14       ` Strange read data corruption on ext4/LVM/md Pierre Ossman
@ 2010-05-20  8:57         ` Tejun Heo
  2010-05-20  9:29           ` Pierre Ossman
  0 siblings, 1 reply; 9+ messages in thread
From: Tejun Heo @ 2010-05-20  8:57 UTC (permalink / raw)
  To: Pierre Ossman; +Cc: linux-ide, linux-kernel

On 05/20/2010 09:14 AM, Pierre Ossman wrote:
> Note that this is a live system, so there is some chance that something
> wrote to than area, then restored it to the previous state. I'm not
> sure how likely that is.
> 
> If not, then it would seem that this is a problem in either the disks,
> the controller or the controller driver. The components are WD
> WD1002FAEX, sil3132 and sata_sil24 respectively.

There is a report that sil3124/32 recognizes FIS corruption but keeps
using the payload anyway thus leading to data corruption when the bus
condition on pci-e side isn't ideal.  Does moving the controller to
different slot make difference?

Thanks.

-- 
tejun

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Strange read data corruption on ext4/LVM/md
  2010-05-20  8:57         ` Tejun Heo
@ 2010-05-20  9:29           ` Pierre Ossman
  2010-05-20  9:42             ` Tejun Heo
  0 siblings, 1 reply; 9+ messages in thread
From: Pierre Ossman @ 2010-05-20  9:29 UTC (permalink / raw)
  To: Tejun Heo; +Cc: linux-ide, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 1822 bytes --]

On Thu, 20 May 2010 10:57:29 +0200
Tejun Heo <tj@kernel.org> wrote:

> On 05/20/2010 09:14 AM, Pierre Ossman wrote:
> > Note that this is a live system, so there is some chance that something
> > wrote to than area, then restored it to the previous state. I'm not
> > sure how likely that is.
> > 
> > If not, then it would seem that this is a problem in either the disks,
> > the controller or the controller driver. The components are WD
> > WD1002FAEX, sil3132 and sata_sil24 respectively.
> 
> There is a report that sil3124/32 recognizes FIS corruption but keeps
> using the payload anyway thus leading to data corruption when the bus
> condition on pci-e side isn't ideal.  Does moving the controller to
> different slot make difference?
> 

The machine is rather crammed right now, with one controller in each of
the three available pci-e slots (5 disks). I am running continuous tests
on the disks right now though to see if the problems is on all disks or
just some. If just one slot is causing problems then we should see some
results there.

When you say FIS corruption, do you mean corruption in the sense of
randomly flipped bits? I don't know if you saw the first couple of
mails (before linux-ide was added), but the problem is data being moved
around, not just randomly changed.

Another note is that the problem seems to worsen under load. I'm
running the dd thing in the background, which seems to make read errors
more common on my test files on the filesystem level.

I also tried disabling NCQ without any noticeable change.

Rgds
-- 
     -- Pierre Ossman

  WARNING: This correspondence is being monitored by FRA, a
  Swedish intelligence agency. Make sure your server uses
  encryption for SMTP traffic and consider using PGP for
  end-to-end encryption.

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 198 bytes --]

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Strange read data corruption on ext4/LVM/md
  2010-05-20  9:29           ` Pierre Ossman
@ 2010-05-20  9:42             ` Tejun Heo
  2010-05-20 10:22               ` Pierre Ossman
  0 siblings, 1 reply; 9+ messages in thread
From: Tejun Heo @ 2010-05-20  9:42 UTC (permalink / raw)
  To: Pierre Ossman; +Cc: linux-ide, linux-kernel

Hello,

On 05/20/2010 11:29 AM, Pierre Ossman wrote:
> The machine is rather crammed right now, with one controller in each of
> the three available pci-e slots (5 disks). I am running continuous tests
> on the disks right now though to see if the problems is on all disks or
> just some. If just one slot is causing problems then we should see some
> results there.

I see.

> When you say FIS corruption, do you mean corruption in the sense of

Oh, not FIS, FIS is the name for SATA packets.  I meant the PCI-e
packets.  How were they called... yeap TLPs.

> randomly flipped bits? I don't know if you saw the first couple of
> mails (before linux-ide was added), but the problem is data being moved
> around, not just randomly changed.

I ony saw your previous posting.  TLP corruption can happen during
command setup phase and bit flipping in the command address part is
definitely possible, so reads and writes can be headed at wrong places
in both memory and disk.  I don't know whether this would fit your
symptom tho.

> Another note is that the problem seems to worsen under load. I'm
> running the dd thing in the background, which seems to make read errors
> more common on my test files on the filesystem level.

It would be great if you can try a different controller in similar
setup.  But please keep trying to narrow down the problem and if
possible please remove filesystem from the stack and test against the
block device directly.

Thanks.

-- 
tejun

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Strange read data corruption on ext4/LVM/md
  2010-05-20  9:42             ` Tejun Heo
@ 2010-05-20 10:22               ` Pierre Ossman
  2010-05-20 14:00                 ` Pierre Ossman
  0 siblings, 1 reply; 9+ messages in thread
From: Pierre Ossman @ 2010-05-20 10:22 UTC (permalink / raw)
  To: Tejun Heo; +Cc: linux-ide, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 3416 bytes --]

On Thu, 20 May 2010 11:42:29 +0200
Tejun Heo <tj@kernel.org> wrote:

> > randomly flipped bits? I don't know if you saw the first couple of
> > mails (before linux-ide was added), but the problem is data being moved
> > around, not just randomly changed.
> 
> I ony saw your previous posting.  TLP corruption can happen during
> command setup phase and bit flipping in the command address part is
> definitely possible, so reads and writes can be headed at wrong places
> in both memory and disk.  I don't know whether this would fit your
> symptom tho.
> 

Ah. Here's the problem description from a previous mail:

The corruption is 104 bytes. Somewhat odd number. I would have expected
something more fundamental like a sector or a page.

The data in question seems to come from another part of the file.

The shifts are 015d1380 => 015d0f80 (-1024 bytes) and 02210380 =>
0220ff80 (also -1024 bytes). At least the offset is a nice, sane power
of two number.

Noteworthy is also that the last three nibbles of the corruption are
always the same (xxxxx380 => xxxxxf80).

</recap>

Note that the above analysis is from files, so it involves the entire
stack. I've since focused on raw disks. See below.

> > Another note is that the problem seems to worsen under load. I'm
> > running the dd thing in the background, which seems to make read errors
> > more common on my test files on the filesystem level.
> 
> It would be great if you can try a different controller in similar
> setup.

I only stock sil3132 cards as those are the only decent add-on cards
I've found. AHCI stuff all seems to be onboard.

> But please keep trying to narrow down the problem and if
> possible please remove filesystem from the stack and test against the
> block device directly.

That's what I've been doing the last couple of runs. From a previous
mail:

I did some more testing though, and this might be a low level issue. I
did the following multiple times:

# dd if=/dev/sde skip=4k bs=4M count=500 | md5sum

And the results were:

13aa29adcd16f8d0faf3cb5c39f43826
d1e3df33c0b0d03c61f880a8f2bb6cfb
13aa29adcd16f8d0faf3cb5c39f43826
13aa29adcd16f8d0faf3cb5c39f43826
13aa29adcd16f8d0faf3cb5c39f43826
13aa29adcd16f8d0faf3cb5c39f43826
7a746328b60a63b76847c3e1319a8534
13aa29adcd16f8d0faf3cb5c39f43826

</recap2>

Since the amount of data is much larger here and the incidents more
rare, I haven't been able to confirm that the corruption is identical
to what I've seen in the files. I'm working on the assumption that it
is...

I've since constructed a script that keeps re-running the above over
all relevant disks and keeps track of how many unique md5 values we
get. It's been running for about 1.5 hours right now, and here are the
results so far:

sdd - 3, sde - 4, sdf - 1, sdb - 1, sdc - 1, 

sdd and sde are both on the same controller, so the problem you
mentioned could be relevant.

I'll let the test run for a few more hours and try moving things off
that controller later tonight.

Thanks for looking at this. Unstable data storage is one of those
things that can keep you up at night. :/

Rgds
-- 
     -- Pierre Ossman

  WARNING: This correspondence is being monitored by FRA, a
  Swedish intelligence agency. Make sure your server uses
  encryption for SMTP traffic and consider using PGP for
  end-to-end encryption.

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 198 bytes --]

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Strange read data corruption on ext4/LVM/md
  2010-05-20 10:22               ` Pierre Ossman
@ 2010-05-20 14:00                 ` Pierre Ossman
  2010-05-20 16:28                   ` Pierre Ossman
  0 siblings, 1 reply; 9+ messages in thread
From: Pierre Ossman @ 2010-05-20 14:00 UTC (permalink / raw)
  To: Tejun Heo; +Cc: linux-ide, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 930 bytes --]

On Thu, 20 May 2010 12:22:27 +0200
Pierre Ossman <pierre-list@ossman.eu> wrote:

> 
> I've since constructed a script that keeps re-running the above over
> all relevant disks and keeps track of how many unique md5 values we
> get. It's been running for about 1.5 hours right now, and here are the
> results so far:
> 
> sdd - 3, sde - 4, sdf - 1, sdb - 1, sdc - 1, 
> 
> sdd and sde are both on the same controller, so the problem you
> mentioned could be relevant.
> 
> I'll let the test run for a few more hours and try moving things off
> that controller later tonight.
> 

The trend continues and the numbers are now 9 and 8 for sdd and sde and
1 for the others.

Rgds
-- 
     -- Pierre Ossman

  WARNING: This correspondence is being monitored by FRA, a
  Swedish intelligence agency. Make sure your server uses
  encryption for SMTP traffic and consider using PGP for
  end-to-end encryption.

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 198 bytes --]

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Strange read data corruption on ext4/LVM/md
  2010-05-20 14:00                 ` Pierre Ossman
@ 2010-05-20 16:28                   ` Pierre Ossman
  2010-07-15 19:38                     ` Pierre Ossman
  0 siblings, 1 reply; 9+ messages in thread
From: Pierre Ossman @ 2010-05-20 16:28 UTC (permalink / raw)
  To: Tejun Heo; +Cc: linux-ide, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 1268 bytes --]

On Thu, 20 May 2010 16:00:49 +0200
Pierre Ossman <pierre-list@ossman.eu> wrote:

> On Thu, 20 May 2010 12:22:27 +0200
> Pierre Ossman <pierre-list@ossman.eu> wrote:
> 
> > 
> > I've since constructed a script that keeps re-running the above over
> > all relevant disks and keeps track of how many unique md5 values we
> > get. It's been running for about 1.5 hours right now, and here are the
> > results so far:
> > 
> > sdd - 3, sde - 4, sdf - 1, sdb - 1, sdc - 1, 
> > 
> > sdd and sde are both on the same controller, so the problem you
> > mentioned could be relevant.
> > 
> > I'll let the test run for a few more hours and try moving things off
> > that controller later tonight.
> > 
> 
> The trend continues and the numbers are now 9 and 8 for sdd and sde and
> 1 for the others.
> 

Controller now taken out of the picture and I no longer see any errors
on either filesystem nor direct disk level.

I have a sil3132 of a different OEM brand here. I'll try that in that
pci-e slot next.

Rgds
-- 
     -- Pierre Ossman

  WARNING: This correspondence is being monitored by FRA, a
  Swedish intelligence agency. Make sure your server uses
  encryption for SMTP traffic and consider using PGP for
  end-to-end encryption.

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 198 bytes --]

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Strange read data corruption on ext4/LVM/md
  2010-05-20 16:28                   ` Pierre Ossman
@ 2010-07-15 19:38                     ` Pierre Ossman
  2011-01-16 14:01                       ` Pierre Ossman
  0 siblings, 1 reply; 9+ messages in thread
From: Pierre Ossman @ 2010-07-15 19:38 UTC (permalink / raw)
  Cc: Tejun Heo, linux-ide, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 1149 bytes --]

On Thu, 20 May 2010 18:28:52 +0200
Pierre Ossman <pierre-list@ossman.eu> wrote:

> 
> Controller now taken out of the picture and I no longer see any errors
> on either filesystem nor direct disk level.
> 
> I have a sil3132 of a different OEM brand here. I'll try that in that
> pci-e slot next.
> 

Status update:

1. Putting the other sil3132 controller did not make any problems
appear.

2. Next I swapped that controller with the one next to it to test if
this specific board design in combination with this specific slot was
causing the issues. Still no corruption.

3. Today I put the "bad" controller back in, but in a different slot.
Still not seeing any issues.

I could move it back to the inital slot to see if the problem
reappears, but I'm just happy the system is error free again so I'll
write this off as a temporary glitch or a bad combination of card and
slot.

Rgds
-- 
     -- Pierre Ossman

  WARNING: This correspondence is being monitored by FRA, a
  Swedish intelligence agency. Make sure your server uses
  encryption for SMTP traffic and consider using PGP for
  end-to-end encryption.

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 230 bytes --]

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Strange read data corruption on ext4/LVM/md
  2010-07-15 19:38                     ` Pierre Ossman
@ 2011-01-16 14:01                       ` Pierre Ossman
  0 siblings, 0 replies; 9+ messages in thread
From: Pierre Ossman @ 2011-01-16 14:01 UTC (permalink / raw)
  To: linux-ide; +Cc: Tejun Heo, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 1413 bytes --]

On Thu, 15 Jul 2010 21:38:01 +0200
Pierre Ossman <pierre-list@ossman.eu> wrote:

> 
> Status update:
> 
> 1. Putting the other sil3132 controller did not make any problems
> appear.
> 
> 2. Next I swapped that controller with the one next to it to test if
> this specific board design in combination with this specific slot was
> causing the issues. Still no corruption.
> 
> 3. Today I put the "bad" controller back in, but in a different slot.
> Still not seeing any issues.
> 
> I could move it back to the inital slot to see if the problem
> reappears, but I'm just happy the system is error free again so I'll
> write this off as a temporary glitch or a bad combination of card and
> slot.
> 

Time for an update to this somewhat old thread. The problem has
remained, but being so very rare that I cannot reproduce it often
enough for systematic testing. Both motherboard and the "bad"
controller have been swapped out with different (but identical) cards.

These days there are some controller cards based on Marvell's 88SE9128,
which seems to be a decent chip. So the next step is chucking all the
sil3132 stuff in favour of those...

Rgds
-- 
     -- Pierre Ossman

  WARNING: This correspondence is being monitored by FRA, a
  Swedish intelligence agency. Make sure your server uses
  encryption for SMTP traffic and consider using PGP for
  end-to-end encryption.

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 230 bytes --]

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2011-01-16 14:07 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <20100519225653.1fedb453@mjolnir.ossman.eu>
     [not found] ` <20100519230426.47c6c1ed@mjolnir.ossman.eu>
     [not found]   ` <20100519232906.3be82279@mjolnir.ossman.eu>
     [not found]     ` <20100519233408.7436bd9b@mjolnir.ossman.eu>
2010-05-20  7:14       ` Strange read data corruption on ext4/LVM/md Pierre Ossman
2010-05-20  8:57         ` Tejun Heo
2010-05-20  9:29           ` Pierre Ossman
2010-05-20  9:42             ` Tejun Heo
2010-05-20 10:22               ` Pierre Ossman
2010-05-20 14:00                 ` Pierre Ossman
2010-05-20 16:28                   ` Pierre Ossman
2010-07-15 19:38                     ` Pierre Ossman
2011-01-16 14:01                       ` Pierre Ossman

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).