All of lore.kernel.org
 help / color / mirror / Atom feed
From: Ric Wheeler <rwheeler@redhat.com>
To: Mikulas Patocka <mpatocka@redhat.com>
Cc: device-mapper development <dm-devel@redhat.com>,
	ak@linux.intel.com,
	Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
	"MASON, CHRISTOPHER" <CHRIS.MASON@oracle.com>,
	Jens Axboe <jens.axboe@oracle.com>
Subject: Re: Barriers still not passing on simple dm devices...
Date: Thu, 09 Apr 2009 06:48:10 -0400	[thread overview]
Message-ID: <49DDD26A.2090901@redhat.com> (raw)
In-Reply-To: <Pine.LNX.4.64.0904080844550.28196@hs20-bc2-1.build.redhat.com>

Mikulas Patocka wrote:
>> And I will restate that back at EMC we tested the original barriers (with
>> reiserfs mostly, a bit on ext3 and ext2) and saw significant reduction in file
>> system integrity issues after power loss.
>>     
>
> You saw that barrier-enabled filesystem was worse than the same filesystem 
> without barriers? And what kind of issues were that? Disks writing damaged 
> sectors if powered-off in the middle of the writes? Or data corruptions 
> due to bugs in ReiserFS?
>   

No - I was not being clear. We saw a reduction in issues which is a 
confusing way to say that it was significantly better with barriers 
enabled, for both ext3 & reiserfs.

>   
>> The vantage point I had at EMC while testing and deploying the original
>> barrier work done by Jens and Chris was pretty unique - full ability to do
>> root cause failures of any component when really needed, a huge installed base
>> which could send information home on a regular basis about crashes/fsck
>> instances/etc and the ability (with customer permission) to dial into any box
>> and diagnose issues remotely. Not to mention access to drive vendors to
>> pressure them to make the flushes more robust. The application was also able
>> to validate that all acknowledged writes were consistent.
>>
>> Barriers do work as we have them, but as others have mentioned, it is not a
>> "free" win - fsync will actually move your data safely out to persistent
>> storage for a huge percentage of real users (including every ATA/S-ATA and SAS
>> drive I was able to test).  The file systems I monitored in production use
>> without barriers were much less reliable.
>>     
>
> With write cache or without write cache?
>   
Write cache enabled.

Barriers are off when write cache is disabled - we probe the drives 
write cache and enable barriers at mount time if and only if the 
barriers are on.
> With cache and without barriers the system is violating the specification. 
> There just could be data corruption ... and it will eventually happen.
>
> If you got corruption without cache and without barriers, there's a bug 
> and it needs to be investigated.
>
>   
>> As others have noted, some storage does not need barriers or flushed (high end
>> arrays, drives with no volatile write cache) and some need it but stink (low
>> cost USB flash sticks?) so warning is a good thing to do...
>>
>> ric
>>     
>
> Mikulas
>   

WARNING: multiple messages have this Message-ID (diff)
From: Ric Wheeler <rwheeler@redhat.com>
To: Mikulas Patocka <mpatocka@redhat.com>
Cc: Jens Axboe <jens.axboe@oracle.com>,
	device-mapper development <dm-devel@redhat.com>,
	Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
	ak@linux.intel.com, "MASON, CHRISTOPHER" <CHRIS.MASON@oracle.com>
Subject: Re: [dm-devel] Barriers still not passing on simple dm devices...
Date: Thu, 09 Apr 2009 06:48:10 -0400	[thread overview]
Message-ID: <49DDD26A.2090901@redhat.com> (raw)
In-Reply-To: <Pine.LNX.4.64.0904080844550.28196@hs20-bc2-1.build.redhat.com>

Mikulas Patocka wrote:
>> And I will restate that back at EMC we tested the original barriers (with
>> reiserfs mostly, a bit on ext3 and ext2) and saw significant reduction in file
>> system integrity issues after power loss.
>>     
>
> You saw that barrier-enabled filesystem was worse than the same filesystem 
> without barriers? And what kind of issues were that? Disks writing damaged 
> sectors if powered-off in the middle of the writes? Or data corruptions 
> due to bugs in ReiserFS?
>   

No - I was not being clear. We saw a reduction in issues which is a 
confusing way to say that it was significantly better with barriers 
enabled, for both ext3 & reiserfs.

>   
>> The vantage point I had at EMC while testing and deploying the original
>> barrier work done by Jens and Chris was pretty unique - full ability to do
>> root cause failures of any component when really needed, a huge installed base
>> which could send information home on a regular basis about crashes/fsck
>> instances/etc and the ability (with customer permission) to dial into any box
>> and diagnose issues remotely. Not to mention access to drive vendors to
>> pressure them to make the flushes more robust. The application was also able
>> to validate that all acknowledged writes were consistent.
>>
>> Barriers do work as we have them, but as others have mentioned, it is not a
>> "free" win - fsync will actually move your data safely out to persistent
>> storage for a huge percentage of real users (including every ATA/S-ATA and SAS
>> drive I was able to test).  The file systems I monitored in production use
>> without barriers were much less reliable.
>>     
>
> With write cache or without write cache?
>   
Write cache enabled.

Barriers are off when write cache is disabled - we probe the drives 
write cache and enable barriers at mount time if and only if the 
barriers are on.
> With cache and without barriers the system is violating the specification. 
> There just could be data corruption ... and it will eventually happen.
>
> If you got corruption without cache and without barriers, there's a bug 
> and it needs to be investigated.
>
>   
>> As others have noted, some storage does not need barriers or flushed (high end
>> arrays, drives with no volatile write cache) and some need it but stink (low
>> cost USB flash sticks?) so warning is a good thing to do...
>>
>> ric
>>     
>
> Mikulas
>   


  reply	other threads:[~2009-04-09 10:48 UTC|newest]

Thread overview: 62+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-03-23 19:04 Barriers still not passing on simple dm devices Eric Sandeen
2009-03-23 19:10 ` Eric Sandeen
2009-03-23 19:10   ` Eric Sandeen
2009-03-24 14:02 ` Mikulas Patocka
2009-03-24 14:02   ` [dm-devel] " Mikulas Patocka
2009-03-24 14:05   ` Jens Axboe
2009-03-24 14:05     ` [dm-devel] " Jens Axboe
2009-03-24 14:26     ` Mikulas Patocka
2009-03-24 14:26       ` [dm-devel] " Mikulas Patocka
2009-03-24 14:30       ` Jens Axboe
2009-03-24 14:30         ` [dm-devel] " Jens Axboe
2009-03-24 14:45         ` Mikulas Patocka
2009-03-24 14:45           ` [dm-devel] " Mikulas Patocka
2009-03-24 15:05           ` Jens Axboe
2009-03-24 15:05             ` [dm-devel] " Jens Axboe
2009-03-25 15:15             ` Mikulas Patocka
2009-03-25 15:15               ` [dm-devel] " Mikulas Patocka
2009-03-25 15:27               ` Jens Axboe
2009-03-25 22:39                 ` Mikulas Patocka
2009-03-25 22:39                   ` [dm-devel] " Mikulas Patocka
2009-03-26  8:42                   ` Jens Axboe
2009-03-26  8:42                     ` [dm-devel] " Jens Axboe
2009-03-31  3:39                     ` Mikulas Patocka
2009-03-31  3:39                       ` [dm-devel] " Mikulas Patocka
2009-03-31 10:49                       ` Jens Axboe
2009-03-31 10:49                         ` [dm-devel] " Jens Axboe
2009-04-02 23:40                         ` Mikulas Patocka
2009-04-02 23:40                           ` [dm-devel] " Mikulas Patocka
2009-04-03  8:11                           ` Jens Axboe
2009-04-03  8:11                             ` [dm-devel] " Jens Axboe
2009-04-04 15:20                             ` Ric Wheeler
2009-04-05  1:28                               ` Theodore Tso
2009-04-05  1:28                                 ` [dm-devel] " Theodore Tso
2009-04-05 11:54                                 ` Ric Wheeler
2009-04-05 11:54                                 ` [dm-devel] " Ric Wheeler
2009-04-06  1:14                                   ` Lee Revell
2009-04-06  1:14                                     ` [dm-devel] " Lee Revell
2009-04-06  1:24                                     ` Ric Wheeler
2009-04-08 12:44                                     ` Mikulas Patocka
2009-04-08 12:44                                       ` [dm-devel] " Mikulas Patocka
2009-04-08 15:16                                       ` Henrique de Moraes Holschuh
2009-04-09  4:22                                     ` Eric Sandeen
2009-04-09  4:22                                       ` [dm-devel] " Eric Sandeen
2009-04-08 12:36                                 ` Mikulas Patocka
2009-04-08 12:36                                   ` [dm-devel] " Mikulas Patocka
2009-04-08 12:54                               ` Mikulas Patocka
2009-04-08 12:54                                 ` [dm-devel] " Mikulas Patocka
2009-04-09 10:48                                 ` Ric Wheeler [this message]
2009-04-09 10:48                                   ` Ric Wheeler
2009-04-08 13:37                             ` Mikulas Patocka
2009-04-08 13:37                               ` [dm-devel] " Mikulas Patocka
2009-04-08 14:06                               ` Jens Axboe
2009-04-08 14:06                                 ` [dm-devel] " Jens Axboe
2009-04-08 23:44                               ` Dave Chinner
2009-04-08 23:44                                 ` [dm-devel] " Dave Chinner
2009-04-09  1:27                               ` Chris Mason
2009-04-09 10:28                                 ` Alasdair G Kergon
2009-04-09 10:28                                   ` [dm-devel] " Alasdair G Kergon
2009-03-26 12:55                   ` Chris Mason
     [not found] <ciXHh-39c-37@gated-at.bofh.it>
2009-03-23 21:52 ` Bodo Eggert
2009-03-23 21:52   ` Bodo Eggert
     [not found] ` <cjfuL-6vJ-43@gated-at.bofh.it>
     [not found]   ` <cjfEl-6J2-45@gated-at.bofh.it>
     [not found]     ` <cjfNX-6Wh-27@gated-at.bofh.it>
2009-03-26 13:05       ` Bodo Eggert

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=49DDD26A.2090901@redhat.com \
    --to=rwheeler@redhat.com \
    --cc=CHRIS.MASON@oracle.com \
    --cc=ak@linux.intel.com \
    --cc=dm-devel@redhat.com \
    --cc=jens.axboe@oracle.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mpatocka@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.