* [PATCH] xfs: return errors from partial I/O failures to files
@ 2015-08-26 19:06 David Jeffery
From: David Jeffery @ 2015-08-26 19:06 UTC (permalink / raw)
To: xfs
There is an issue with xfs's error reporting in some cases of I/O partially
failing and partially succeeding. Calls like fsync() can report success even
though not all I/O was successful.
The issue can occur when there are multiple bios per xfs_ioend struct.
Each call to xfs_end_bio() for a completing bio will write a value to
ioend->io_error. If a successful bio completes after any failed bio, no
error is reported due to it writing 0 over the error code set by any failed bio.
The I/O error information is now lost and when the ioend is completed
only success is reported back up the filesystem stack.
xfs_end_bio() should only set ioend->io_error in the case of BIO_UPTODATE
being clear. ioend->io_error is initialized to 0 at allocation so only needs
to be updated by any failed bio structs. This ensures an error can be reported
to the application.
Signed-off-by: David Jeffery <djeffery@redhat.com>
---
diff --git a/fs/xfs/xfs_aops.c b/fs/xfs/xfs_aops.c
index 3859f5e..b82b128 100644
--- a/fs/xfs/xfs_aops.c
+++ b/fs/xfs/xfs_aops.c
@@ -356,7 +356,8 @@ xfs_end_bio(
{
xfs_ioend_t *ioend = bio->bi_private;
- ioend->io_error = test_bit(BIO_UPTODATE, &bio->bi_flags) ? 0 : error;
+ if (!test_bit(BIO_UPTODATE, &bio->bi_flags))
+ ioend->io_error = error;
/* Toss bio and pass work off to an xfsdatad thread */
bio->bi_private = NULL;
_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs
* Re: [PATCH] xfs: return errors from partial I/O failures to files
From: Dave Chinner @ 2015-08-26 22:19 UTC (permalink / raw)
To: David Jeffery; +Cc: Eryu Guan, xfs
On Wed, Aug 26, 2015 at 03:06:36PM -0400, David Jeffery wrote:
> There is an issue with xfs's error reporting in some cases of I/O partially
> failing and partially succeeding. Calls like fsync() can report success even
> though not all I/O was successful.
Hi David,
I read your bug report last night and after considering all the work
you put into it, I was going to ask if you wanted to finish off the
job by writing the patch to fix it. But you beat me to it.
Nice work! :)
> The issue can occur when there are multiple bios per xfs_ioend struct.
> Each call to xfs_end_bio() for a completing bio will write a value to
> ioend->io_error. If a successful bio completes after any failed bio, no
> error is reported due to it writing 0 over the error code set by any failed bio.
> The I/O error information is now lost and when the ioend is completed
> only success is reported back up the filesystem stack.
It's worth mentioning the case that this was seen in - a single
failed disk in a raid 0 stripe, and the error from the bio to the
failed disk was overwritten by the successes from the bios to the
other disks.
FWIW, I think that we also need to create an xfstest for this case,
too, because it's clear that this is a big hole in our test coverage
(i.e. partial block device failure). It might be best to talk to
Eryu (cc'd) to get your reproducer converted into a xfstest case
that we can then test all filesystems against?
> xfs_end_bio() should only set ioend->io_error in the case of BIO_UPTODATE
> being clear. ioend->io_error is initialized to 0 at allocation so only needs
> to be updated by any failed bio structs. This ensures an error can be reported
> to the application.
>
> Signed-off-by: David Jeffery <djeffery@redhat.com>
> ---
Best to add a "cc: <stable@vger.kernel.org>" so that it gets pushed
back to all the stable kernels automatically when it hits Linus'
tree.
One minor change to the fix:
> diff --git a/fs/xfs/xfs_aops.c b/fs/xfs/xfs_aops.c
> index 3859f5e..b82b128 100644
> --- a/fs/xfs/xfs_aops.c
> +++ b/fs/xfs/xfs_aops.c
> @@ -356,7 +356,8 @@ xfs_end_bio(
> {
> xfs_ioend_t *ioend = bio->bi_private;
>
> - ioend->io_error = test_bit(BIO_UPTODATE, &bio->bi_flags) ? 0 : error;
> + if (!test_bit(BIO_UPTODATE, &bio->bi_flags))
> + ioend->io_error = error;
We should preserve the original error that was reported, rather than
report the last one. ioend->io_error is always initialised to zero,
so we can simply do:
if (!ioend->io_error && !test_bit(BIO_UPTODATE, &bio->bi_flags))
ioend->io_error = error;
Can you update the patch and resend it? I've got a couple of other
fixes that I need to push to the for-next tree in the next couple of
days (i.e. before the 4.3 merge window opens) and I'd like to get
this one in as well.
Cheers,
Dave.
--
Dave Chinner
david@fromorbit.com
* Re: [PATCH] xfs: return errors from partial I/O failures to files
From: Eryu Guan @ 2015-08-27 6:09 UTC (permalink / raw)
To: Dave Chinner; +Cc: David Jeffery, xfs
On Thu, Aug 27, 2015 at 08:19:52AM +1000, Dave Chinner wrote:
> On Wed, Aug 26, 2015 at 03:06:36PM -0400, David Jeffery wrote:
> > There is an issue with xfs's error reporting in some cases of I/O partially
> > failing and partially succeeding. Calls like fsync() can report success even
> > though not all I/O was successful.
>
> Hi David,
>
> I read your bug report last night and after considering all the work
> you put into it, I was going to ask if you wanted to finish off the
> job by writing the patch to fix it. But you beat me to it.
>
> Nice work! :)
>
> > The issue can occur when there are multiple bios per xfs_ioend struct.
> > Each call to xfs_end_bio() for a completing bio will write a value to
> > ioend->io_error. If a successful bio completes after any failed bio, no
> > error is reported due to it writing 0 over the error code set by any failed bio.
> > The I/O error information is now lost and when the ioend is completed
> > only success is reported back up the filesystem stack.
>
> It's worth mentioning the case that this was seen in - a single
> failed disk in a raid 0 stripe, and the error from the bio to the
> failed disk was overwritten by the successes from the bios to the
> other disks.
>
> FWIW, I think that we also need to create an xfstest for this case,
> too, because it's clear that this is a big hole in our test coverage
> (i.e. partial block device failure). It might be best to talk to
> Eryu (cc'd) to get your reproducer converted into a xfstest case
> that we can then test all filesystems against?
Sure, I'll look into the reproducer and work on a fstests case.
Thanks,
Eryu
* Re: [PATCH] xfs: return errors from partial I/O failures to files
From: David Jeffery @ 2015-08-27 13:41 UTC (permalink / raw)
To: Dave Chinner; +Cc: xfs
On 08/26/2015 06:19 PM, Dave Chinner wrote:
> On Wed, Aug 26, 2015 at 03:06:36PM -0400, David Jeffery wrote:
>> There is an issue with xfs's error reporting in some cases of I/O partially
>> failing and partially succeeding. Calls like fsync() can report success even
>> though not all I/O was successful.
>
> Hi David,
>
> I read your bug report last night and after considering all the work
> you put into it, I was going to ask if you wanted to finish off the
> job by writing the patch to fix it. But you beat me to it.
>
> Nice work! :)
>
>> The issue can occur when there are multiple bios per xfs_ioend struct.
>> Each call to xfs_end_bio() for a completing bio will write a value to
>> ioend->io_error. If a successful bio completes after any failed bio, no
>> error is reported due to it writing 0 over the error code set by any failed bio.
>> The I/O error information is now lost and when the ioend is completed
>> only success is reported back up the filesystem stack.
>
> It's worth mentioning the case that this was seen in - a single
> failed disk in a raid 0 stripe, and the error from the bio to the
> failed disk was overwritten by the successes from the bios to the
> other disks.
>
> FWIW, I think that we also need to create an xfstest for this case,
> too, because it's clear that this is a big hole in our test coverage
> (i.e. partial block device failure). It might be best to talk to
> Eryu (cc'd) to get your reproducer converted into a xfstest case
> that we can then test all filesystems against?
>
>> xfs_end_bio() should only set ioend->io_error in the case of BIO_UPTODATE
>> being clear. ioend->io_error is initialized to 0 at allocation so only needs
>> to be updated by any failed bio structs. This ensures an error can be reported
>> to the application.
>>
>> Signed-off-by: David Jeffery <djeffery@redhat.com>
>> ---
>
> Best to add a "cc: <stable@vger.kernel.org>" so that it gets pushed
> back to all the stable kernels automatically when it hits Linus'
> tree.
>
> One minor change to the fix:
>
>> diff --git a/fs/xfs/xfs_aops.c b/fs/xfs/xfs_aops.c
>> index 3859f5e..b82b128 100644
>> --- a/fs/xfs/xfs_aops.c
>> +++ b/fs/xfs/xfs_aops.c
>> @@ -356,7 +356,8 @@ xfs_end_bio(
>> {
>> xfs_ioend_t *ioend = bio->bi_private;
>>
>> - ioend->io_error = test_bit(BIO_UPTODATE, &bio->bi_flags) ? 0 : error;
>> + if (!test_bit(BIO_UPTODATE, &bio->bi_flags))
>> + ioend->io_error = error;
>
> We should preserve the original error that was reported, rather than
> report the last one. ioend->io_error is always initialised to zero,
> so we can simply do:
Is there a particular reason to prefer the first error? Is it just
standard practice? I originally made a version which preserved the
first error but couldn't think of a reason why the first or last would
be best. So I went with the patch which has the simpler if statement.
>
> if (!ioend->io_error && !test_bit(BIO_UPTODATE, &bio->bi_flags))
> ioend->io_error = error;
>
> Can you update the patch and resend it? I've got a couple of other
> fixes that I need to push to the for-next tree in the next couple of
> days (i.e. before the 4.3 merge window opens) and I'd like to get
> this one in as well.
>
> Cheers,
>
> Dave
>
I'll get a new version sent.
David Jeffery
* Re: [PATCH] xfs: return errors from partial I/O failures to files
From: Dave Chinner @ 2015-08-28 0:20 UTC (permalink / raw)
To: David Jeffery; +Cc: xfs
On Thu, Aug 27, 2015 at 09:41:52AM -0400, David Jeffery wrote:
> On 08/26/2015 06:19 PM, Dave Chinner wrote:
> >> index 3859f5e..b82b128 100644
> >> --- a/fs/xfs/xfs_aops.c
> >> +++ b/fs/xfs/xfs_aops.c
> >> @@ -356,7 +356,8 @@ xfs_end_bio(
> >> {
> >> xfs_ioend_t *ioend = bio->bi_private;
> >>
> >> - ioend->io_error = test_bit(BIO_UPTODATE, &bio->bi_flags) ? 0 : error;
> >> + if (!test_bit(BIO_UPTODATE, &bio->bi_flags))
> >> + ioend->io_error = error;
> >
> > We should preserve the original error that was reported, rather than
> > report the last one. ioend->io_error is always initialised to zero,
> > so we can simply do:
>
> Is there a particular reason to prefer the first error? Is it just
> standard practice? I originally made a version which preserved the
> first error but couldn't think of a reason why the first or last would
> be best. So I went with the patch which has the simpler if statement.
Standard practice in XFS - it's the first error that matters in most
cases. Subsequent errors are usually less meaningful as they are
either a result of the first error (i.e. cascading hardware errors),
tainted by the first error (e.g. first IO completion error shuts
down the fs, second IO completion error caused by the detection of a
shut down filesystem) or just unrelated noise....
Cheers,
Dave.
--
Dave Chinner
david@fromorbit.com