public inbox for linux-xfs@vger.kernel.org
 help / color / mirror / Atom feed
* XFS: false "torn write" errors (preventing mount)
@ 2016-02-29 15:29 Jan Beulich
  2016-02-29 15:57 ` Brian Foster
  0 siblings, 1 reply; 3+ messages in thread
From: Jan Beulich @ 2016-02-29 15:29 UTC (permalink / raw)
  To: bfoster; +Cc: xfs

Brian,

on a system where I routinely run both a 32-bit and a 64-bit x86
kernel (underneath the same 32-bit distro) I'm observing the
newly added message being issued, along with the mounts
subsequently failing when running the 32-bit kernel. Without
doing anything to the FS, running an older 32-bit kernel or a
4.5-rc6 64-bit one have everything work fine (and silently), so
I can only assume the detection logic doesn't work right in a
32-bit kernel. I've looked over commits 6528250b71 and
7088c4136f without being able to spot any obvious word size
dependency, but then again I know nothing about the inner
workings of the XFS code.

I'm now hoping that you have an idea what's going on here.

Thanks, Jan

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: XFS: false "torn write" errors (preventing mount)
  2016-02-29 15:29 XFS: false "torn write" errors (preventing mount) Jan Beulich
@ 2016-02-29 15:57 ` Brian Foster
  2016-02-29 18:08   ` Brian Foster
  0 siblings, 1 reply; 3+ messages in thread
From: Brian Foster @ 2016-02-29 15:57 UTC (permalink / raw)
  To: Jan Beulich; +Cc: xfs

On Mon, Feb 29, 2016 at 08:29:20AM -0700, Jan Beulich wrote:
> Brian,
> 
> on a system where I routinely run both a 32-bit and a 64-bit x86
> kernel (underneath the same 32-bit distro) I'm observing the
> newly added message being issued, along with the mounts
> subsequently failing when running the 32-bit kernel. Without
> doing anything to the FS, running an older 32-bit kernel or a
> 4.5-rc6 64-bit one have everything work fine (and silently), so
> I can only assume the detection logic doesn't work right in a
> 32-bit kernel. I've looked over commits 6528250b71 and
> 7088c4136f without being able to spot any obvious word size
> dependency, but then again I know nothing about the inner
> workings of the XFS code.
> 
> I'm now hoping that you have an idea what's going on here.
> 

There was one follow on fix related to byte order: 8e0bd4925bf6 ("xfs:
fix endianness error when checking log block crc on big endian
platforms"), but I don't think that would have any effect on an x86
kernel.

Is the 32-bit kernel problematic on its own, or must the 64-bit kernel
be involved somehow before the 32-bit kernel reproduces a problem? For
example, can you mkfs, mount and remount (perhaps multiple times) on the
32-bit kernel without a problem? If so, what happens if you transition
to the 64-bit kernel, remount a few times, and then go back to 32-bit?
In general, anything that narrows down the reproducer is helpful.

I don't appear to have a 32-bit env. handy so I'll kick off an install
in the meantime and take a closer look from there...

Brian

> Thanks, Jan
> 
> _______________________________________________
> xfs mailing list
> xfs@oss.sgi.com
> http://oss.sgi.com/mailman/listinfo/xfs

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: XFS: false "torn write" errors (preventing mount)
  2016-02-29 15:57 ` Brian Foster
@ 2016-02-29 18:08   ` Brian Foster
  0 siblings, 0 replies; 3+ messages in thread
From: Brian Foster @ 2016-02-29 18:08 UTC (permalink / raw)
  To: Jan Beulich; +Cc: xfs

On Mon, Feb 29, 2016 at 10:57:52AM -0500, Brian Foster wrote:
> On Mon, Feb 29, 2016 at 08:29:20AM -0700, Jan Beulich wrote:
> > Brian,
> > 
> > on a system where I routinely run both a 32-bit and a 64-bit x86
> > kernel (underneath the same 32-bit distro) I'm observing the
> > newly added message being issued, along with the mounts
> > subsequently failing when running the 32-bit kernel. Without
> > doing anything to the FS, running an older 32-bit kernel or a
> > 4.5-rc6 64-bit one have everything work fine (and silently), so
> > I can only assume the detection logic doesn't work right in a
> > 32-bit kernel. I've looked over commits 6528250b71 and
> > 7088c4136f without being able to spot any obvious word size
> > dependency, but then again I know nothing about the inner
> > workings of the XFS code.
> > 
> > I'm now hoping that you have an idea what's going on here.
> > 
> 
> There was one follow on fix related to byte order: 8e0bd4925bf6 ("xfs:
> fix endianness error when checking log block crc on big endian
> platforms"), but I don't think that would have any effect on an x86
> kernel.
> 
> Is the 32-bit kernel problematic on its own, or must the 64-bit kernel
> be involved somehow before the 32-bit kernel reproduces a problem? For
> example, can you mkfs, mount and remount (perhaps multiple times) on the
> 32-bit kernel without a problem? If so, what happens if you transition
> to the 64-bit kernel, remount a few times, and then go back to 32-bit?
> In general, anything that narrows down the reproducer is helpful.
> 
> I don't appear to have a 32-bit env. handy so I'll kick off an install
> in the meantime and take a closer look from there...
> 

Just a heads up that I've been able to reproduce. What I think might be
going on is that the log is clean, but the log recovery pass looks back
behind the latest unmount record, runs into some records/data written by
the alternate architecture from that which is running, and then fails
due to crc mismatch. The problem doesn't seem to manifest right away,
however, so I could still be missing something here.

Anyways, I'll dig into it and try to come up with a fix. Thanks for the
report!

Brian

> Brian
> 
> > Thanks, Jan
> > 
> > _______________________________________________
> > xfs mailing list
> > xfs@oss.sgi.com
> > http://oss.sgi.com/mailman/listinfo/xfs
> 
> _______________________________________________
> xfs mailing list
> xfs@oss.sgi.com
> http://oss.sgi.com/mailman/listinfo/xfs

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2016-02-29 18:08 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-02-29 15:29 XFS: false "torn write" errors (preventing mount) Jan Beulich
2016-02-29 15:57 ` Brian Foster
2016-02-29 18:08   ` Brian Foster

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox