From: "Darrick J. Wong" <djwong@kernel.org>
To: Amir Goldstein <amir73il@gmail.com>
Cc: Ignat Korchagin <ignat@cloudflare.com>,
Matthew Wilcox <willy@infradead.org>,
Daniel Dao <dqminh@cloudflare.com>,
Dave Chinner <david@fromorbit.com>,
kernel-team <kernel-team@cloudflare.com>,
linux-fsdevel@vger.kernel.org,
Chandan Babu R <chandanrlinux@gmail.com>,
Leah Rumancik <lrumancik@google.com>,
linux-xfs <linux-xfs@vger.kernel.org>,
"Luis R. Rodriguez" <mcgrof@kernel.org>
Subject: Re: Backporting of series xfs/iomap: fix data corruption due to stale cached iomap
Date: Fri, 30 Jun 2023 08:16:57 -0700 [thread overview]
Message-ID: <20230630151657.GJ11441@frogsfrogsfrogs> (raw)
In-Reply-To: <CAOQ4uxju10zrQhVDA5WS+vTSbuW17vOD6EGBBJUmZg8c95vsrA@mail.gmail.com>
On Fri, Jun 30, 2023 at 04:05:36PM +0300, Amir Goldstein wrote:
> On Fri, Jun 30, 2023 at 3:30 PM Ignat Korchagin <ignat@cloudflare.com> wrote:
> >
> > On Fri, Jun 30, 2023 at 11:39 AM Amir Goldstein <amir73il@gmail.com> wrote:
> > >
> > > On Thu, Jun 29, 2023 at 10:31 PM Ignat Korchagin <ignat@cloudflare.com> wrote:
> > > >
> > > > On Thu, Jun 29, 2023 at 7:14 PM Darrick J. Wong <djwong@kernel.org> wrote:
> > > > >
> > > > > [add the xfs lts maintainers]
> > > > >
> > > > > On Thu, Jun 29, 2023 at 05:34:00PM +0100, Matthew Wilcox wrote:
> > > > > > On Thu, Jun 29, 2023 at 05:09:41PM +0100, Daniel Dao wrote:
> > > > > > > Hi Dave and Derrick,
> > > > > > >
> > > > > > > We are tracking down some corruptions on xfs for our rocksdb workload,
> > > > > > > running on kernel 6.1.25. The corruptions were
> > > > > > > detected by rocksdb block checksum. The workload seems to share some
> > > > > > > similarities
> > > > > > > with the multi-threaded write workload described in
> > > > > > > https://lore.kernel.org/linux-fsdevel/20221129001632.GX3600936@dread.disaster.area/
> > > > > > >
> > > > > > > Can we backport the patch series to stable since it seemed to fix data
> > > > > > > corruptions ?
> > > > > >
> > > > > > For clarity, are you asking for permission or advice about doing this
> > > > > > yourself, or are you asking somebody else to do the backport for you?
> > > > >
> > > > > Nobody's officially committed to backporting and testing patches for
> > > > > 6.1; are you (Cloudflare) volunteering?
> > > >
> > > > Yes, we have applied them on top of 6.1.36, will be gradually
> > > > releasing to our servers and will report back if we see the issues go
> > > > away
> > > >
> > >
> > > Getting feedback back from Cloudflare production servers is awesome
> > > but it's not enough.
> > >
> > > The standard for getting xfs LTS backports approved is:
> > > 1. Test the backports against regressions with several rounds of fstests
> > > check -g auto on selected xfs configurations [1]
> > > 2. Post the backport series to xfs list and get an ACK from upstream
> > > xfs maintainers
> > >
> > > We have volunteers doing this work for 5.4.y, 5.10.y and 5.15.y.
> > > We do not yet have a volunteer to do that work for 6.1.y.
> > >
> > > The question is whether you (or your team) are volunteering to
> > > do that work for 6.1.y xfs backports to help share the load?
> >
> > We are not a big team and apart from other internal project work our
> > efforts are focused on fixing this issue in production, because it
> > affects many teams and workloads. If we confirm that these patches fix
> > the issue in production, we will definitely consider dedicating some
> > work to ensure they are officially backported. But if not - we would
> > be required to search for a fix first before we can commit to any
> > work.
> >
> > So, IOW - can we come back to you a bit later on this after we get the
> > feedback from production?
> >
>
> Of course.
> The volunteering question for 6.1.y is independent.
>
> When you decide that you have a series of backports
> that proves to fix a real bug in production,
> a way to test the series will be worked out.
/me notes that xfs/558 and xfs/559 (in fstests) are the functional tests
for these patches that you're backporting; it would be useful to have a
third party (i.e. not just the reporter and the author) confirm that the
two fstests pass when real workloads are fixed.
--D
> Thanks,
> Amir.
next prev parent reply other threads:[~2023-06-30 15:18 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-06-29 16:09 Backporting of series xfs/iomap: fix data corruption due to stale cached iomap Daniel Dao
2023-06-29 16:34 ` Matthew Wilcox
2023-06-29 18:14 ` Darrick J. Wong
2023-06-29 19:30 ` Ignat Korchagin
2023-06-30 10:39 ` Amir Goldstein
2023-06-30 12:30 ` Ignat Korchagin
2023-06-30 13:05 ` Amir Goldstein
2023-06-30 15:16 ` Darrick J. Wong [this message]
2023-07-19 20:37 ` Ignat Korchagin
2023-07-20 6:45 ` Amir Goldstein
2023-07-20 18:30 ` Luis Chamberlain
2023-07-20 18:38 ` Frederick Lawler
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20230630151657.GJ11441@frogsfrogsfrogs \
--to=djwong@kernel.org \
--cc=amir73il@gmail.com \
--cc=chandanrlinux@gmail.com \
--cc=david@fromorbit.com \
--cc=dqminh@cloudflare.com \
--cc=ignat@cloudflare.com \
--cc=kernel-team@cloudflare.com \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-xfs@vger.kernel.org \
--cc=lrumancik@google.com \
--cc=mcgrof@kernel.org \
--cc=willy@infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).