From: "J. Bruce Fields" <bfields@fieldses.org>
To: Chuck Lever <chuck.lever@oracle.com>
Cc: Linux NFS Mailing List <linux-nfs@vger.kernel.org>
Subject: Re: [PATCH v1 1/3] nfsd: Limit end of page list when decoding NFSv4 WRITE
Date: Wed, 23 Aug 2017 21:18:05 -0400 [thread overview]
Message-ID: <20170824011805.GB10035@fieldses.org> (raw)
In-Reply-To: <C680E88E-B51D-464C-AC4C-5F9C0B31ED9B@oracle.com>
On Wed, Aug 23, 2017 at 02:36:33PM -0400, Chuck Lever wrote:
>
> > On Aug 22, 2017, at 5:45 PM, J. Bruce Fields <bfields@fieldses.org> wrote:
> >
> > On Mon, Aug 21, 2017 at 06:08:15PM -0400, Chuck Lever wrote:
> >>
> >>> On Aug 21, 2017, at 5:21 PM, J. Bruce Fields <bfields@fieldses.org> wrote:
> >>>
> >>> On Mon, Aug 21, 2017 at 05:15:38PM -0400, Chuck Lever wrote:
> >>>>
> >>>>> On Aug 21, 2017, at 5:13 PM, J. Bruce Fields <bfields@fieldses.org> wrote:
> >>>>>
> >>>>> On Fri, Aug 18, 2017 at 11:12:19AM -0400, Chuck Lever wrote:
> >>>>>> When processing an NFSv4 WRITE operation, argp->end should never
> >>>>>> point past the end of the data in the final page of the page list.
> >>>>>> Otherwise, nfsd4_decode_compound can walk into uninitialized memory.
> >>>>>>
> >>>>>> Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
> >>>>>> ---
> >>>>>> fs/nfsd/nfs4xdr.c | 6 ++----
> >>>>>> 1 file changed, 2 insertions(+), 4 deletions(-)
> >>>>>>
> >>>>>> diff --git a/fs/nfsd/nfs4xdr.c b/fs/nfsd/nfs4xdr.c
> >>>>>> index 51e729a..7c48d68 100644
> >>>>>> --- a/fs/nfsd/nfs4xdr.c
> >>>>>> +++ b/fs/nfsd/nfs4xdr.c
> >>>>>> @@ -144,7 +144,7 @@ static void next_decode_page(struct nfsd4_compoundargs *argp)
> >>>>>> argp->p = page_address(argp->pagelist[0]);
> >>>>>> argp->pagelist++;
> >>>>>> if (argp->pagelen < PAGE_SIZE) {
> >>>>>> - argp->end = argp->p + (argp->pagelen>>2);
> >>>>>> + argp->end = argp->p + XDR_QUADLEN(argp->pagelen);
> >>
> >> ^^^^^^^^^^^^^^ A
> >>
> >>>>>> argp->pagelen = 0;
> >>>>>> } else {
> >>>>>> argp->end = argp->p + (PAGE_SIZE>>2);
> >>>>>> @@ -1279,9 +1279,7 @@ static __be32 nfsd4_decode_opaque(struct nfsd4_compoundargs *argp, struct xdr_ne
> >>>>>> argp->pagelen -= pages * PAGE_SIZE;
> >>>>>> len -= pages * PAGE_SIZE;
> >>>>>>
> >>>>>> - argp->p = (__be32 *)page_address(argp->pagelist[0]);
> >>>>>> - argp->pagelist++;
> >>>>>> - argp->end = argp->p + XDR_QUADLEN(PAGE_SIZE);
> >>
> >> ^^^^^^^^^^^^^^ B
> >>
> >>>>>> + next_decode_page(argp);
> >>>>>
> >>>>> I think there's no change in behavior here *except* for adding a new
> >>>>> argp->pagelen=0 (or argp->pagelen -= PAGE_SIZE).
> >>>>
> >>>> The code around this change is currently working correctly,
> >>>> so there is no change in behavior AFAICT. This is a defensive
> >>>> change, but it also replaces duplicate code.
> >>>
> >>> I don't understand. I'm saying that by calling next_decode_page() there
> >>> you've added a new argp->pagelen assignment. I don't understand how
> >>> that can't change behavior, unless there's another bug in our bounds
> >>> checking someplace.
> >>
> >> Because of line B above, argp->end always points to the
> >> end of the final page in the page list. However, the
> >> buffer might end somewhere in the middle of that page,
> >> in which case, the transport hasn't initialized any of
> >> the bytes between the end of the buffer and the end of
> >> the page.
> >>
> >> As long as the other fields in the xdr_buf are set up
> >> properly, the XDR decoder will not walk into that uninit-
> >> ialized section of the last page. But there's nothing
> >> preventing a decoder or transport bug from causing it
> >> to walk into the uninitialized area. And always setting
> >> to the end of the page is confusing when the buffer
> >> itself is actually shorter.
> >>
> >> The key is to replace line B above with line A. argp->end
> >> is advanced by the remaining part of the final page rather
> >> than by a whole page.
> >
> > Got it, I agree with that part of the change, it's the pagelen change I
> > was having trouble with.
> >
> > But looking at it more, I think your patch is a fix and the current code
> > is wrong.
> >
> >> The next patch uses this new behavior to signal precisely
> >> when it has to move from the page list to the tail iovec.
> >>
> >>
> >>> Most likely it could cause subsequent op parsers to believe there's less
> >>> space in the argument buffer than there really is, so it might fail to
> >>> parse a compound with a write plus some other ops, if that puts the
> >>> total call close to the maximum size?
> >>
> >> Where is argp->pagelen used after the final next_decode_page
> >> call?
> >
> > Well, it's checked in every read_buf and next_decode_page to decide how
> > much space is left.
>
> Right, and I didn't change the pagelen adjustment that occurs
> in the loop. Just the final adjustment should be different.
>
>
> > It looks to me like the current code is wrong not to be decreasing
> > page_len at the end there. So I wonder if there's a bug right now.
> > E.g. maybe a compound with multiple writes could leave the xdr decoding
> > thinking it has more space than it does and allow someone to write
> > unrelated memory to some file.
>
> I believe that's possible.
Yeah, I can totally crash the server this way.
I'll send this along for stable. (And see if I can figure out how to get
a test for this kind of thing into pynfs. Unfortunately it's seems to
require some contortions to get it to produce bad xdr.)
--b.
next prev parent reply other threads:[~2017-08-24 1:18 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-08-18 15:12 [PATCH v1 0/3] Handle NFSv4 operations in xdr_buf tail Chuck Lever
2017-08-18 15:12 ` [PATCH v1 1/3] nfsd: Limit end of page list when decoding NFSv4 WRITE Chuck Lever
2017-08-21 21:13 ` J. Bruce Fields
2017-08-21 21:15 ` Chuck Lever
2017-08-21 21:21 ` J. Bruce Fields
2017-08-21 22:08 ` Chuck Lever
2017-08-22 21:45 ` J. Bruce Fields
2017-08-23 18:36 ` Chuck Lever
2017-08-24 1:18 ` J. Bruce Fields [this message]
2017-08-24 2:52 ` Weston Andros Adamson
2017-08-18 15:12 ` [PATCH v1 2/3] nfsd: Incoming xdr_bufs may have content in tail buffer Chuck Lever
2017-08-25 17:46 ` J. Bruce Fields
2017-08-18 15:12 ` [PATCH v1 3/3] svcrdma: Populate tail iovec when receiving Chuck Lever
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20170824011805.GB10035@fieldses.org \
--to=bfields@fieldses.org \
--cc=chuck.lever@oracle.com \
--cc=linux-nfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).