* [PATCH] xfs: clear _XBF_PAGES from buffers when readahead page allocation fails
From: Darrick J. Wong @ 2017-01-26 3:08 UTC
To: linux-xfs

If we try to allocate memory pages to back an xfs_buf that we're trying
to read, it's possible that we'll be so short on memory that the page
allocation fails. For a blocking read we'll just wait, but for
readahead we simply dump all the pages we've collected so far.

Unfortunately, after dumping the pages we neglect to clear the
_XBF_PAGES state, which means that other code might think that b_pages
still points to pages we own. If that other code is the buffer shrinker
and nobody else has grabbed the buffer, _buftarg_wait_rele will release
the buffer, which will see _XBF_PAGES and double-free the b_pages pages.

This results in screaming about negative page refcounts from the memory
manager, which xfs oughtn't be triggering. To reproduce this case,
mount a filesystem where the size of the inodes far outweighs the
available memory (a ~500M inode filesystem on a VM with 300MB memory
did the trick here) and run bulkstat in parallel with other
memory-eating processes to put a huge load on the system. The "check
summary" phase of xfs_scrub also works for this purpose.

Signed-off-by: Darrick J. Wong <darrick.wong@oracle.com>
---
fs/xfs/xfs_buf.c | 1 +
1 file changed, 1 insertion(+)
diff --git a/fs/xfs/xfs_buf.c b/fs/xfs/xfs_buf.c
index 7f0a01f..ac3b4db 100644
--- a/fs/xfs/xfs_buf.c
+++ b/fs/xfs/xfs_buf.c
@@ -422,6 +422,7 @@ xfs_buf_allocate_memory(
 out_free_pages:
 	for (i = 0; i < bp->b_page_count; i++)
 		__free_page(bp->b_pages[i]);
+	bp->b_flags &= ~_XBF_PAGES;
 	return error;
 }
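
For readers less used to the idiom, the one-line change is a plain bit-clear. The sketch below is a userspace illustration only: the DEMO_* flag names and their values are hypothetical stand-ins and are not the kernel's definitions. It shows that `flags &= ~FLAG` drops one bit, leaves the others alone, and is harmless when the bit is already clear.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical flag bits for illustration; not the real XFS values. */
#define DEMO_PAGES (1u << 0)	/* stand-in for _XBF_PAGES */
#define DEMO_KMEM  (1u << 1)	/* some unrelated flag */
#define DEMO_OTHER (1u << 2)	/* another unrelated flag */

/* Return flags with the given bit(s) cleared, as "flags &= ~flag" would. */
unsigned int demo_clear_flag(unsigned int flags, unsigned int flag)
{
	return flags & ~flag;
}

/* True if every bit in flag is set in flags. */
bool demo_has_flag(unsigned int flags, unsigned int flag)
{
	return (flags & flag) == flag;
}

/* Small self-check that exercises both helpers. */
void demo_self_check(void)
{
	unsigned int flags = DEMO_PAGES | DEMO_OTHER;

	flags = demo_clear_flag(flags, DEMO_PAGES);
	assert(!demo_has_flag(flags, DEMO_PAGES));	/* page flag is gone */
	assert(demo_has_flag(flags, DEMO_OTHER));	/* other bits survive */
}
```

Calling demo_self_check() from a test program is enough to exercise it; nothing here touches real buffer state.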
* Re: [PATCH] xfs: clear _XBF_PAGES from buffers when readahead page allocation fails
From: Eric Sandeen @ 2017-01-26 3:39 UTC
To: Darrick J. Wong, linux-xfs
On 1/25/17 9:08 PM, Darrick J. Wong wrote:
> diff --git a/fs/xfs/xfs_buf.c b/fs/xfs/xfs_buf.c
> index 7f0a01f..ac3b4db 100644
> --- a/fs/xfs/xfs_buf.c
> +++ b/fs/xfs/xfs_buf.c
> @@ -422,6 +422,7 @@ xfs_buf_allocate_memory(
>  out_free_pages:
>  	for (i = 0; i < bp->b_page_count; i++)
>  		__free_page(bp->b_pages[i]);
> +	bp->b_flags &= ~_XBF_PAGES;
>  	return error;
>  }

If xfs_buf_allocate_memory() fails, its one caller immediately
frees the bp. xfs_buf_free then looks at _XBF_PAGES, and
if set will call __free_page on each page.

I think that's where the double free is coming from, right?

-Eric
* Re: [PATCH] xfs: clear _XBF_PAGES from buffers when readahead page allocation fails
From: Darrick J. Wong @ 2017-01-26 3:48 UTC
To: Eric Sandeen; +Cc: linux-xfs
On Wed, Jan 25, 2017 at 09:39:15PM -0600, Eric Sandeen wrote:
> If xfs_buf_allocate_memory() fails, its one caller immediately
> frees the bp. xfs_buf_free then looks at _XBF_PAGES, and
> if set will call __free_page on each page.
>
> I think that's where the double free is coming from, right?

Oops. Yeah, the double free comes immediately after, not from the shrinker.
I'll fix the commit message.

--D
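
The sequence discussed in this thread (allocation fails and releases its pages, then the caller frees the buffer, and the free routine trusts the pages flag) can be sketched as a small userspace model. Everything named model_* or MODEL_* is hypothetical and exists only for illustration. The model assumes the flag is already set when the failure path runs, and it replaces real pages with release counters so that a double release shows up as a count of 2. It is not the kernel code.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MODEL_PAGES_FLAG (1u << 0)	/* hypothetical stand-in for _XBF_PAGES */
#define MODEL_MAX_PAGES  4

struct model_buf {
	unsigned int flags;
	size_t page_count;
	int *pages[MODEL_MAX_PAGES];	/* each "page" is just a release counter */
};

/* Stand-in for __free_page(): record one more release of this page. */
static void model_free_page(int *page)
{
	(*page)++;
}

/*
 * Model of an allocation that collects `got` pages and then fails.
 * With clear_flag == true it behaves like the patched error path.
 */
static int model_allocate_fail(struct model_buf *bp, int *backing,
			       size_t got, bool clear_flag)
{
	size_t i;

	assert(got <= MODEL_MAX_PAGES);
	bp->flags |= MODEL_PAGES_FLAG;	/* assumption: set before collecting */
	bp->page_count = got;
	for (i = 0; i < got; i++)
		bp->pages[i] = &backing[i];

	/* Failure path: release everything collected so far. */
	for (i = 0; i < bp->page_count; i++)
		model_free_page(bp->pages[i]);
	if (clear_flag)
		bp->flags &= ~MODEL_PAGES_FLAG;
	return -1;
}

/* Model of the free routine: releases pages only if the flag says we own them. */
static void model_buf_free(struct model_buf *bp)
{
	size_t i;

	if (bp->flags & MODEL_PAGES_FLAG) {
		for (i = 0; i < bp->page_count; i++)
			model_free_page(bp->pages[i]);
	}
}

/* Run the sequence once and return the highest release count on any page. */
int model_max_releases(bool clear_flag)
{
	int backing[MODEL_MAX_PAGES] = {0};
	struct model_buf bp = {0};
	int max = 0;
	size_t i;

	if (model_allocate_fail(&bp, backing, 3, clear_flag) != 0)
		model_buf_free(&bp);	/* caller frees the buffer on failure */

	for (i = 0; i < MODEL_MAX_PAGES; i++)
		if (backing[i] > max)
			max = backing[i];
	return max;
}
```

In this model, model_max_releases(false) reports 2 (each collected page released twice) and model_max_releases(true) reports 1, which is the behaviour the one-line change is meant to restore.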