* Re: kernel panic / NULL pointer dereference [not found] <jognqm$o21$1@dough.gmane.org> @ 2012-05-10 16:43 ` Bernd Schubert 2012-05-11 13:49 ` [PATCH] bio allocation failure due to bio_get_nr_vecs() Bernd Schubert 0 siblings, 1 reply; 9+ messages in thread From: Bernd Schubert @ 2012-05-10 16:43 UTC (permalink / raw) Cc: linux-fsdevel@vger.kernel.org On 05/10/2012 05:45 PM, Bernd Schubert wrote: > Hi all, > > I'm just playing with an SRP connected NetApp system and just got an XFS > related kernel panic. I guess it is due to large IO (32MiB). At least it > just came up after enabling 32MiB device max_sectors. > As the tests are running in a RHEL6 image and as I needed at least > 2.6.39 to get a large srp_tablsize with SRP, I simply installed the > lasted oracle uek kernel. If needed I'm going to update to a vanilla > version. > > >> May 10 17:31:49 sgi01 kernel: XFS (sdb): Mounting Filesystem >> May 10 17:31:49 sgi01 kernel: XFS (sdb): Ending clean mount >> May 10 17:33:00 sgi01 kernel: BUG: unable to handle kernel NULL >> pointer dereference at (null) >> May 10 17:33:00 sgi01 kernel: IP: [<ffffffffa07f5483>] >> xfs_alloc_ioend_bio+0x33/0x50 [xfs] Oh, there is a bio allocation path to return NULL: bvec_alloc_bs(gfp_mask, nr_iovecs, ) => NULL when nr_iovecs > BIO_MAX_PAGES bio_alloc_bioset(gfp_mask, nr_iovecs, ...) bio_alloc(GFP_NOIO, nvecs) xfs_alloc_ioend_bio() And nvecs/nr_iovecs is obtained by bio_get_nr_vecs(), which does not check for BIO_MAX_PAGES. Of course, all of that only happens with large IO sizes, which is exactly what I'm doing. As xfs_alloc_ioend_bio() is using GFP_NOIO it does not expect bio_alloc to fail, but as I'm trying to send large IOs I guess that is exactly what happens here. >May 10 17:33:00 sgi01 kernel: [<ffffffffa07f561e>] xfs_submit_ioend+0xfe/0x110 [xfs] >May 10 17:33:00 sgi01 kernel: [<ffffffffa07f696b>] xfs_vm_writepage+0x26b/0x510 [xfs] >May 10 17:33:00 sgi01 kernel: [<ffffffff81112377>] __writepage+0x17/0x40 >May 10 17:33:00 sgi01 kernel: [<ffffffff81113696>] write_cache_pages+0x246/0x520 >May 10 17:33:00 sgi01 kernel: [<ffffffff81112360>] ? set_page_dirty+0x70/0x70 >May 10 17:33:00 sgi01 kernel: [<ffffffff811139c1>] generic_writepages+0x51/0x80 >May 10 17:33:00 sgi01 kernel: [<ffffffffa07f537d>] xfs_vm_writepages+0x5d/0x80 [xfs] >May 10 17:33:00 sgi01 kernel: [<ffffffff81113a11>] do_writepages+0x21/0x40 >May 10 17:33:00 sgi01 kernel: [<ffffffff8118df2e>] writeback_single_inode+0x10e/0x270 ^ permalink raw reply [flat|nested] 9+ messages in thread
* [PATCH] bio allocation failure due to bio_get_nr_vecs() 2012-05-10 16:43 ` kernel panic / NULL pointer dereference Bernd Schubert @ 2012-05-11 13:49 ` Bernd Schubert 2012-05-11 14:06 ` Jeff Moyer 2012-05-11 14:36 ` Jens Axboe 0 siblings, 2 replies; 9+ messages in thread From: Bernd Schubert @ 2012-05-11 13:49 UTC (permalink / raw) To: Bernd Schubert Cc: linux-fsdevel@vger.kernel.org, linux-xfs, sandeen, Kent Overstreet, Tejun Heo, Jens Axboe >>> May 10 17:31:49 sgi01 kernel: XFS (sdb): Mounting Filesystem >>> May 10 17:31:49 sgi01 kernel: XFS (sdb): Ending clean mount >>> May 10 17:33:00 sgi01 kernel: BUG: unable to handle kernel NULL >>> pointer dereference at (null) >>> May 10 17:33:00 sgi01 kernel: IP: [<ffffffffa07f5483>] >>> xfs_alloc_ioend_bio+0x33/0x50 [xfs] > > Oh, there is a bio allocation path to return NULL: > > bvec_alloc_bs(gfp_mask, nr_iovecs, ) => NULL when nr_iovecs> BIO_MAX_PAGES > bio_alloc_bioset(gfp_mask, nr_iovecs, ...) > bio_alloc(GFP_NOIO, nvecs) > xfs_alloc_ioend_bio() > > And nvecs/nr_iovecs is obtained by bio_get_nr_vecs(), which does not check for > BIO_MAX_PAGES. Of course, all of that only happens with large IO sizes, > which is exactly what I'm doing. > As xfs_alloc_ioend_bio() is using GFP_NOIO it does not expect bio_alloc > to fail, but as I'm trying to send large IOs I guess that is exactly what happens here. I see that Kent already fixed an overflow issue in commit 5abebfdd02450fa1349daacf242e70b3736581e3. But even with this commit, bio_get_nr_vecs() still only checks for queue_max_segments(). As we have a maximum of 2048 segments, that does not help much here. After cherry-picking 5abebfdd02450fa1349daacf242e70b3736581e3 and applying the patch below, I didn't run into panics / NULL pointer dereferences anymore. bio: bio_get_nr_vecs() must not return more than BIO_MAX_PAGES From: Bernd Schubert <bernd.schubert@itwm.fraunhofer.de> The number of bio_get_nr_vecs() is passed down via bio_alloc() to bvec_alloc_bs(), which fails the bio allocation if nr_iovecs > BIO_MAX_PAGES. For the underlying caller this causes an unexpected bio allocation failure. Limiting to queue_max_segments() is not sufficient, as max_segments also might be very large. bvec_alloc_bs(gfp_mask, nr_iovecs, ) => NULL when nr_iovecs > BIO_MAX_PAGES bio_alloc_bioset(gfp_mask, nr_iovecs, ...) bio_alloc(GFP_NOIO, nvecs) xfs_alloc_ioend_bio() Signed-off-by: Bernd Schubert <bernd.schubert@itwm.fraunhofer.de> --- fs/bio.c | 7 ++++++- 1 file changed, 6 insertions(+), 1 deletion(-) diff --git a/fs/bio.c b/fs/bio.c index e453924..84da885 100644 --- a/fs/bio.c +++ b/fs/bio.c @@ -505,9 +505,14 @@ EXPORT_SYMBOL(bio_clone); int bio_get_nr_vecs(struct block_device *bdev) { struct request_queue *q = bdev_get_queue(bdev); - return min_t(unsigned, + int nr_pages; + + nr_pages = min_t(unsigned, queue_max_segments(q), queue_max_sectors(q) / (PAGE_SIZE >> 9) + 1); + + return min_t(unsigned, nr_pages, BIO_MAX_PAGES); + } EXPORT_SYMBOL(bio_get_nr_vecs); ^ permalink raw reply related [flat|nested] 9+ messages in thread
* Re: [PATCH] bio allocation failure due to bio_get_nr_vecs() 2012-05-11 13:49 ` [PATCH] bio allocation failure due to bio_get_nr_vecs() Bernd Schubert @ 2012-05-11 14:06 ` Jeff Moyer 2012-05-11 14:31 ` Bernd Schubert 2012-05-11 14:36 ` Jens Axboe 2012-05-11 14:36 ` Jens Axboe 1 sibling, 2 replies; 9+ messages in thread From: Jeff Moyer @ 2012-05-11 14:06 UTC (permalink / raw) To: Bernd Schubert Cc: linux-fsdevel@vger.kernel.org, linux-xfs, sandeen, Kent Overstreet, Tejun Heo, Jens Axboe Bernd Schubert <bernd.schubert@itwm.fraunhofer.de> writes: > diff --git a/fs/bio.c b/fs/bio.c > index e453924..84da885 100644 > --- a/fs/bio.c > +++ b/fs/bio.c > @@ -505,9 +505,14 @@ EXPORT_SYMBOL(bio_clone); > int bio_get_nr_vecs(struct block_device *bdev) > { > struct request_queue *q = bdev_get_queue(bdev); > - return min_t(unsigned, > + int nr_pages; Looks like a corrupt patch. ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [PATCH] bio allocation failure due to bio_get_nr_vecs() 2012-05-11 14:06 ` Jeff Moyer @ 2012-05-11 14:31 ` Bernd Schubert 2012-05-11 14:36 ` Jens Axboe 1 sibling, 0 replies; 9+ messages in thread From: Bernd Schubert @ 2012-05-11 14:31 UTC (permalink / raw) To: Jeff Moyer Cc: linux-fsdevel@vger.kernel.org, linux-xfs, sandeen, Kent Overstreet, Tejun Heo, Jens Axboe [-- Attachment #1: Type: text/plain, Size: 606 bytes --] On 05/11/2012 04:06 PM, Jeff Moyer wrote: > Bernd Schubert<bernd.schubert@itwm.fraunhofer.de> writes: > >> diff --git a/fs/bio.c b/fs/bio.c >> index e453924..84da885 100644 >> --- a/fs/bio.c >> +++ b/fs/bio.c >> @@ -505,9 +505,14 @@ EXPORT_SYMBOL(bio_clone); >> int bio_get_nr_vecs(struct block_device *bdev) >> { >> struct request_queue *q = bdev_get_queue(bdev); >> - return min_t(unsigned, >> + int nr_pages; > > Looks like a corrupt patch. What do you actually mean? Issue by thunderbird? I just saved the mail in my sent folder and it looks? Just to be sure, patch attached. Thanks, Bernd [-- Attachment #2: fix-bio-nrvec.patch --] [-- Type: text/x-patch, Size: 1242 bytes --] bio: bio_get_nr_vecs() must not return more than BIO_MAX_PAGES From: Bernd Schubert <bernd.schubert@itwm.fraunhofer.de> The number of bio_get_nr_vecs() is passed down via bio_alloc() to bvec_alloc_bs(), which fails the bio allocation if nr_iovecs > BIO_MAX_PAGES. For the underlying caller this causes an unexpected bio allocation failure. Limiting to queue_max_segments() is not sufficiet, as max_segments also might be very large. bvec_alloc_bs(gfp_mask, nr_iovecs, ) => NULL when nr_iovecs > BIO_MAX_PAGES bio_alloc_bioset(gfp_mask, nr_iovecs, ...) bio_alloc(GFP_NOIO, nvecs) xfs_alloc_ioend_bio() Signed-off-by: Bernd Schubert <bernd.schubert@itwm.fraunhofer.de> --- fs/bio.c | 7 ++++++- 1 file changed, 6 insertions(+), 1 deletion(-) diff --git a/fs/bio.c b/fs/bio.c index e453924..84da885 100644 --- a/fs/bio.c +++ b/fs/bio.c @@ -505,9 +505,14 @@ EXPORT_SYMBOL(bio_clone); int bio_get_nr_vecs(struct block_device *bdev) { struct request_queue *q = bdev_get_queue(bdev); - return min_t(unsigned, + int nr_pages; + + nr_pages = min_t(unsigned, queue_max_segments(q), queue_max_sectors(q) / (PAGE_SIZE >> 9) + 1); + + return min_t(unsigned, nr_pages, BIO_MAX_PAGES); + } EXPORT_SYMBOL(bio_get_nr_vecs); ^ permalink raw reply related [flat|nested] 9+ messages in thread
* Re: [PATCH] bio allocation failure due to bio_get_nr_vecs() 2012-05-11 14:06 ` Jeff Moyer 2012-05-11 14:31 ` Bernd Schubert @ 2012-05-11 14:36 ` Jens Axboe 2012-05-11 16:29 ` Jeff Moyer 1 sibling, 1 reply; 9+ messages in thread From: Jens Axboe @ 2012-05-11 14:36 UTC (permalink / raw) To: Jeff Moyer Cc: Bernd Schubert, linux-fsdevel@vger.kernel.org, linux-xfs, sandeen, Kent Overstreet, Tejun Heo On 05/11/2012 04:06 PM, Jeff Moyer wrote: > Bernd Schubert <bernd.schubert@itwm.fraunhofer.de> writes: > >> diff --git a/fs/bio.c b/fs/bio.c >> index e453924..84da885 100644 >> --- a/fs/bio.c >> +++ b/fs/bio.c >> @@ -505,9 +505,14 @@ EXPORT_SYMBOL(bio_clone); >> int bio_get_nr_vecs(struct block_device *bdev) >> { >> struct request_queue *q = bdev_get_queue(bdev); >> - return min_t(unsigned, >> + int nr_pages; > > Looks like a corrupt patch. It's fine, I think you are misreading the added and removed lines :-) -- Jens Axboe ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [PATCH] bio allocation failure due to bio_get_nr_vecs() 2012-05-11 14:36 ` Jens Axboe @ 2012-05-11 16:29 ` Jeff Moyer 0 siblings, 0 replies; 9+ messages in thread From: Jeff Moyer @ 2012-05-11 16:29 UTC (permalink / raw) To: Jens Axboe Cc: Bernd Schubert, linux-fsdevel@vger.kernel.org, linux-xfs, sandeen, Kent Overstreet, Tejun Heo Jens Axboe <axboe@kernel.dk> writes: > On 05/11/2012 04:06 PM, Jeff Moyer wrote: >> Bernd Schubert <bernd.schubert@itwm.fraunhofer.de> writes: >> >>> diff --git a/fs/bio.c b/fs/bio.c >>> index e453924..84da885 100644 >>> --- a/fs/bio.c >>> +++ b/fs/bio.c >>> @@ -505,9 +505,14 @@ EXPORT_SYMBOL(bio_clone); >>> int bio_get_nr_vecs(struct block_device *bdev) >>> { >>> struct request_queue *q = bdev_get_queue(bdev); >>> - return min_t(unsigned, >>> + int nr_pages; >> >> Looks like a corrupt patch. > > It's fine, I think you are misreading the added and removed lines :-) Whoops, sorry! ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [PATCH] bio allocation failure due to bio_get_nr_vecs() 2012-05-11 13:49 ` [PATCH] bio allocation failure due to bio_get_nr_vecs() Bernd Schubert 2012-05-11 14:06 ` Jeff Moyer @ 2012-05-11 14:36 ` Jens Axboe 2012-05-11 14:44 ` Bernd Schubert 1 sibling, 1 reply; 9+ messages in thread From: Jens Axboe @ 2012-05-11 14:36 UTC (permalink / raw) To: Bernd Schubert Cc: linux-fsdevel@vger.kernel.org, linux-xfs, sandeen, Kent Overstreet, Tejun Heo On 05/11/2012 03:49 PM, Bernd Schubert wrote: > The number of bio_get_nr_vecs() is passed down via bio_alloc() to > bvec_alloc_bs(), which fails the bio allocation if > nr_iovecs > BIO_MAX_PAGES. For the underlying caller this causes an > unexpected bio allocation failure. > Limiting to queue_max_segments() is not sufficient, as max_segments > also might be very large. > > bvec_alloc_bs(gfp_mask, nr_iovecs, ) => NULL when nr_iovecs > BIO_MAX_PAGES > bio_alloc_bioset(gfp_mask, nr_iovecs, ...) > bio_alloc(GFP_NOIO, nvecs) > xfs_alloc_ioend_bio() Thanks, looks sane. Applied. -- Jens Axboe ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [PATCH] bio allocation failure due to bio_get_nr_vecs() 2012-05-11 14:36 ` Jens Axboe @ 2012-05-11 14:44 ` Bernd Schubert 2012-05-11 14:45 ` Jens Axboe 0 siblings, 1 reply; 9+ messages in thread From: Bernd Schubert @ 2012-05-11 14:44 UTC (permalink / raw) To: Jens Axboe Cc: linux-fsdevel@vger.kernel.org, linux-xfs, sandeen, Kent Overstreet, Tejun Heo On 05/11/2012 04:36 PM, Jens Axboe wrote: > On 05/11/2012 03:49 PM, Bernd Schubert wrote: >> The number of bio_get_nr_vecs() is passed down via bio_alloc() to >> bvec_alloc_bs(), which fails the bio allocation if >> nr_iovecs> BIO_MAX_PAGES. For the underlying caller this causes an >> unexpected bio allocation failure. >> Limiting to queue_max_segments() is not sufficient, as max_segments >> also might be very large. >> >> bvec_alloc_bs(gfp_mask, nr_iovecs, ) => NULL when nr_iovecs> BIO_MAX_PAGES >> bio_alloc_bioset(gfp_mask, nr_iovecs, ...) >> bio_alloc(GFP_NOIO, nvecs) >> xfs_alloc_ioend_bio() > > Thanks, looks sane. Applied. > Great, thanks! Should we CC linux-stable for commit 5abebfdd02450fa1349daacf242e70b3736581e3 and this one, as I got a hard kernel panic? Thanks, Bernd ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [PATCH] bio allocation failure due to bio_get_nr_vecs() 2012-05-11 14:44 ` Bernd Schubert @ 2012-05-11 14:45 ` Jens Axboe 0 siblings, 0 replies; 9+ messages in thread From: Jens Axboe @ 2012-05-11 14:45 UTC (permalink / raw) To: Bernd Schubert Cc: linux-fsdevel@vger.kernel.org, linux-xfs, sandeen, Kent Overstreet, Tejun Heo On 05/11/2012 04:44 PM, Bernd Schubert wrote: > On 05/11/2012 04:36 PM, Jens Axboe wrote: >> On 05/11/2012 03:49 PM, Bernd Schubert wrote: >>> The number of bio_get_nr_vecs() is passed down via bio_alloc() to >>> bvec_alloc_bs(), which fails the bio allocation if >>> nr_iovecs> BIO_MAX_PAGES. For the underlying caller this causes an >>> unexpected bio allocation failure. >>> Limiting to queue_max_segments() is not sufficient, as max_segments >>> also might be very large. >>> >>> bvec_alloc_bs(gfp_mask, nr_iovecs, ) => NULL when nr_iovecs> BIO_MAX_PAGES >>> bio_alloc_bioset(gfp_mask, nr_iovecs, ...) >>> bio_alloc(GFP_NOIO, nvecs) >>> xfs_alloc_ioend_bio() >> >> Thanks, looks sane. Applied. >> > > Great, thanks! Should we CC linux-stable for commit > 5abebfdd02450fa1349daacf242e70b3736581e3 and this one, as I got a hard > kernel panic? Yes, that's a good idea. I've ammended the commit now to include stable. -- Jens Axboe ^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2012-05-11 18:09 UTC | newest] Thread overview: 9+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- [not found] <jognqm$o21$1@dough.gmane.org> 2012-05-10 16:43 ` kernel panic / NULL pointer dereference Bernd Schubert 2012-05-11 13:49 ` [PATCH] bio allocation failure due to bio_get_nr_vecs() Bernd Schubert 2012-05-11 14:06 ` Jeff Moyer 2012-05-11 14:31 ` Bernd Schubert 2012-05-11 14:36 ` Jens Axboe 2012-05-11 16:29 ` Jeff Moyer 2012-05-11 14:36 ` Jens Axboe 2012-05-11 14:44 ` Bernd Schubert 2012-05-11 14:45 ` Jens Axboe
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).