* kdevtmpfs oops since yesterdays vfs merge @ 2011-07-24 23:17 Dave Jones 2011-07-24 23:28 ` Al Viro 0 siblings, 1 reply; 11+ messages in thread From: Dave Jones @ 2011-07-24 23:17 UTC (permalink / raw) To: Al Viro; +Cc: Linux Kernel I see an oops in handle_create when I try to boot current tree.. full trace: https://s3.amazonaws.com/twitpic/photos/large/355006460.jpg?AWSAccessKeyId=AKIAJF3XCCKACR3QDMOA&Expires=1311550232&Signature=IIO%2Bya1uEDJzSXTD0DXh2%2BdZpoU%3D Dave ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: kdevtmpfs oops since yesterdays vfs merge 2011-07-24 23:17 kdevtmpfs oops since yesterdays vfs merge Dave Jones @ 2011-07-24 23:28 ` Al Viro 2011-07-24 23:40 ` Dave Jones 0 siblings, 1 reply; 11+ messages in thread From: Al Viro @ 2011-07-24 23:28 UTC (permalink / raw) To: Dave Jones, Linux Kernel On Sun, Jul 24, 2011 at 07:17:01PM -0400, Dave Jones wrote: > I see an oops in handle_create when I try to boot current tree.. > > full trace: > https://s3.amazonaws.com/twitpic/photos/large/355006460.jpg?AWSAccessKeyId=AKIAJF3XCCKACR3QDMOA&Expires=1311550232&Signature=IIO%2Bya1uEDJzSXTD0DXh2%2BdZpoU%3D Where in handle_create() is that? At least dump objdump -d of your devtmpfs.o someplace readable... ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: kdevtmpfs oops since yesterdays vfs merge 2011-07-24 23:28 ` Al Viro @ 2011-07-24 23:40 ` Dave Jones 2011-07-24 23:51 ` Al Viro 0 siblings, 1 reply; 11+ messages in thread From: Dave Jones @ 2011-07-24 23:40 UTC (permalink / raw) To: Al Viro; +Cc: Linux Kernel On Mon, Jul 25, 2011 at 12:28:12AM +0100, Al Viro wrote: > On Sun, Jul 24, 2011 at 07:17:01PM -0400, Dave Jones wrote: > > I see an oops in handle_create when I try to boot current tree.. > > > > full trace: > > https://s3.amazonaws.com/twitpic/photos/large/355006460.jpg?AWSAccessKeyId=AKIAJF3XCCKACR3QDMOA&Expires=1311550232&Signature=IIO%2Bya1uEDJzSXTD0DXh2%2BdZpoU%3D > > Where in handle_create() is that? At least dump objdump -d of your > devtmpfs.o someplace readable... http://codemonkey.org.uk/devtmpfs.s Dave ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: kdevtmpfs oops since yesterdays vfs merge 2011-07-24 23:40 ` Dave Jones @ 2011-07-24 23:51 ` Al Viro 2011-07-25 1:53 ` Dave Jones 0 siblings, 1 reply; 11+ messages in thread From: Al Viro @ 2011-07-24 23:51 UTC (permalink / raw) To: Dave Jones, Linux Kernel On Sun, Jul 24, 2011 at 07:40:29PM -0400, Dave Jones wrote: > On Mon, Jul 25, 2011 at 12:28:12AM +0100, Al Viro wrote: > > On Sun, Jul 24, 2011 at 07:17:01PM -0400, Dave Jones wrote: > > > I see an oops in handle_create when I try to boot current tree.. > > > > > > full trace: > > > https://s3.amazonaws.com/twitpic/photos/large/355006460.jpg?AWSAccessKeyId=AKIAJF3XCCKACR3QDMOA&Expires=1311550232&Signature=IIO%2Bya1uEDJzSXTD0DXh2%2BdZpoU%3D > > > > Where in handle_create() is that? At least dump objdump -d of your > > devtmpfs.o someplace readable... > > http://codemonkey.org.uk/devtmpfs.s Smells like req->dev somehow managing to be NULL at that point, but that doesn't make any sense - we get to devtmpfs_create_node() only from one place, it sets req.dev to the argument it got from callers and that caller would have oopsed itself before getting to that call with dev == NULL... Could you stick a BUG_ON(!dev) in the beginning of handle_create() to see if that's what somehow manages to happen? ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: kdevtmpfs oops since yesterdays vfs merge 2011-07-24 23:51 ` Al Viro @ 2011-07-25 1:53 ` Dave Jones 2011-07-25 1:56 ` Dave Jones 0 siblings, 1 reply; 11+ messages in thread From: Dave Jones @ 2011-07-25 1:53 UTC (permalink / raw) To: Al Viro; +Cc: Linux Kernel On Mon, Jul 25, 2011 at 12:51:54AM +0100, Al Viro wrote: > On Sun, Jul 24, 2011 at 07:40:29PM -0400, Dave Jones wrote: > > On Mon, Jul 25, 2011 at 12:28:12AM +0100, Al Viro wrote: > > > On Sun, Jul 24, 2011 at 07:17:01PM -0400, Dave Jones wrote: > > > > I see an oops in handle_create when I try to boot current tree.. > > > > > > > > full trace: > > > > https://s3.amazonaws.com/twitpic/photos/large/355006460.jpg?AWSAccessKeyId=AKIAJF3XCCKACR3QDMOA&Expires=1311550232&Signature=IIO%2Bya1uEDJzSXTD0DXh2%2BdZpoU%3D > > > > > > Where in handle_create() is that? At least dump objdump -d of your > > > devtmpfs.o someplace readable... > > > > http://codemonkey.org.uk/devtmpfs.s > > Smells like req->dev somehow managing to be NULL at that point, but that > doesn't make any sense - we get to devtmpfs_create_node() only from one > place, it sets req.dev to the argument it got from callers and that caller > would have oopsed itself before getting to that call with dev == NULL... > > Could you stick a BUG_ON(!dev) in the beginning of handle_create() to see > if that's what somehow manages to happen? So I built a kernel with this, and then couldn't reproduce it. Made a clean kernel again, and still nothing.. After a number of reboots, it finally triggered again, with that BUG_ON(). fwiw 'nodename' is pointing at garbage when that happens too. Either it only triggers occasionally, or it's dependent on how quickly I type my luks password in. Dave ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: kdevtmpfs oops since yesterdays vfs merge 2011-07-25 1:53 ` Dave Jones @ 2011-07-25 1:56 ` Dave Jones 2011-07-25 2:44 ` Al Viro 0 siblings, 1 reply; 11+ messages in thread From: Dave Jones @ 2011-07-25 1:56 UTC (permalink / raw) To: Al Viro, Linux Kernel On Sun, Jul 24, 2011 at 09:53:24PM -0400, Dave Jones wrote: > On Mon, Jul 25, 2011 at 12:51:54AM +0100, Al Viro wrote: > > On Sun, Jul 24, 2011 at 07:40:29PM -0400, Dave Jones wrote: > > > On Mon, Jul 25, 2011 at 12:28:12AM +0100, Al Viro wrote: > > > > On Sun, Jul 24, 2011 at 07:17:01PM -0400, Dave Jones wrote: > > > > > I see an oops in handle_create when I try to boot current tree.. > > > > > > > > > > full trace: > > > > > https://s3.amazonaws.com/twitpic/photos/large/355006460.jpg?AWSAccessKeyId=AKIAJF3XCCKACR3QDMOA&Expires=1311550232&Signature=IIO%2Bya1uEDJzSXTD0DXh2%2BdZpoU%3D > > > > > > > > Where in handle_create() is that? At least dump objdump -d of your > > > > devtmpfs.o someplace readable... > > > > > > http://codemonkey.org.uk/devtmpfs.s > > > > Smells like req->dev somehow managing to be NULL at that point, but that > > doesn't make any sense - we get to devtmpfs_create_node() only from one > > place, it sets req.dev to the argument it got from callers and that caller > > would have oopsed itself before getting to that call with dev == NULL... > > > > Could you stick a BUG_ON(!dev) in the beginning of handle_create() to see > > if that's what somehow manages to happen? > > So I built a kernel with this, and then couldn't reproduce it. > Made a clean kernel again, and still nothing.. After a number of reboots, > it finally triggered again, with that BUG_ON(). fwiw 'nodename' is pointing > at garbage when that happens too. > > Either it only triggers occasionally, or it's dependent on how quickly > I type my luks password in. one more datapoint. On a succesful boot, I see .. [ 7.760774] dracut: luksOpen /dev/sda2 luks-b5a1fb36-5672-4191-a260-e3f389eb0bb6 [ 14.787158] nodename: dm-0 [ 15.082391] nodename: dm-0 when it triggers the bug_on(), it's that second nodename that is garbage. Dave ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: kdevtmpfs oops since yesterdays vfs merge 2011-07-25 1:56 ` Dave Jones @ 2011-07-25 2:44 ` Al Viro 2011-07-25 4:58 ` Dave Jones 0 siblings, 1 reply; 11+ messages in thread From: Al Viro @ 2011-07-25 2:44 UTC (permalink / raw) To: Dave Jones, Linux Kernel On Sun, Jul 24, 2011 at 09:56:12PM -0400, Dave Jones wrote: > [ 7.760774] dracut: luksOpen /dev/sda2 luks-b5a1fb36-5672-4191-a260-e3f389eb0bb6 > [ 14.787158] nodename: dm-0 > [ 15.082391] nodename: dm-0 > > > when it triggers the bug_on(), it's that second nodename that is garbage. Interesting... The next experiment would be to stick BUG_ON(!req.dev) into devtmpfs_create_node() right after the assigment to that field. We couldn't be hit by the lack of barriers here, could we? Store to req.dev happens before spin_unlock(&req_lock), so by the time when that request is seen by loop in devtmpfsd() and passed to handle() it should be seen - we have grabbed req_lock, found a pointer to req, dropped req_lock and called handle(). Should've been enough... Might be interesting to print &req from devtmpfs_create_node(), both on entry and on exit, and print req right before the call of handle()... Incidentally, that disassembly shows one really ugly thing - offset of ->devt in struct device is 0x3c0. IOW, each of those suckers eats a kilobyte... ;-/ ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: kdevtmpfs oops since yesterdays vfs merge 2011-07-25 2:44 ` Al Viro @ 2011-07-25 4:58 ` Dave Jones 2011-07-25 5:12 ` Al Viro 0 siblings, 1 reply; 11+ messages in thread From: Dave Jones @ 2011-07-25 4:58 UTC (permalink / raw) To: Al Viro; +Cc: Linux Kernel On Mon, Jul 25, 2011 at 03:44:44AM +0100, Al Viro wrote: > > when it triggers the bug_on(), it's that second nodename that is garbage. > > Interesting... The next experiment would be to stick BUG_ON(!req.dev) > into devtmpfs_create_node() right after the assigment to that field. couldn't get that to trigger. > We couldn't be hit by the lack of barriers here, could we? Store to > req.dev happens before spin_unlock(&req_lock), so by the time when > that request is seen by loop in devtmpfsd() and passed to handle() it > should be seen - we have grabbed req_lock, found a pointer to req, dropped > req_lock and called handle(). Should've been enough... > > Might be interesting to print &req from devtmpfs_create_node(), both on > entry and on exit, and print req right before the call of handle()... Here's latest.. https://s3.amazonaws.com/twitpic/photos/full/355219312.jpg?AWSAccessKeyId=AKIAJF3XCCKACR3QDMOA&Expires=1311570683&Signature=xr3tusulMiV2bIsxux9YNrawUDA%3D apologies for crappy picture, but it's legible at fullsize.. interesting thing here is that the req that causes the oops, I couldn't find any call to create_handle for that address, so where devtmpfsd got it is a mystery. The address is curious too, in that it's way off from all the reqs created around that time. I'll add some more printk's to see if I can figure where that's being created. Dave ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: kdevtmpfs oops since yesterdays vfs merge 2011-07-25 4:58 ` Dave Jones @ 2011-07-25 5:12 ` Al Viro 2011-07-25 5:53 ` Dave Jones 0 siblings, 1 reply; 11+ messages in thread From: Al Viro @ 2011-07-25 5:12 UTC (permalink / raw) To: Dave Jones, Linux Kernel On Mon, Jul 25, 2011 at 12:58:52AM -0400, Dave Jones wrote: > On Mon, Jul 25, 2011 at 03:44:44AM +0100, Al Viro wrote: > > > > when it triggers the bug_on(), it's that second nodename that is garbage. > > > > Interesting... The next experiment would be to stick BUG_ON(!req.dev) > > into devtmpfs_create_node() right after the assigment to that field. > > couldn't get that to trigger. Interesting... > > We couldn't be hit by the lack of barriers here, could we? Store to > > req.dev happens before spin_unlock(&req_lock), so by the time when > > that request is seen by loop in devtmpfsd() and passed to handle() it > > should be seen - we have grabbed req_lock, found a pointer to req, dropped > > req_lock and called handle(). Should've been enough... > > > > Might be interesting to print &req from devtmpfs_create_node(), both on > > entry and on exit, and print req right before the call of handle()... > > Here's latest.. > > https://s3.amazonaws.com/twitpic/photos/full/355219312.jpg?AWSAccessKeyId=AKIAJF3XCCKACR3QDMOA&Expires=1311570683&Signature=xr3tusulMiV2bIsxux9YNrawUDA%3D > > apologies for crappy picture, but it's legible at fullsize.. > > interesting thing here is that the req that causes the oops, I couldn't > find any call to create_handle for that address, so where devtmpfsd got it > is a mystery. The address is curious too, in that it's way off from all the > reqs created around that time. Arrgh... OK, I see what's going on. req->err = handle(req->name, req->mode, req->dev); complete(&req->done); req = req->next; is letting the request creator to continue; if it leaves the scope, guess what is left in *req? That's right, garbage... Including req->next. All right, try this and let's see if it fixes the problem: diff --git a/drivers/base/devtmpfs.c b/drivers/base/devtmpfs.c index 3644dd4..49b6cba 100644 --- a/drivers/base/devtmpfs.c +++ b/drivers/base/devtmpfs.c @@ -406,9 +406,10 @@ static int devtmpfsd(void *p) requests = NULL; spin_unlock(&req_lock); while (req) { + struct req *next = req->next; req->err = handle(req->name, req->mode, req->dev); complete(&req->done); - req = req->next; + req = next; } spin_lock(&req_lock); } ^ permalink raw reply related [flat|nested] 11+ messages in thread
* Re: kdevtmpfs oops since yesterdays vfs merge 2011-07-25 5:12 ` Al Viro @ 2011-07-25 5:53 ` Dave Jones 2011-07-25 6:15 ` Al Viro 0 siblings, 1 reply; 11+ messages in thread From: Dave Jones @ 2011-07-25 5:53 UTC (permalink / raw) To: Al Viro; +Cc: Linux Kernel On Mon, Jul 25, 2011 at 06:12:51AM +0100, Al Viro wrote: > Arrgh... OK, I see what's going on. > > req->err = handle(req->name, req->mode, req->dev); > complete(&req->done); > req = req->next; > is letting the request creator to continue; if it leaves the scope, guess > what is left in *req? That's right, garbage... Including req->next. > All right, try this and let's see if it fixes the problem: Yep, that solves the problem. Dave ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: kdevtmpfs oops since yesterdays vfs merge 2011-07-25 5:53 ` Dave Jones @ 2011-07-25 6:15 ` Al Viro 0 siblings, 0 replies; 11+ messages in thread From: Al Viro @ 2011-07-25 6:15 UTC (permalink / raw) To: Dave Jones, Linux Kernel On Mon, Jul 25, 2011 at 01:53:08AM -0400, Dave Jones wrote: > On Mon, Jul 25, 2011 at 06:12:51AM +0100, Al Viro wrote: > > > Arrgh... OK, I see what's going on. > > > > req->err = handle(req->name, req->mode, req->dev); > > complete(&req->done); > > req = req->next; > > is letting the request creator to continue; if it leaves the scope, guess > > what is left in *req? That's right, garbage... Including req->next. > > All right, try this and let's see if it fixes the problem: > > Yep, that solves the problem. OK, to Linus it goes tomorrow morning... I'm about to fall asleep right now and queue needs a bit of reordering ;-/ ^ permalink raw reply [flat|nested] 11+ messages in thread
end of thread, other threads:[~2011-07-25 6:15 UTC | newest] Thread overview: 11+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2011-07-24 23:17 kdevtmpfs oops since yesterdays vfs merge Dave Jones 2011-07-24 23:28 ` Al Viro 2011-07-24 23:40 ` Dave Jones 2011-07-24 23:51 ` Al Viro 2011-07-25 1:53 ` Dave Jones 2011-07-25 1:56 ` Dave Jones 2011-07-25 2:44 ` Al Viro 2011-07-25 4:58 ` Dave Jones 2011-07-25 5:12 ` Al Viro 2011-07-25 5:53 ` Dave Jones 2011-07-25 6:15 ` Al Viro
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox