public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* [RFC][PATCH] Fix lock inversion aio_kick_handler()
@ 2006-07-29  0:10 Zach Brown
  2006-07-29  1:34 ` Benjamin LaHaise
  0 siblings, 1 reply; 6+ messages in thread
From: Zach Brown @ 2006-07-29  0:10 UTC (permalink / raw)
  To: linux-kernel, linux-aio; +Cc: Benjamin LaHaise, Arjan van de Ven

Fix lock inversion aio_kick_handler()

lockdep found a AB BC CA lock inversion in retry-based AIO:

1) The task struct's alloc_lock (A) is acquired in process context with
interrupts enabled.  An interrupt might arrive and call wake_up() which grabs
the wait queue's q->lock (B).

2) When performing retry-based AIO the AIO core registers aio_wake_function()
as the wake funtion for iocb->ki_wait.  It is called with the wait queue's
q->lock (B) held and then tries to add the iocb to the run list after acquiring
the ctx_lock (C).

3) aio_kick_handler() holds the ctx_lock (C) while acquiring the alloc_lock (A)
via lock_task() and unuse_mm().  Lockdep emits a warning saying that we're
trying to connect the irq-safe q->lock to the irq-unsafe alloc_lock via
ctx_lock.

This fixes the inversion by calling unuse_mm() in the AIO kick handing path
after we've released the ctx_lock.

Signed-off-by: Zach Brown <zach.brown@oracle.com>
---

Ben, can you remember why we put unuse_mm() inside the ctx_lock?  Is that
intentional and still needed?

Index: 2.6.18-rc2-mm1-cmdepoll/fs/aio.c
===================================================================
--- 2.6.18-rc2-mm1-cmdepoll.orig/fs/aio.c
+++ 2.6.18-rc2-mm1-cmdepoll/fs/aio.c
@@ -598,9 +598,6 @@ static void use_mm(struct mm_struct *mm)
  *	by the calling kernel thread
  *	(Note: this routine is intended to be called only
  *	from a kernel thread context)
- *
- * Comments: Called with ctx->ctx_lock held. This nests
- * task_lock instead ctx_lock.
  */
 static void unuse_mm(struct mm_struct *mm)
 {
@@ -866,8 +863,8 @@ static void aio_kick_handler(void *data)
 	use_mm(ctx->mm);
 	spin_lock_irq(&ctx->ctx_lock);
 	requeue =__aio_run_iocbs(ctx);
- 	unuse_mm(ctx->mm);
 	spin_unlock_irq(&ctx->ctx_lock);
+ 	unuse_mm(ctx->mm);
 	set_fs(oldfs);
 	/*
 	 * we're in a worker thread already, don't use queue_delayed_work,

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [RFC][PATCH] Fix lock inversion aio_kick_handler()
  2006-07-29  0:10 [RFC][PATCH] Fix lock inversion aio_kick_handler() Zach Brown
@ 2006-07-29  1:34 ` Benjamin LaHaise
  2006-07-29  2:02   ` Zach Brown
  0 siblings, 1 reply; 6+ messages in thread
From: Benjamin LaHaise @ 2006-07-29  1:34 UTC (permalink / raw)
  To: Zach Brown; +Cc: linux-kernel, linux-aio, Arjan van de Ven

On Fri, Jul 28, 2006 at 05:10:32PM -0700, Zach Brown wrote:
> Fix lock inversion aio_kick_handler()

Doh.  Unfortunately, this patch isn't entirely correct as it could race with 
__put_ioctx() which sets ioctx->mm = NULL.  Something like the following 
should do the trick:

	struct mm_struct *mm;
...
> - 	unuse_mm(ctx->mm);
+	mm = ctx->mm;
>  	spin_unlock_irq(&ctx->ctx_lock);
+ 	unuse_mm(mm);
>  	set_fs(oldfs);
>  	/*
...


		-ben
-- 
"Time is of no importance, Mr. President, only life is important."
Don't Email: <dont@kvack.org>.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [RFC][PATCH] Fix lock inversion aio_kick_handler()
  2006-07-29  1:34 ` Benjamin LaHaise
@ 2006-07-29  2:02   ` Zach Brown
  2006-11-09 22:41     ` Jeff Moyer
  0 siblings, 1 reply; 6+ messages in thread
From: Zach Brown @ 2006-07-29  2:02 UTC (permalink / raw)
  To: Benjamin LaHaise; +Cc: linux-kernel, linux-aio, Arjan van de Ven

Benjamin LaHaise wrote:
> On Fri, Jul 28, 2006 at 05:10:32PM -0700, Zach Brown wrote:
>> Fix lock inversion aio_kick_handler()
> 
> Doh.  Unfortunately, this patch isn't entirely correct as it could race with 
> __put_ioctx() which sets ioctx->mm = NULL.

Aha, yeah, that's what I was missing.  Thanks.

> Something like the following should do the trick:

Cool, I'll respin and send it out.

- z

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [RFC][PATCH] Fix lock inversion aio_kick_handler()
  2006-07-29  2:02   ` Zach Brown
@ 2006-11-09 22:41     ` Jeff Moyer
  2006-11-09 22:51       ` Zach Brown
  0 siblings, 1 reply; 6+ messages in thread
From: Jeff Moyer @ 2006-11-09 22:41 UTC (permalink / raw)
  To: Zach Brown; +Cc: Benjamin LaHaise, linux-kernel, linux-aio, Arjan van de Ven

==> On Fri, 28 Jul 2006 19:02:23 -0700, Zach Brown <zach.brown@oracle.com> said:

Zach> Benjamin LaHaise wrote:
Zach> > On Fri, Jul 28, 2006 at 05:10:32PM -0700, Zach Brown wrote:
Zach> >> Fix lock inversion aio_kick_handler()
Zach> > 
Zach> > Doh.  Unfortunately, this patch isn't entirely correct as it
Zach> could race with > __put_ioctx() which sets ioctx->mm = NULL.

Zach> Aha, yeah, that's what I was missing.  Thanks.

Zach> > Something like the following should do the trick:

Zach> Cool, I'll respin and send it out.

Did you ever resend this patch, Zach?  It doesn't appear to be in
current kernels.  I'm still running into the lockdep warnings.

-Jeff

--- linux-2.6.19-rc5-mm1/fs/aio.c.orig	2006-11-09 17:28:43.000000000 -0500
+++ linux-2.6.19-rc5-mm1/fs/aio.c	2006-11-09 17:29:29.000000000 -0500
@@ -864,13 +864,15 @@ static void aio_kick_handler(void *data)
 	struct kioctx *ctx = data;
 	mm_segment_t oldfs = get_fs();
 	int requeue;
+	struct mm_struct *mm;
 
 	set_fs(USER_DS);
 	use_mm(ctx->mm);
 	spin_lock_irq(&ctx->ctx_lock);
 	requeue =__aio_run_iocbs(ctx);
- 	unuse_mm(ctx->mm);
+	mm = ctx->mm;
 	spin_unlock_irq(&ctx->ctx_lock);
+ 	unuse_mm(mm);
 	set_fs(oldfs);
 	/*
 	 * we're in a worker thread already, don't use queue_delayed_work,

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [RFC][PATCH] Fix lock inversion aio_kick_handler()
  2006-11-09 22:41     ` Jeff Moyer
@ 2006-11-09 22:51       ` Zach Brown
  2006-11-09 22:57         ` Jeff Moyer
  0 siblings, 1 reply; 6+ messages in thread
From: Zach Brown @ 2006-11-09 22:51 UTC (permalink / raw)
  To: Jeff Moyer; +Cc: Benjamin LaHaise, linux-kernel, linux-aio, Arjan van de Ven


> Zach> > Doh.  Unfortunately, this patch isn't entirely correct as it
> Zach> could race with > __put_ioctx() which sets ioctx->mm = NULL.
> 
> Zach> Aha, yeah, that's what I was missing.  Thanks.
> 
> Zach> > Something like the following should do the trick:
> 
> Zach> Cool, I'll respin and send it out.
> 
> Did you ever resend this patch, Zach?

Sadly, no.  I vaguely remember thinking that the refcounting was pretty
messed up in these paths and that more than just this patch was needed.
 I don't remember the details.

Maybe I should take a look again :/.

> current kernels.  I'm still running into the lockdep warnings.

When doing what?  Working with that IO_CMD_EPOLL_WAIT patch?

- z

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [RFC][PATCH] Fix lock inversion aio_kick_handler()
  2006-11-09 22:51       ` Zach Brown
@ 2006-11-09 22:57         ` Jeff Moyer
  0 siblings, 0 replies; 6+ messages in thread
From: Jeff Moyer @ 2006-11-09 22:57 UTC (permalink / raw)
  To: Zach Brown; +Cc: Benjamin LaHaise, linux-kernel, linux-aio, Arjan van de Ven

==> On Thu, 09 Nov 2006 14:51:59 -0800, Zach Brown <zach.brown@oracle.com> said:

Zach> > Zach> > Doh.  Unfortunately, this patch isn't entirely correct
Zach> as it > Zach> could race with > __put_ioctx() which sets
Zach> ioctx->mm = NULL.
Zach> > 
Zach> > Zach> Aha, yeah, that's what I was missing.  Thanks.
Zach> > 
Zach> > Zach> > Something like the following should do the trick:
Zach> > 
Zach> > Zach> Cool, I'll respin and send it out.
Zach> > 
Zach> > Did you ever resend this patch, Zach?

Zach> Sadly, no.  I vaguely remember thinking that the refcounting was
Zach> pretty messed up in these paths and that more than just this
Zach> patch was needed.  I don't remember the details.

Zach> Maybe I should take a look again :/.

Zach> > current kernels.  I'm still running into the lockdep warnings.

Zach> When doing what?  Working with that IO_CMD_EPOLL_WAIT patch?

Yep.  =)

-Jeff

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2006-11-09 22:57 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-07-29  0:10 [RFC][PATCH] Fix lock inversion aio_kick_handler() Zach Brown
2006-07-29  1:34 ` Benjamin LaHaise
2006-07-29  2:02   ` Zach Brown
2006-11-09 22:41     ` Jeff Moyer
2006-11-09 22:51       ` Zach Brown
2006-11-09 22:57         ` Jeff Moyer

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox