From: Lars Ellenberg <lars.ellenberg@linbit.com>
To: Greg KH <gregkh@linuxfoundation.org>
Cc: Jens Axboe <axboe@kernel.dk>, David Mohr <david@mcbf.net>,
Philipp Reisner <philipp.reisner@linbit.com>,
stable@vger.kernel.org, drbd-dev@lists.linbit.com
Subject: Re: [Drbd-dev] [PATCH] drbd: fix regression 'out of mem, failed to invoke fence-peer helper'
Date: Tue, 7 Oct 2014 17:33:51 +0200 [thread overview]
Message-ID: <20141007153351.GH8574@soda.linbit> (raw)
In-Reply-To: <20141005234701.GA23078@kroah.com>
On Sun, Oct 05, 2014 at 04:47:01PM -0700, Greg KH wrote:
> On Wed, Oct 01, 2014 at 11:32:29AM +0200, Philipp Reisner wrote:
> > From: Lars Ellenberg <lars.ellenberg@linbit.com>
> >
> > Stable info:
> > This patch landed in upstream with v3.16 as commit
> > bbc1c5e8ad6dfebf9d13b8a4ccdf66c92913eac9
> > it should go into v3.14+
> >
> > Since linux kernel 3.13, kthread_run() internally uses
> > wait_for_completion_killable(). We sometimes may use kthread_run()
> > while we still have a signal pending, which we used to kick our threads
> > out of potentially blocking network functions, causing kthread_run() to
> > mistake that as a new fatal signal and fail.
> >
> > Fix: flush_signals() before kthread_run().
> >
> > Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
> > Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
> > Signed-off-by: Jens Axboe <axboe@fb.com>
> > ---
> > drivers/block/drbd/drbd_nl.c | 6 ++++++
> > 1 file changed, 6 insertions(+)
> >
> > diff --git a/drivers/block/drbd/drbd_nl.c b/drivers/block/drbd/drbd_nl.c
> > index 1b35c45..3f2e167 100644
> > --- a/drivers/block/drbd/drbd_nl.c
> > +++ b/drivers/block/drbd/drbd_nl.c
> > @@ -544,6 +544,12 @@ void conn_try_outdate_peer_async(struct drbd_connection *connection)
> > struct task_struct *opa;
> >
> > kref_get(&connection->kref);
> > + /* We may just have force_sig()'ed this thread
> > + * to get it out of some blocking network function.
> > + * Clear signals; otherwise kthread_run(), which internally uses
> > + * wait_on_completion_killable(), will mistake our pending signal
> > + * for a new fatal signal and fail. */
> > + flush_signals(current);
> > opa = kthread_run(_try_outdate_peer_async, connection, "drbd_async_h");
> > if (IS_ERR(opa)) {
> > drbd_err(connection, "out of mem, failed to invoke fence-peer helper\n");
>
> This doesn't apply to 3.16-stable or 3.14-stable, can you please provide
> a working backport?
There was a rename of "tconn" to "connection" between 3.14 and .15.
Other than that, this has not changed.
Below applies to 3.13 and 3.14 stable as of today.
Lars
8<----
From a82efa2adeb992b5ded798b01b4567bc07b6ab1b Mon Sep 17 00:00:00 2001
From: Lars Ellenberg <lars.ellenberg@linbit.com>
Date: Tue, 7 Oct 2014 17:20:27 +0200
Subject: [PATCH] drbd: fix regression 'out of mem, failed to invoke fence-peer
helper'
Stable info:
This patch landed in upstream with v3.16 as commit
bbc1c5e8ad6dfebf9d13b8a4ccdf66c92913eac9
it should go into v3.13+
Since linux kernel 3.13, kthread_run() internally uses
wait_for_completion_killable(). We sometimes may use kthread_run()
while we still have a signal pending, which we used to kick our threads
out of potentially blocking network functions, causing kthread_run() to
mistake that as a new fatal signal and fail.
Fix: flush_signals() before kthread_run().
Signed-off-by: Philipp Reisner <philipp.reisner@linbit.com>
Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
Signed-off-by: Jens Axboe <axboe@fb.com>
---
drivers/block/drbd/drbd_nl.c | 6 ++++++
1 file changed, 6 insertions(+)
diff --git a/drivers/block/drbd/drbd_nl.c b/drivers/block/drbd/drbd_nl.c
index c706d50..8c16c2f 100644
--- a/drivers/block/drbd/drbd_nl.c
+++ b/drivers/block/drbd/drbd_nl.c
@@ -525,6 +525,12 @@ void conn_try_outdate_peer_async(struct drbd_tconn *tconn)
struct task_struct *opa;
kref_get(&tconn->kref);
+ /* We may just have force_sig()'ed this thread
+ * to get it out of some blocking network function.
+ * Clear signals; otherwise kthread_run(), which internally uses
+ * wait_on_completion_killable(), will mistake our pending signal
+ * for a new fatal signal and fail. */
+ flush_signals(current);
opa = kthread_run(_try_outdate_peer_async, tconn, "drbd_async_h");
if (IS_ERR(opa)) {
conn_err(tconn, "out of mem, failed to invoke fence-peer helper\n");
--
1.9.1
next prev parent reply other threads:[~2014-10-07 15:33 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <026a6017e1b052f58cf908fc2f63aea7@de.mcbf.net>
2014-10-01 9:32 ` [Drbd-dev] [PATCH] drbd: fix regression 'out of mem, failed to invoke fence-peer helper' Philipp Reisner
2014-10-05 23:47 ` Greg KH
2014-10-07 15:33 ` Lars Ellenberg [this message]
2014-07-09 19:18 [Drbd-dev] [PATCH] Fix DRBD regression Philipp Reisner
2014-07-09 19:18 ` [Drbd-dev] [PATCH] drbd: fix regression 'out of mem, failed to invoke fence-peer helper' Philipp Reisner
2014-07-10 9:07 ` Jens Axboe
2014-07-10 9:53 ` Philipp Reisner
2014-07-10 9:55 ` Jens Axboe
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20141007153351.GH8574@soda.linbit \
--to=lars.ellenberg@linbit.com \
--cc=axboe@kernel.dk \
--cc=david@mcbf.net \
--cc=drbd-dev@lists.linbit.com \
--cc=gregkh@linuxfoundation.org \
--cc=philipp.reisner@linbit.com \
--cc=stable@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox