From: Thomas Gleixner <tglx@linutronix.de>
To: Andrew Morton <akpm@linux-foundation.org>
Cc: LKML <linux-kernel@vger.kernel.org>,
John Stultz <john.stultz@linaro.org>, Ingo Molnar <mingo@elte.hu>,
Ben Greear <greearb@candelatech.com>,
stable@kernel.org
Subject: Re: [patch 2/3] rtc: Fix hrtimer deadlock
Date: Sat, 23 Jul 2011 09:22:06 +0200 (CEST) [thread overview]
Message-ID: <alpine.LFD.2.02.1107230917160.2702@ionos> (raw)
In-Reply-To: <20110722151107.e1c9996d.akpm@linux-foundation.org>
On Fri, 22 Jul 2011, Andrew Morton wrote:
> On Fri, 22 Jul 2011 09:12:51 -0000
> Thomas Gleixner <tglx@linutronix.de> wrote:
>
> > Ben reported a lockup related to rtc. The lockup happens due to:
> >
> > CPU0 CPU1
> >
> > rtc_irq_set_state() __run_hrtimer()
> > spin_lock_irqsave(&rtc->irq_task_lock) rtc_handle_legacy_irq();
> > spin_lock(&rtc->irq_task_lock);
> > hrtimer_cancel()
> > while (callback_running);
> >
> > So the running callback never finishes as it's blocked on
> > rtc->irq_task_lock.
> >
> > Use hrtimer_try_to_cancel() instead and drop rtc->irq_task_lock while
> > waiting for the callback. Fix this for both rtc_irq_set_state() and
> > rtc_irq_set_freq().
> >
> > ...
> >
> > +static int rtc_update_hrtimer(struct rtc_device *rtc, int enabled)
> > +{
> > + /*
> > + * We unconditionally cancel the timer here, because otherwise
>
> The comment seems wrong. If hrtimer_try_to_cancel() fails, we simply
> bale out so we did not "unconditionally cancel the timer"?
Well, what I meant is that we cancel it before we start it. That's
required for self rearming timers. Will reword.
> > + * we could run into BUG_ON(timer->state != HRTIMER_STATE_CALLBACK);
> > + * when we manage to start the timer before the callback
> > + * returns HRTIMER_RESTART.
> > + *
> > + * We cannot use hrtimer_cancel() here as a running callback
> > + * could be blocked on rtc->irq_task_lock and hrtimer_cancel()
> > + * would spin forever.
> > + */
> > + if (hrtimer_try_to_cancel(&rtc->pie_timer) < 0)
> > + return -1;
> > +
> > + if (enabled) {
> > + ktime_t period = ktime_set(0, NSEC_PER_SEC / rtc->irq_freq);
> > +
> > + hrtimer_start(&rtc->pie_timer, period, HRTIMER_MODE_REL);
> > + }
> > + return 0;
> > +}
> > +
> > /**
> > * rtc_irq_set_state - enable/disable 2^N Hz periodic IRQs
> > * @rtc: the rtc device
> > @@ -651,24 +674,21 @@ int rtc_irq_set_state(struct rtc_device
> > int err = 0;
> > unsigned long flags;
> >
> > +retry:
> > spin_lock_irqsave(&rtc->irq_task_lock, flags);
> > if (rtc->irq_task != NULL && task == NULL)
> > err = -EBUSY;
> > if (rtc->irq_task != task)
> > err = -EACCES;
> > - if (err)
> > - goto out;
> > -
> > - if (enabled) {
> > - ktime_t period = ktime_set(0, NSEC_PER_SEC/rtc->irq_freq);
> > - hrtimer_start(&rtc->pie_timer, period, HRTIMER_MODE_REL);
> > - } else {
> > - hrtimer_cancel(&rtc->pie_timer);
> > + if (!err) {
> > + if (rtc_update_hrtimer(rtc, enabled) < 0) {
> > + spin_unlock_irqrestore(&rtc->irq_task_lock, flags);
> > + cpu_relax();
> > + goto retry;
> > + }
> > + rtc->pie_enabled = enabled;
>
> Well this is rather nasty. Sort of an open-coded expensive spinlock.
> All rather pointless on SMP=n builds, too.
>
> Is there no better way, such as fixing up the locking properly?
Probably there is, but that requires a rather large patch and a
complete locking rewrite, nothing you want to push back into
stable. And we want this as the deadlock has been observed and
reported already.
next prev parent reply other threads:[~2011-07-23 7:22 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-07-22 9:12 [patch 0/3] rtc: Assorted bug fixes Thomas Gleixner
2011-07-22 9:12 ` [patch 1/3] rtc: Handle errors correctly in rtc_irq_set_state() Thomas Gleixner
2011-07-22 22:04 ` Andrew Morton
2011-07-23 7:15 ` Thomas Gleixner
2011-07-22 9:12 ` [patch 2/3] rtc: Fix hrtimer deadlock Thomas Gleixner
2011-07-22 22:11 ` Andrew Morton
2011-07-23 7:22 ` Thomas Gleixner [this message]
2011-07-22 9:12 ` [patch 3/3] rtc: Limit frequency Thomas Gleixner
2011-07-22 22:05 ` Andrew Morton
2011-07-22 22:39 ` [stable] " Willy Tarreau
2011-08-05 3:39 ` Joshua Kinard
2011-08-05 9:04 ` John Stultz
2011-08-06 7:28 ` Joshua Kinard
2011-07-23 7:17 ` Thomas Gleixner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=alpine.LFD.2.02.1107230917160.2702@ionos \
--to=tglx@linutronix.de \
--cc=akpm@linux-foundation.org \
--cc=greearb@candelatech.com \
--cc=john.stultz@linaro.org \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@elte.hu \
--cc=stable@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox