From: Ying Xue <ying.xue@windriver.com>
To: Thomas Graf <tgraf@suug.ch>
Cc: <davem@davemloft.net>, <netdev@vger.kernel.org>
Subject: Re: [PATCH net-next] rhashtable: unnecessary to use delayed work
Date: Tue, 13 Jan 2015 17:48:54 +0800 [thread overview]
Message-ID: <54B4EA06.8010507@windriver.com> (raw)
In-Reply-To: <20150113093550.GG20387@casper.infradead.org>
On 01/13/2015 05:35 PM, Thomas Graf wrote:
> On 01/13/15 at 05:00pm, Ying Xue wrote:
>> When we put our declared work task in the global workqueue with
>> schedule_delayed_work(), its delay parameter is always zero.
>> Therefore, we should define a normal work in rhashtable structure
>> instead of a delayed work.
>>
>> Signed-off-by: Ying Xue <ying.xue@windriver.com>
>> Cc: Thomas Graf <tgraf@suug.ch>
>
>> @@ -914,7 +914,7 @@ void rhashtable_destroy(struct rhashtable *ht)
>>
>> mutex_lock(&ht->mutex);
>>
>> - cancel_delayed_work(&ht->run_work);
>> + cancel_work_sync(&ht->run_work);
>> bucket_table_free(rht_dereference(ht->tbl, ht));
>>
>> mutex_unlock(&ht->mutex);
>
> I like the patch!
>
> I think it introduces a possible dead lock though (see below). OTOH, it
> could actually explain the reason for the 0day lock debug splash that
> was reported.
>
> Dead lock: The worker could already have been kicked off but was
> interrupted before it acquired ht->mutex. rhashtable_destroy() is
> called and acquired ht->mutex. cancel_work_sync() waits for worker to
> finish while holding ht->mutex. Worker can't finish because it needs to
> acquire ht->mutex to do so.
>
> For the very same reason the reported warning could have been triggered.
> Instead of the dead lock, it would have called bucket_table_free()
> with a deferred resizer still underway.
>
> What about we do something like this?
>
> void rhashtable_destroy(struct rhashtable *ht)
> {
> ht->being_destroyed = true;
> cancel_work_sync(&ht->run_work);
>
> mutex_lock(&ht->mutex);
> bucket_table_free(rht_dereference(ht->tbl, ht));
> mutex_unlock(&ht->mutex);
> }
>
Damn! I knew your above described deadlock scenario. Thank you for the
nice catch!
> If you agree we can explain this shortly in the commit message and add:
> Fixes: 97defe1 ("rhashtable: Per bucket locks & deferred expansion/shrinking")
>
OK, I will deliver the next version.
By the way, I think we should check the following condition before call
cancel_work_sync(), otherwise, we may cancel an uninitialized work.
(ht->p.grow_decision || ht->p.shrink_decision)
What do you think?
Regards,
Ying
>
next prev parent reply other threads:[~2015-01-13 9:49 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-01-13 9:00 [PATCH net-next] rhashtable: unnecessary to use delayed work Ying Xue
2015-01-13 9:35 ` Thomas Graf
2015-01-13 9:48 ` Ying Xue [this message]
2015-01-13 11:26 ` Thomas Graf
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=54B4EA06.8010507@windriver.com \
--to=ying.xue@windriver.com \
--cc=davem@davemloft.net \
--cc=netdev@vger.kernel.org \
--cc=tgraf@suug.ch \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.