From: Frank Rowand <frowand.list@gmail.com>
To: Rob Herring <robh@kernel.org>
Cc: Pantelis Antoniou <pantelis.antoniou@konsulko.com>,
David Airlie <airlied@linux.ie>, Jyri Sarha <jsarha@ti.com>,
"devicetree@vger.kernel.org" <devicetree@vger.kernel.org>,
"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
Mark Rutland <mark.rutland@arm.com>,
Tomi Valkeinen <tomi.valkeinen@ti.com>,
dri-devel <dri-devel@lists.freedesktop.org>
Subject: Re: [PATCH 09/12] of: overlay: avoid race condition between applying multiple overlays
Date: Tue, 10 Oct 2017 17:53:20 -0700 [thread overview]
Message-ID: <59DD6B80.8050204@gmail.com> (raw)
In-Reply-To: <CAL_JsqK8cxBk754UBziGOTMx035C8P5ehPvENE0=-TNHqmTCbg@mail.gmail.com>
On 10/10/17 14:06, Rob Herring wrote:
> On Tue, Oct 10, 2017 at 2:39 PM, Frank Rowand <frowand.list@gmail.com> wrote:
>> On 10/10/17 11:40, Rob Herring wrote:
>>> On Wed, Oct 04, 2017 at 08:29:59PM -0700, Frank Rowand wrote:
>>>> On 10/04/17 08:19, Rob Herring wrote:
>>>>> On Mon, Oct 2, 2017 at 10:53 PM, <frowand.list@gmail.com> wrote:
>>>>>> From: Frank Rowand <frank.rowand@sony.com>
>>>>>>
>>>>>> The process of applying an overlay consists of:
>>>>>> - unflatten an overlay FDT (flattened device tree) into an
>>>>>> EDT (expanded device tree)
>>>>>> - fixup the phandle values in the overlay EDT to fit in a
>>>>>> range above the phandle values in the live device tree
>>>>>> - create the overlay changeset to reflect the contents of
>>>>>> the overlay EDT
>>>>>> - apply the overlay changeset, to modify the live device tree,
>>>>>> potentially changing the maximum phandle value in the live
>>>>>> device tree
>>>>>>
>>>>>> There is currently no protection against two overlay applies
>>>>>> concurrently determining what range of phandle values are in use
>>>>>> in the live device tree, and subsequently changing that range.
>>>>>> Add a mutex to prevent multiple overlay applies from occurring
>>>>>> simultaneously.
>>>>>>
>>>>>> Ignoring 2 checkpatch warnings: Prefer using '"%s...", __func__'
>>>>>> so that the WARN() string will be more easily grepped.
>>>>>>
>>>>>> Signed-off-by: Frank Rowand <frank.rowand@sony.com>
>>>>>> ---
>>>>>> drivers/gpu/drm/tilcdc/tilcdc_slave_compat.c | 7 +++++++
>>>>>> drivers/of/overlay.c | 22 ++++++++++++++++++++++
>>>>>> drivers/of/unittest.c | 21 +++++++++++++++++++++
>>>>>> include/linux/of.h | 19 +++++++++++++++++++
>>>>>> 4 files changed, 69 insertions(+)
>>>>>>
>>>>>> diff --git a/drivers/gpu/drm/tilcdc/tilcdc_slave_compat.c b/drivers/gpu/drm/tilcdc/tilcdc_slave_compat.c
>>>>>> index 7a7be0515bfd..c99f7924b1c6 100644
>>>>>> --- a/drivers/gpu/drm/tilcdc/tilcdc_slave_compat.c
>>>>>> +++ b/drivers/gpu/drm/tilcdc/tilcdc_slave_compat.c
>>>>>> @@ -221,6 +221,11 @@ static void __init tilcdc_convert_slave_node(void)
>>>>>> goto out;
>>>>>> }
>>>>>>
>>>>>> + /*
>>>>>> + * protect from of_resolve_phandles() through of_overlay_apply()
>>>>>> + */
>>>>>> + of_overlay_mutex_lock();
>>>>>> +
>>>>>
>>>>> We can't be relying on callers to get the locking right...
>>>>
>>>> Agreed.
>>>>
>>>>
>>>>>
>>>>>> overlay = tilcdc_get_overlay(&kft);
>>>>>> if (!overlay)
>>>>>> goto out;
>>>>>> @@ -256,6 +261,8 @@ static void __init tilcdc_convert_slave_node(void)
>>>>>> pr_info("%s: ti,tilcdc,slave node successfully converted\n",
>>>>>> __func__);
>>>>>> out:
>>>>>> + of_overlay_mutex_unlock();
>>>>>> +
>>>>>> kfree_table_free(&kft);
>>>>>> of_node_put(i2c);
>>>>>> of_node_put(slave);
>>>>>> diff --git a/drivers/of/overlay.c b/drivers/of/overlay.c
>>>>>> index a0d3222febdc..4ed372af6ce7 100644
>>>>>> --- a/drivers/of/overlay.c
>>>>>> +++ b/drivers/of/overlay.c
>>>>>> @@ -71,6 +71,28 @@ static int build_changeset_next_level(struct overlay_changeset *ovcs,
>>>>>> const struct device_node *overlay_node,
>>>>>> bool is_symbols_node);
>>>>>>
>>>>>> +/*
>>>>>> + * of_resolve_phandles() finds the largest phandle in the live tree.
>>>>>> + * of_overlay_apply() may add a larger phandle to the live tree.
>>>>>> + * Do not allow race between two overlays being applied simultaneously:
>>>>>> + * mutex_lock(&of_overlay_phandle_mutex)
>>>>>> + * of_resolve_phandles()
>>>>>> + * of_overlay_apply()
>>>>>> + * mutex_unlock(&of_overlay_phandle_mutex)
>>>>>
>>>>> Why do these need to be separate functions? I think I mentioned it
>>>>> before, but essentially overlay_data_add() should be part of the
>>>>> overlay API. We may need to allow for callers to do each step, but
>>>>> generally I think the interface should just be "apply this fdt blob".
>>>>
>>>> Yes, that is where I want to end up.
>>>
>>> So, is that not doable now? To put it another way, why does
>>> of_resolve_phandles need to be a separate call? Seems like an internal
>>> detail of how you apply an overlay to me.
>>>
>>> Rob
>>
>> Yes, of_resolve_phandles() should become an internal call made from
>> the "apply this fdt blob" function.
>
> I mean just moving of_resolve_phandles into of_overlay_apply. Not the
> unflattening too.
OK, I can do that. I'll send another patch that shows what is involved.
>> The biggest obstacle is drivers/gpu/drm/tilcdc/tilcdc_slave_compat.c
>> using overlays in a convoluted manner. The second obstacle will
>> probably be the older overlay tests in drivers/of/unittest.c. I
>> need to look at how to convert them to using actual overlays.
>>
>> There are other fixes and improvements to the overlay code that
>> need to occur, but it is like pulling on a loose thread in a
>> sweater - it just goes on and on. I'd like to get this set of
>> patches in, with whatever changes are absolutely essential,
>> then continue on with more patch sets. This code will be
>> much easier for me to modify in the future if this patch set
>> is applied.
>
> AFAICT, I don't think anything between of_resolve_phandles and
> of_overlay_apply calls in tilcdc depends on the phandles being fixed
> up.
I started looking at that, then decided that it wasn't worth the
aggravation at the moment. That use of overlays is fragile and
really needs to go away. With this latest change on top of all
the others, someone who has that hardware really needs to test
the patches to make sure nothing broke.
> And for the unittests that don't call of_resolve_phandles, would
> there be any side effect of calling of_resolve_phandles?
Yes. The old style overlay tests do not use true overlays. Those
tests were created before dtc was able to handle overlays, so they
are somewhat hand crafted.
This has two results. First, the subtrees that are passed to
of_overlay_apply() are deeper in the tree than a true overlay
would be. Thus they do not contain the __local_fixups__ node
that would be in a true overlay. The __local_fixups__ node
is at the root level of the testcases.dtb tree, and fixups
for the entire tree are done in one step, instead of fixing
up each overlay separately. So when of_resolve_phandles()
is called on each overlay tree, there is no __local_fixups__
node visible, and a second fixup is not attempted. The
second result is that the overlay trees are not detached,
so I temporarily disabled one the detached check in
of_resolve_phandles() so overlay_apply() will not fail
for these overlays.
I'll have to rework the old style overlay tests to use true
overlays soon. But that is another activity that I would
like to do as a subsequent step to this patch set.
-Frank
next prev parent reply other threads:[~2017-10-11 0:53 UTC|newest]
Thread overview: 27+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-10-03 3:53 [PATCH 00/12] of: overlay: clean up device tree overlay code frowand.list-Re5JQEeQqe8AvxtiuMwx3w
2017-10-03 3:53 ` [PATCH 01/12] of: overlay.c: Remove comments that state the obvious, to reduce clutter frowand.list
2017-10-03 3:53 ` [PATCH 02/12] of: overlay.c: Convert comparisons to zero or NULL to logical expressions frowand.list
2017-10-03 3:53 ` [PATCH 04/12] of: overlay: rename identifiers in dup_and_fixup_symbol_prop() frowand.list
2017-10-03 3:53 ` [PATCH 05/12] of: overlay: minor restructuring frowand.list
2017-10-03 3:53 ` [PATCH 06/12] of: overlay: detect cases where device tree may become corrupt frowand.list
[not found] ` <1507002826-16393-1-git-send-email-frowand.list-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
2017-10-03 3:53 ` [PATCH 03/12] of: overlay: rename identifiers to more reflect what they do frowand.list-Re5JQEeQqe8AvxtiuMwx3w
2017-10-03 3:53 ` [PATCH 07/12] of: overlay: expand check of whether overlay changeset can be removed frowand.list-Re5JQEeQqe8AvxtiuMwx3w
2017-10-03 3:53 ` [PATCH 10/12] of: overlay: simplify applying symbols from an overlay frowand.list-Re5JQEeQqe8AvxtiuMwx3w
2017-10-04 14:56 ` [PATCH 00/12] of: overlay: clean up device tree overlay code Rob Herring
2017-10-05 6:53 ` Tomi Valkeinen
[not found] ` <3eac16b0-fdd9-fd6c-e1ca-e0de49ce27cb-l0cyMroinI0@public.gmane.org>
2017-10-05 8:33 ` Jyri Sarha
2017-10-17 8:43 ` Jyri Sarha
2017-10-05 14:46 ` Rob Herring
2017-10-03 3:53 ` [PATCH 08/12] of: overlay: loosen overly strict phandle clash check frowand.list
2017-10-03 3:53 ` [PATCH 09/12] of: overlay: avoid race condition between applying multiple overlays frowand.list
2017-10-04 15:19 ` Rob Herring
2017-10-05 3:29 ` Frank Rowand
2017-10-10 18:40 ` Rob Herring
2017-10-10 19:39 ` Frank Rowand
2017-10-10 21:06 ` Rob Herring
2017-10-11 0:53 ` Frank Rowand [this message]
[not found] ` <CAL_JsqK8cxBk754UBziGOTMx035C8P5ehPvENE0=-TNHqmTCbg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2017-10-11 1:41 ` Frank Rowand
2017-10-03 3:53 ` [PATCH 11/12] of: overlay: remove a dependency on device node full_name frowand.list
2017-10-03 3:53 ` [PATCH 12/12] of: overlay: remove unneeded check for NULL kbasename() frowand.list
2017-10-13 22:03 ` [PATCH 00/12] of: overlay: clean up device tree overlay code Frank Rowand
2017-10-16 20:55 ` Rob Herring
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=59DD6B80.8050204@gmail.com \
--to=frowand.list@gmail.com \
--cc=airlied@linux.ie \
--cc=devicetree@vger.kernel.org \
--cc=dri-devel@lists.freedesktop.org \
--cc=jsarha@ti.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mark.rutland@arm.com \
--cc=pantelis.antoniou@konsulko.com \
--cc=robh@kernel.org \
--cc=tomi.valkeinen@ti.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).