From: takahiro.akashi@linaro.org (AKASHI Takahiro)
To: linux-arm-kernel@lists.infradead.org
Subject: [RFC] arm64: ftrace with regs for livepatch support
Date: Wed, 20 Jan 2016 12:12:04 +0900 [thread overview]
Message-ID: <569EFB04.6070807@linaro.org> (raw)
In-Reply-To: <569EE216.6030000@huawei.com>
Li,
On 01/20/2016 10:25 AM, Li Bin wrote:
> Hi Takahiro,
> Thanks for your reply firstly.
>
> on 2016/1/19 16:28, AKASHI Takahiro wrote:
>>>> 1) instruction sequence
>>>> Unlike x86, we have to preserve link register(x30) explicitly on arm64 since
>>>> a ftrace help function will be invoked before a function prologue. so we
>>>> need a few, not one, instructions here. Two possible ways:
>>>>
>>>> (a) stp x29, x30, [sp, #-16]!
>>>> mov x29, sp
>>>> bl <mcount>
>>>> ldp x29, x30, [sp], #16
>>>> <function prologue>
>>>> ...
>>>>
>>>> (b) mov x9, x30
>>>> bl <mcount>
>>>> mov x30, x9
>>>> <function prologue>
>>>> ...
>>>>
>>>> (a) complies with a normal calling convention.
>>>> (b) is Li Bin's idea in his old patch. While (b) can save some memory
>>>> accesses by using a scratch register(x9 in this example), we have no way
>>>> to recover an actual value for this register.
>>>>
>>>> Q#1. Which approach should we take here?
>>>>
>>>>
>>>> 2) replacing an instruction sequence
>>>> (This issue is orthogonal to Q#1.)
>>>>
>>>> Replacing can happen anytime, so we have to do it (without any locking) in
>>>> such a safe way that any task either calls a helper or doesn't call it, but
>>>> never runs in any intermediate state.
>>>>
>>>> Again here, two possible ways:
>>>>
>>>> (a) initialize the code in the shape of (A') at boot time,
>>>> (B) -> (B') -> (A')
>>>> then switching to (A) or (A')
>>>> (b) take a few steps each time. For example,
>>>> to enable tracing,
>>>> (B) -> (B') -> (A') -> (A)
>>>> to disable tracing,
>>>> (A) -> (A') -> (B') -> (A)
>>>> Obviously, we need cache flushing/invalidation and barriers between.
>>>>
>>>> (A) (A')
>>>> stp x29, x30, [sp, #-16]! b 1f
>>>> mov x29, sp mov x29, sp
>>>> bl <_mcount> bl <_mcount>
>>>> ldp x29, x30, [sp], #16 ld x29, x30, [sp], #16
>>>> 1:
>>>> <function prologue>
>>>> <function body>
>>>> ...
>>>>
>>>> (B) (B')
>>>> nop b 1f
>>>> nop nop
>>>> nop nop
>>>> nop nop
>>>> 1:
>>>> <function prologue>
>>>> <function body>
>>>> ...
>>>>
>>>
>>> Hi takahiro,
>>> This method can not guarantee the correctness of replacing the multi instrucions
>>> from (A') to (B') or from (B') to (A'), even if under kstop_machine especially for
>>> preemptable kernel or NMI context (which will be supported on arm64 in future).
>>> Right?
>>
>> You seem to be right.
>> I thought that we could use aarch64_insn_patch_text() here, but
>> it doesn't ensure any atomicity of replacement.
>> Switching from (A') to (A) or (A) to (A') can be used instead,
>> but the performance penalty will not be acceptable.
>>
>> Why does your livepatch with -mfentry work?
>>
>
> For my method with -mfentry, the instruction replacement for converting
> nops to ftrace calls or back is as following,
>
> (A) mov x9, x30 (A') mov x9, x30
> nop <--------------> bl <__fentry__>
> mov x30, x9 mov x30, x9
> <function prologue> <function prologue>
>
> so it is compatible with the current recordmcount and ftrace logic, and the
> only effect is that introducing two extra low cost mov instruction.
Last night I thought this issue and reached almost the same conclusion :)
The only other way would be to forcedly suppress gcc's instruction
scheduling in a function prologue and generate an always-the-same prologue.
But this will be unlikely.
(requiring pt_regs for livepatch is just too much.)
Other than a performance impact (I'm still not sure about it),
we might have a problem *with -fprolog-add=N* when the kernel sets up this
multiple-instructions sequence in each traceable function at boot time because
we have no way to do so transparently. I will check the ftrace code.
Thanks,
-Takahiro AKASHI
> Thanks,
> Li Bin
>
>> -Takahiro AKASHI
>>
>>> Thanks,
>>> Li Bin
>>>
>>>> (a) is much simpler, but (b) has less performance penalty(?) when tracing
>>>> is disabled. I'm afraid that I might simplify the issue too much.
>>>>
>>>> Q#2. Which one is more preferable?
>>>>
>>>>
>>>> [1] https://gcc.gnu.org/ml/gcc/2015-05/msg00267.html, and
>>>> https://gcc.gnu.org/ml/gcc/2015-10/msg00090.html
>>>>
>>>>
>>>> Thanks,
>>>> -Takahiro AKASHI
>>>>
>>>> .
>>>>
>
>
next prev parent reply other threads:[~2016-01-20 3:12 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-11-20 11:47 [RFC] arm64: ftrace with regs for livepatch support AKASHI Takahiro
2015-12-14 8:22 ` Li Bin
2015-12-14 15:56 ` Steven Rostedt
2015-12-26 9:28 ` Li Bin
2016-01-19 8:28 ` AKASHI Takahiro
2016-01-20 1:25 ` Li Bin
2016-01-20 3:12 ` AKASHI Takahiro [this message]
2016-01-25 8:56 ` Li Bin
2016-01-26 8:07 ` AKASHI Takahiro
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=569EFB04.6070807@linaro.org \
--to=takahiro.akashi@linaro.org \
--cc=linux-arm-kernel@lists.infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).