From: Song Liu <songliubraving@fb.com>
To: Peter Zijlstra <peterz@infradead.org>
Cc: Networking <netdev@vger.kernel.org>,
Kernel Team <Kernel-team@fb.com>,
Alexei Starovoitov <ast@kernel.org>,
Daniel Borkmann <daniel@iogearbox.net>
Subject: Re: [PATCH bpf-next 1/2] bpf: enable stackmap with build_id in nmi context
Date: Wed, 2 May 2018 16:48:32 +0000 [thread overview]
Message-ID: <30F32ACF-5155-459B-BD47-5060CCA52788@fb.com> (raw)
In-Reply-To: <20180502092109.GI12180@hirez.programming.kicks-ass.net>
> On May 2, 2018, at 2:21 AM, Peter Zijlstra <peterz@infradead.org> wrote:
>
> On Tue, May 01, 2018 at 05:02:19PM -0700, Song Liu wrote:
>> @@ -267,17 +285,27 @@ static void stack_map_get_build_id_offset(struct bpf_stack_build_id *id_offs,
>> {
>> int i;
>> struct vm_area_struct *vma;
>> + bool in_nmi_ctx = in_nmi();
>> + bool irq_work_busy = false;
>> + struct stack_map_irq_work *work;
>> +
>> + if (in_nmi_ctx) {
>> + work = this_cpu_ptr(&irq_work);
>> + if (work->sem)
>> + /* cannot queue more up_read, fallback */
>> + irq_work_busy = true;
>> + }
>>
>> /*
>> + * We cannot do up_read() in nmi context. To do build_id lookup
>> + * in nmi context, we need to run up_read() in irq_work. We use
>> + * a percpu variable to do the irq_work. If the irq_work is
>> + * already used by another lookup, we fall back to report ips.
>> *
>> * Same fallback is used for kernel stack (!user) on a stackmap
>> * with build_id.
>> */
>> + if (!user || !current || !current->mm || irq_work_busy ||
>> down_read_trylock(¤t->mm->mmap_sem) == 0) {
>> /* cannot access current->mm, fall back to ips */
>> for (i = 0; i < trace_nr; i++) {
>> @@ -299,7 +327,13 @@ static void stack_map_get_build_id_offset(struct bpf_stack_build_id *id_offs,
>> - vma->vm_start;
>> id_offs[i].status = BPF_STACK_BUILD_ID_VALID;
>> }
>> +
>> + if (!in_nmi_ctx)
>> + up_read(¤t->mm->mmap_sem);
>> + else {
>> + work->sem = ¤t->mm->mmap_sem;
>> + irq_work_queue(&work->work);
>> + }
>> }
>
> This is disguisting.. :-)
>
> It's broken though, I've bet you've never actually ran this with lockdep
> enabled for example.
I am not following here. I just run the new selftest with CONFIG_LOCKDEP on,
and got no warning for this.
> Also, you set work->sem before you do trylock, if the trylock fails you
> return early and keep work->sem set, which will thereafter always result
> in irq_work_busy.
work->sem was set after down_read_trylock(). I guess you misread the patch?
Thanks,
Song
next prev parent reply other threads:[~2018-05-02 16:49 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-05-02 0:02 [PATCH bpf-next 1/2] bpf: enable stackmap with build_id in nmi context Song Liu
2018-05-02 0:02 ` [PATCH bpf-next 2/2] bpf: add selftest for stackmap with build_id in NMI context Song Liu
2018-05-02 3:34 ` [RFC PATCH] bpf: __pcpu_scope_irq_work can be static kbuild test robot
2018-05-02 3:34 ` [PATCH bpf-next 1/2] bpf: enable stackmap with build_id in nmi context kbuild test robot
2018-05-02 9:21 ` Peter Zijlstra
2018-05-02 16:48 ` Song Liu [this message]
2018-05-02 17:30 ` Peter Zijlstra
2018-05-02 17:57 ` Song Liu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=30F32ACF-5155-459B-BD47-5060CCA52788@fb.com \
--to=songliubraving@fb.com \
--cc=Kernel-team@fb.com \
--cc=ast@kernel.org \
--cc=daniel@iogearbox.net \
--cc=netdev@vger.kernel.org \
--cc=peterz@infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox