From: Yury Norov <yury.norov@gmail.com>
To: "Lameter, Christopher" <cl@os.amperecomputing.com>
Cc: Huang Shijie <shijie@os.amperecomputing.com>,
gregkh@linuxfoundation.org, patches@amperecomputing.com,
rafael@kernel.org, paul.walmsley@sifive.com, palmer@dabbelt.com,
aou@eecs.berkeley.edu, kuba@kernel.org, vschneid@redhat.com,
mingo@kernel.org, akpm@linux-foundation.org, vbabka@suse.cz,
rppt@kernel.org, tglx@linutronix.de, jpoimboe@kernel.org,
ndesaulniers@google.com, mikelley@microsoft.com,
mhiramat@kernel.org, arnd@arndb.de, linux-kernel@vger.kernel.org,
linux-riscv@lists.infradead.org,
linux-arm-kernel@lists.infradead.org, catalin.marinas@arm.com,
will@kernel.org, mark.rutland@arm.com, mpe@ellerman.id.au,
linuxppc-dev@lists.ozlabs.org, chenhuacai@kernel.org,
jiaxun.yang@flygoat.com, linux-mips@vger.kernel.org
Subject: Re: [PATCH v2] NUMA: Early use of cpu_to_node() returns 0 instead of the correct node id
Date: Wed, 24 Jan 2024 09:41:18 -0800
Message-ID: <ZbFLvnMQ3wsQ0pIF@yury-ThinkPad>
In-Reply-To: <4a13353c-cf4b-a388-5776-389c61c63ec0@os.amperecomputing.com>

On Wed, Jan 24, 2024 at 09:19:00AM -0800, Lameter, Christopher wrote:
> On Tue, 23 Jan 2024, Huang Shijie wrote:
>
> > During the kernel booting, the generic cpu_to_node() is called too early in
> > arm64, powerpc and riscv when CONFIG_NUMA is enabled.
> >
> > For arm64/powerpc/riscv, there are at least four places in the common code
> > where the generic cpu_to_node() is called before it is initialized:
> > 1.) early_trace_init() in kernel/trace/trace.c
> > 2.) sched_init() in kernel/sched/core.c
> > 3.) init_sched_fair_class() in kernel/sched/fair.c
> > 4.) workqueue_init_early() in kernel/workqueue.c
> >
> > In order to fix the bug, the patch changes generic cpu_to_node to
> > function pointer, and export it for kernel modules.
> > Introduce smp_prepare_boot_cpu_start() to wrap the original
> > smp_prepare_boot_cpu(), and set cpu_to_node with early_cpu_to_node.
> > Introduce smp_prepare_cpus_done() to wrap the original smp_prepare_cpus(),
> > and set the cpu_to_node to formal _cpu_to_node().
>
> Would you please fix this cleanly without a function pointer?
>
> What I think needs to be done is a patch series.
>
> 1. Instrument cpu_to_node so that some warning is issued if it is used too
> early. Preloading the array with NUMA_NO_NODE would allow us to do that.

By preloading, do you mean compile-time initialization?

> 2. Implement early_cpu_to_node on platforms that currently do not have it.
>
> 3. A series of patches that fix each place where cpu_to_node is used too
> early.

Agreed. This is the right way to go. And pretty much all of it was discussed
in v1, wasn't it?

Thanks,
Yury
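
As a rough illustration of the approach discussed above, here is a user-space
sketch in C. It is not kernel code and not the patch under review. It shows a
CPU-to-node table preloaded at compile time with NUMA_NO_NODE, a lookup that
warns when it is called before the table is populated, and a separate early
lookup. Every sketch_* name, the four-CPU table size, and the firmware mapping
are assumptions made up for the example; only NUMA_NO_NODE (defined as -1 in
the kernel) and the general idea come from the thread.

```c
#include <assert.h>
#include <stdio.h>

#define NUMA_NO_NODE   (-1)  /* same value the kernel uses */
#define SKETCH_NR_CPUS 4     /* hypothetical, kept small for illustration */

/* Compile-time preload: every slot starts as NUMA_NO_NODE rather than 0. */
static int sketch_cpu_node_map[SKETCH_NR_CPUS] = {
	NUMA_NO_NODE, NUMA_NO_NODE, NUMA_NO_NODE, NUMA_NO_NODE,
};

/* Hypothetical stand-in for topology data that is available early in boot. */
static const int sketch_firmware_node[SKETCH_NR_CPUS] = { 0, 0, 1, 1 };

/* Counts how many lookups happened before the map was populated. */
static int sketch_early_warnings;

/* Early lookup: reads the early topology data directly. */
static int sketch_early_cpu_to_node(int cpu)
{
	assert(cpu >= 0 && cpu < SKETCH_NR_CPUS);
	return sketch_firmware_node[cpu];
}

/* Regular lookup: warns if the map still holds the preloaded value. */
static int sketch_cpu_to_node(int cpu)
{
	assert(cpu >= 0 && cpu < SKETCH_NR_CPUS);

	int node = sketch_cpu_node_map[cpu];

	if (node == NUMA_NO_NODE) {
		sketch_early_warnings++;
		fprintf(stderr, "cpu_to_node(%d) used too early\n", cpu);
	}
	return node;
}

/* Later initialization step: publish the real mapping. */
static void sketch_numa_init(void)
{
	for (int cpu = 0; cpu < SKETCH_NR_CPUS; cpu++)
		sketch_cpu_node_map[cpu] = sketch_early_cpu_to_node(cpu);
}
```

In this sketch, calling sketch_cpu_to_node() before sketch_numa_init() returns
NUMA_NO_NODE and prints a warning instead of silently returning node 0, while
sketch_early_cpu_to_node() gives the right answer at any point. Each too-early
caller would then be switched to the early variant, one patch at a time.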