From: Ingo Molnar <mingo@elte.hu>
To: Jeff Mahoney <jeffm@suse.com>
Cc: Jiri Kosina <jkosina@suse.cz>,
Peter Zijlstra <peterz@infradead.org>,
Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
Tony Luck <tony.luck@intel.com>,
Fenghua Yu <fenghua.yu@intel.com>,
linux-ia64@vger.kernel.org,
Linus Torvalds <torvalds@linux-foundation.org>
Subject: Re: Commit 34d76c41 causes linker errors on ia64 with NR_CPUS=4096
Date: Tue, 20 Oct 2009 15:43:08 +0200
Message-ID: <20091020134308.GA3930@elte.hu>
In-Reply-To: <4ADDB640.4020707@suse.com>

* Jeff Mahoney <jeffm@suse.com> wrote:
> On 10/20/2009 02:35 AM, Ingo Molnar wrote:
> >
> > * Jiri Kosina <jkosina@suse.cz> wrote:
> >
> >> On Tue, 20 Oct 2009, Ingo Molnar wrote:
> >>
> >>>> Commit 34d76c41 introduced percpu array update_shares_data, size of which
> >>>> being proportional to NR_CPUS. Unfortunately this blows up ia64 for large
> >>>> NR_CPUS configuration, as ia64 allows only 64k for .percpu section.
> >>>>
> >>>> Fix this by allocating this array dynamically and keep only pointer to it
> >>>> percpu.
> >>>>
> >>>> Signed-off-by: Jiri Kosina <jkosina@suse.cz>
> >>>> ---
> >>>> kernel/sched.c | 15 +++++++--------
> >>>> 1 files changed, 7 insertions(+), 8 deletions(-)
> >>>
> >>> Seems like an IA64 bug to me.
> >>
> >> IA64 guys actually use that as some kind of optimization for fast
> >> access to the percpu data in their pagefault handler, as far as I
> >> know.
> >
> > Still looks like a bug if it causes a breakage (linker error) on IA64,
> > and if the 'fix' (i'd call it a workaround) causes a (small but nonzero)
> > performance regression on other architectures.
>
> The linker error isn't a bug, it's enforcement. The ia64 linker script
> explicitly rewinds the location pointer back to the start of
> .data.percpu + 64k to start the .data section to cause the error if
> .data.percpu is larger than 64k.

Since every other SMP architecture manages to support more than 64K of
percpu data, this is clearly an ugly, self-inflicted limitation of IA64
that has now escalated into a link failure.

Now, 34d76c41 could certainly be improved in a way that works around the
IA64 problem too: we can allocate the data dynamically as long as the
proper percpu allocator is used (not kmalloc as in the patch in this
thread). But arguing that the current IA64 64K limit behavior is
anything but very broken is rather shortsighted.
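
To put rough numbers on the above, here is a back-of-the-envelope sketch
in plain user-space C (not kernel code). It assumes the static array holds
one 8-byte unsigned long per possible CPU; that element size is an
assumption, not something stated in this thread. The function and macro
names are made up for illustration. The 64K figure is the ia64 .percpu
limit quoted above.

```c
#include <stddef.h>

/* ia64 budget for the static .percpu section, as quoted in this thread. */
#define IA64_PERCPU_LIMIT (64UL * 1024UL)

/*
 * Static percpu footprint of an array with one slot per possible CPU.
 * elem_size is an assumption supplied by the caller (e.g. 8 bytes).
 */
size_t static_array_bytes(size_t nr_cpus, size_t elem_size)
{
	return nr_cpus * elem_size;
}

/*
 * Static percpu footprint when the array itself is allocated dynamically
 * and only a pointer to it is kept in the static percpu area.
 */
size_t pointer_only_bytes(void)
{
	return sizeof(void *);
}

/* Share of the 64K budget consumed, in whole percent (rounded down). */
unsigned int budget_percent(size_t bytes)
{
	return (unsigned int)(bytes * 100 / IA64_PERCPU_LIMIT);
}
```

Under these assumptions, static_array_bytes(4096, 8) is 32768 bytes, which
is half of the 64K budget for this one array, while the pointer-only
variant costs just sizeof(void *) bytes of static percpu space.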

IA64 should be fixed really - we can cross the 64K percpu data limit
any time we add a few more pages of per-cpu data to the kernel -
the scheduler just happened to be the one to cross it this time.

The scheduler change in 34d76c41 was done two months ago and has
been upstream for a month, so this complaint is rather late and at
minimum a certain degree of honesty about the situation is warranted.

Saying that all static percpu data must be below 64K, which will only be
noticed once IA64 gets its testing act together months after it's been
created, is silly. If you want to enforce such a limit, make it testable
in a _timely_ fashion. Or fix the limit really.

	Ingo