From: James Bottomley <James.Bottomley@HansenPartnership.com>
To: Thomas Gleixner <tglx@linutronix.de>,
Vlastimil Babka <vbabka@suse.cz>,
Linus Torvalds <torvalds@linux-foundation.org>,
Guenter Roeck <linux@roeck-us.net>
Cc: linux-kernel@vger.kernel.org, Linux-MM <linux-mm@kvack.org>,
Helge Deller <deller@gmx.de>,
linux-parisc@vger.kernel.org
Subject: Re: [PATCH 6.10 000/809] 6.10.3-rc3 review
Date: Tue, 06 Aug 2024 20:49:19 -0400 [thread overview]
Message-ID: <c54ab27ff0f0bb3e9e681eec9a62549e5e245a6b.camel@HansenPartnership.com> (raw)
In-Reply-To: <87frrh44mf.ffs@tglx>
On Wed, 2024-08-07 at 01:24 +0200, Thomas Gleixner wrote:
> Cc+: Helge, parisc ML
>
> We're chasing a weird failure which has been tracked down to the
> placement of the division library functions (I assume they are
> imported
> from libgcc).
>
> See the thread starting at:
>
>
> https://lore.kernel.org/all/718b8afe-222f-4b3a-96d3-93af0e4ceff1@roeck-us.net
>
> On Tue, Aug 06 2024 at 21:25, Vlastimil Babka wrote:
> > On 8/6/24 19:33, Thomas Gleixner wrote:
> > >
> > > So this change adds 16 bytes to __softirq() which moves the
> > > division
> > > functions up by 16 bytes. That's all it takes to make the stupid
> > > go
> > > away....
> >
> > Heh I was actually wondering if the division is somhow messed up
> > because
> > maxobj = order_objects() and order_objects() does a division. Now I
> > suspect
> > it even more.
>
> check_slab() calls into that muck, but I checked the disassembly of a
> working and a broken kernel and the only difference there is the
> displacement offset when the code calculates the call address, but
> that's as expected a difference of 16 bytes.
>
> Now it becomes interesting.
>
> I added a unused function after __do_softirq() into the softirq text
> section and filled it with ASM nonsense so that it occupies exactly
> one
> page. That moves $$divoI, which is what check_slab() calls, exactly
> one
> page forward:
>
> -0000000041218c70 T $$divoI
> +0000000041219c70 T $$divoI
>
> Guess what happens? If falls on it's nose again.
>
> Now with that ASM gunk I can steer the size conveniently. It works up
> to:
>
> 0000000041219c50 T $$divoI
>
> and fails for
>
> 0000000041219c60 T $$divoI
> 0000000041219c70 T $$divoI
>
> and works again at
>
> 0000000041219c80 T $$divoI
So just on this, you seem to have proved that only exact multiples of
48 work. In terms of how PA-RISC caching works that's completely nuts
... however, there may be something else at work, like stack frame
alignment.
>
> So I added the following:
>
> +extern void testme(void);
> +extern unsigned int testsize;
> +
> +unsigned int testsize = 192;
> +
> +void __init testme(void)
> +{
> + pr_info("TESTME: %lu\n", PAGE_SIZE / testsize);
> +}
>
> called that _before_ mm_core_init() from init/main.c and adjusted my
> ASM hack to make $$divoI be at:
>
> 0000000041219c70 T $$divoI
>
> again and surprisingly the output is:
>
> [ 0.000000] softirq: TESTME: 21
OK, why is that surprising? 4096/192 is 21 due to integer rounding.
> Now I went back to the hppa64 gcc version 12.2.0 again and did the
> same ASM gunk adjustment so that $$divoI ends up at the offset 0xc70
> in the page and the same happens.
>
> So it's not a compiler dependent problem.
>
> But then I added a testme() call to the error path and get:
>
> [ 0.000000] softirq: TESTME: 21
> [ 0.000000]
> =====================================================================
> ========
> [ 0.000000] BUG kmem_cache_node (Not tainted): objects 21 > max 16
> size 192 sorder 0
>
> Now what's wrong?
>
> Adding more debug:
>
> [ 0.000000] BUG kmem_cache_node (Not tainted): objects 21 > max 16
> size 192 sorder 0 21
>
> where the last '21' is the output of the same call which made maxobj
> go
> south:
>
> static int check_slab(struct kmem_cache *s, struct slab *slab)
> {
> int maxobj;
> @@ -1386,8 +1388,10 @@ static int check_slab(struct kmem_cache
>
> maxobj = order_objects(slab_order(slab), s->size);
> if (slab->objects > maxobj) {
> - slab_err(s, slab, "objects %u > max %u",
> - slab->objects, maxobj);
> + testme();
> + slab_err(s, slab, "objects %u > max %u size %u sorder
> %u %u",
> + slab->objects, maxobj, s->size,
> slab_order(slab),
> + order_objects(slab_order(slab), s->size));
> return 0;
> }
> if (slab->inuse > slab->objects) {
>
> I don't know and I don't want to know TBH...
OK, so you're telling us we have a problem with slab_order on parisc
... that's folio_order, so it smells like a parisc bug with
folio_test_large? Unfortuntely I'm a bit pissed in an airport lounge
on my way to the UK, so I've lost access to my pa test rig and can't
test further for a while.
Regards,
James
next prev parent reply other threads:[~2024-08-07 0:49 UTC|newest]
Thread overview: 33+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <20240731095022.970699670@linuxfoundation.org>
[not found] ` <718b8afe-222f-4b3a-96d3-93af0e4ceff1@roeck-us.net>
2024-08-05 3:28 ` [PATCH 6.10 000/809] 6.10.3-rc3 review Guenter Roeck
2024-08-05 8:56 ` Thomas Gleixner
2024-08-05 12:51 ` Thomas Gleixner
2024-08-05 15:02 ` Guenter Roeck
2024-08-05 21:49 ` Thomas Gleixner
2024-08-06 1:16 ` Guenter Roeck
2024-08-05 17:42 ` Guenter Roeck
[not found] ` <CAHk-=wiZ7WJQ1y=CwuMwqBxQYtaD8psq+Vxa3r1Z6_ftDZK+hA@mail.gmail.com>
[not found] ` <53b2e1f2-4291-48e5-a668-7cf57d900ecd@suse.cz>
[not found] ` <87le194kuq.ffs@tglx>
[not found] ` <90e02d99-37a2-437e-ad42-44b80c4e94f6@suse.cz>
2024-08-06 23:24 ` Thomas Gleixner
2024-08-07 0:49 ` James Bottomley [this message]
2024-08-07 1:38 ` Guenter Roeck
2024-08-07 12:45 ` Thomas Gleixner
2024-08-08 1:07 ` Guenter Roeck
2024-08-08 7:48 ` Vlastimil Babka
2024-08-08 14:46 ` Guenter Roeck
2024-08-08 9:57 ` Thomas Gleixner
2024-08-08 14:59 ` Guenter Roeck
2024-08-08 15:58 ` John David Anglin
2024-08-08 15:53 ` Linus Torvalds
2024-08-08 16:12 ` Thomas Gleixner
2024-08-08 16:33 ` Linus Torvalds
2024-08-08 17:48 ` Thomas Gleixner
2024-08-08 18:19 ` Linus Torvalds
2024-08-08 20:52 ` Guenter Roeck
2024-08-08 21:50 ` John David Anglin
2024-08-08 22:29 ` John David Anglin
2024-08-08 23:33 ` Linus Torvalds
2024-08-09 0:33 ` John David Anglin
2024-08-09 0:56 ` Guenter Roeck
2024-08-09 0:50 ` Guenter Roeck
2024-08-08 22:15 ` Richard Henderson
2024-09-03 7:54 ` Helge Deller
2024-09-03 14:13 ` Guenter Roeck
2024-09-03 18:43 ` Linus Torvalds
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=c54ab27ff0f0bb3e9e681eec9a62549e5e245a6b.camel@HansenPartnership.com \
--to=james.bottomley@hansenpartnership.com \
--cc=deller@gmx.de \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=linux-parisc@vger.kernel.org \
--cc=linux@roeck-us.net \
--cc=tglx@linutronix.de \
--cc=torvalds@linux-foundation.org \
--cc=vbabka@suse.cz \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox