From: Peter Zijlstra <peterz@infradead.org>
To: Steven Rostedt <rostedt@goodmis.org>
Cc: LKML <linux-kernel@vger.kernel.org>,
Thomas Gleixner <tglx@linutronix.de>,
Ankur Arora <ankur.a.arora@oracle.com>,
Linus Torvalds <torvalds@linux-foundation.org>,
linux-mm@kvack.org, x86@kernel.org, akpm@linux-foundation.org,
luto@kernel.org, bp@alien8.de, dave.hansen@linux.intel.com,
hpa@zytor.com, mingo@redhat.com, juri.lelli@redhat.com,
vincent.guittot@linaro.org, willy@infradead.org, mgorman@suse.de,
jon.grimm@amd.com, bharata@amd.com, raghavendra.kt@amd.com,
boris.ostrovsky@oracle.com, konrad.wilk@oracle.com,
jgross@suse.com, andrew.cooper3@citrix.com,
Joel Fernandes <joel@joelfernandes.org>,
Youssef Esmat <youssefesmat@chromium.org>,
Vineeth Pillai <vineethrp@google.com>,
Suleiman Souhlal <suleiman@google.com>,
Ingo Molnar <mingo@kernel.org>,
Daniel Bristot de Oliveira <bristot@kernel.org>
Subject: Re: [POC][RFC][PATCH] sched: Extended Scheduler Time Slice
Date: Wed, 25 Oct 2023 12:29:52 +0200 [thread overview]
Message-ID: <20231025102952.GG37471@noisy.programming.kicks-ass.net> (raw)
In-Reply-To: <20231025054219.1acaa3dd@gandalf.local.home>
On Wed, Oct 25, 2023 at 05:42:19AM -0400, Steven Rostedt wrote:
> That is, there's this structure for every thread. It's assigned with:
>
> fd = open("/sys/kernel/extend_sched", O_RDWR);
> extend_map = mmap(NULL, getpagesize(), PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
>
> I don't actually like this interface, as it wastes a full page for just two
> bits :-p Perhaps it should be a new system call, where it just locks in
> existing memory from the user application? The requirement is that each
> thread needs its own bits to play with. It should not be shared with other
> threads. It could be, as it will not mess up the kernel, but will mess up
> the application.
What was wrong with using rseq?
> Anyway, to tell the kernel to "extend" the time slice if possible because
> it's in a critical section, we have:
>
> static void extend(void)
> {
> if (!extend_map)
> return;
>
> extend_map->flags = 1;
> }
>
> And to say that's it's done:
>
> static void unextend(void)
> {
> unsigned long prev;
>
> if (!extend_map)
> return;
>
> prev = xchg(&extend_map->flags, 0);
> if (prev & 2)
> sched_yield();
> }
>
> So, bit 1 is for user space to tell the kernel "please extend me", and bit
> two is for the kernel to tell user space "OK, I extended you, but call
> sched_yield() when done".
So what if it doesn't ? Can we kill it for not playing nice ?
[ aside from it being bit 0 and bit 1 as you yourself point out, it is
also jarring you use a numeral for one and write out the other. ]
That said, I properly hate all these things, extending a slice doesn't
reliably work and we're always left with people demanding an ever longer
extension.
The *much* better heuristic is what the kernel uses, don't spin if the
lock holder isn't running.
next prev parent reply other threads:[~2023-10-25 10:30 UTC|newest]
Thread overview: 26+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-10-25 9:42 [POC][RFC][PATCH] sched: Extended Scheduler Time Slice Steven Rostedt
2023-10-25 9:46 ` Steven Rostedt
2023-10-25 10:29 ` Peter Zijlstra [this message]
2023-10-25 12:54 ` Steven Rostedt
2023-10-25 13:55 ` Peter Zijlstra
2023-10-25 14:31 ` Steven Rostedt
2023-10-25 14:53 ` Mathieu Desnoyers
2023-10-25 15:07 ` Steven Rostedt
2023-10-25 15:42 ` Mathieu Desnoyers
2023-10-25 16:24 ` Mateusz Guzik
2023-10-25 17:17 ` Steven Rostedt
2023-10-25 18:49 ` Mathieu Desnoyers
2023-10-25 19:19 ` Steven Rostedt
2023-10-25 21:56 ` Mathieu Desnoyers
2023-10-26 8:54 ` Peter Zijlstra
2023-10-26 13:40 ` Steven Rostedt
2023-10-26 15:49 ` Steven Rostedt
2023-10-26 16:31 ` Daniel Bristot de Oliveira
2023-10-26 17:26 ` Steven Rostedt
2023-10-26 8:44 ` Peter Zijlstra
2023-10-26 13:16 ` Steven Rostedt
2023-10-30 13:29 ` Peter Zijlstra
2023-10-30 13:52 ` Steven Rostedt
2023-10-26 5:03 ` Ankur Arora
2023-10-25 15:12 ` Steven Rostedt
2023-10-25 15:34 ` Rasmus Villemoes
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20231025102952.GG37471@noisy.programming.kicks-ass.net \
--to=peterz@infradead.org \
--cc=akpm@linux-foundation.org \
--cc=andrew.cooper3@citrix.com \
--cc=ankur.a.arora@oracle.com \
--cc=bharata@amd.com \
--cc=boris.ostrovsky@oracle.com \
--cc=bp@alien8.de \
--cc=bristot@kernel.org \
--cc=dave.hansen@linux.intel.com \
--cc=hpa@zytor.com \
--cc=jgross@suse.com \
--cc=joel@joelfernandes.org \
--cc=jon.grimm@amd.com \
--cc=juri.lelli@redhat.com \
--cc=konrad.wilk@oracle.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=luto@kernel.org \
--cc=mgorman@suse.de \
--cc=mingo@kernel.org \
--cc=mingo@redhat.com \
--cc=raghavendra.kt@amd.com \
--cc=rostedt@goodmis.org \
--cc=suleiman@google.com \
--cc=tglx@linutronix.de \
--cc=torvalds@linux-foundation.org \
--cc=vincent.guittot@linaro.org \
--cc=vineethrp@google.com \
--cc=willy@infradead.org \
--cc=x86@kernel.org \
--cc=youssefesmat@chromium.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.