From: "Michael Kerrisk (man-pages)" <mtk.manpages@gmail.com>
To: Peter Zijlstra <peterz@infradead.org>
Cc: mtk.manpages@gmail.com, Juri Lelli <juri.lelli@gmail.com>,
Dario Faggioli <raistlin@linux.it>,
Thomas Gleixner <tglx@linutronix.de>,
Ingo Molnar <mingo@redhat.com>,
rostedt@goodmis.org, Oleg Nesterov <oleg@redhat.com>,
fweisbec@gmail.com, darren@dvhart.com, johan.eker@ericsson.com,
p.faure@akatech.ch, Linux Kernel <linux-kernel@vger.kernel.org>,
claudio@evidence.eu.com, michael@amarulasolutions.com,
fchecconi@gmail.com, tommaso.cucinotta@sssup.it,
nicola.manica@disi.unitn.it, luca.abeni@unitn.it,
dhaval.giani@gmail.com, hgu1972@gmail.com,
Paul McKenney <paulmck@linux.vnet.ibm.com>,
insop.song@gmail.com, liming.wang@windriver.com,
jkacur@redhat.com, linux-man@vger.kernel.org
Subject: Re: sched_{set,get}attr() manpage
Date: Fri, 09 May 2014 10:23:22 +0200 [thread overview]
Message-ID: <536C907A.1040205@gmail.com> (raw)
In-Reply-To: <20140506081636.GA4460@twins.programming.kicks-ass.net>
Hi Peter,
I'm working on this text. I see the following in kernel/sched/core.c:
[[
static int __sched_setscheduler(struct task_struct *p,
const struct sched_attr *attr,
bool user)
{
...
int policy = attr->sched_policy;
...
if (policy < 0) {
reset_on_fork = p->sched_reset_on_fork;
policy = oldpolicy = p->policy;
]]
What's a negative policy about? Is this something that should
be documented?
Cheers,
Michael
On 05/06/2014 10:16 AM, Peter Zijlstra wrote:
> On Mon, May 05, 2014 at 09:21:14AM +0200, Peter Zijlstra wrote:
>> On Mon, May 05, 2014 at 08:55:28AM +0200, Michael Kerrisk (man-pages) wrote:
>>> Hi Peter,
>>>
>>> Looks like a good set of comments from Juri. Could you revise and
>>> resubmit?
>>
>> Yeah, I'll try and get it done today, but there's a few icky bugs
>> waiting for my attention as well, I'll do me bestest :-)
>
> OK, not quite managed it yesterday, but here goes.
>
> So Verbatim license, for the first part to me and whoever I borrowed
> sched_setscheduler() bits from.
>
> For the second part to me and Juri.
>
> ---
>
>> [1] A page describing the sched_setattr() and sched_getattr() APIs
>
> NAME
> sched_setattr, sched_getattr - set and get scheduling policy/attributes
>
> SYNOPSIS
> #include <sched.h>
>
> struct sched_attr {
> u32 size;
> u32 sched_policy;
> u64 sched_flags;
>
> /* SCHED_NORMAL, SCHED_BATCH */
> s32 sched_nice;
>
> /* SCHED_FIFO, SCHED_RR */
> u32 sched_priority;
>
> /* SCHED_DEADLINE */
> u64 sched_runtime;
> u64 sched_deadline;
> u64 sched_period;
> };
>
> int sched_setattr(pid_t pid, const struct sched_attr *attr, unsigned int flags);
>
> int sched_getattr(pid_t pid, const struct sched_attr *attr, unsigned int size, unsigned int flags);
>
> DESCRIPTION
> sched_setattr() sets both the scheduling policy and the
> associated attributes for the process whose ID is specified in
> pid.
>
> sched_setattr() replaces sched_setscheduler(), sched_setparam(),
> nice() and some of setpriority().
>
> If pid equals zero, the scheduling policy and attributes
> of the calling process will be set. The interpretation of the
> argument attr depends on the selected policy. Currently, Linux
> supports the following "normal" (i.e., non-real-time) scheduling
> policies:
>
> SCHED_OTHER the standard "fair" time-sharing policy;
>
> SCHED_BATCH for "batch" style execution of processes; and
>
> SCHED_IDLE for running very low priority background jobs.
>
> The following "real-time" policies are also supported, for
> special time-critical applications that need precise control
> over the way in which runnable processes are selected for
> execution:
>
> SCHED_FIFO a static priority first-in, first-out policy;
>
> SCHED_RR a static priority round-robin policy; and
>
> SCHED_DEADLINE a dynamic priority deadline policy.
>
> The semantics of each of these policies are detailed in
> sched(7).
>
> sched_attr::size must be set to the size of the structure, as in
> sizeof(struct sched_attr), if the provided structure is smaller
> than the kernel structure, any additional fields are assumed
> '0'. If the provided structure is larger than the kernel
> structure, the kernel verifies all additional fields are '0' if
> not the syscall will fail with -E2BIG.
>
> sched_attr::sched_policy the desired scheduling policy.
>
> sched_attr::sched_flags additional flags that can influence
> scheduling behaviour. Currently as per Linux kernel 3.14:
>
> SCHED_FLAG_RESET_ON_FORK - resets the scheduling policy
> to: (struct sched_attr){ .sched_policy = SCHED_OTHER, }
> on fork().
>
> is the only supported flag.
>
> sched_attr::sched_nice should only be set for SCHED_OTHER,
> SCHED_BATCH, the desired nice value [-20,19], see sched(7).
>
> sched_attr::sched_priority should only be set for SCHED_FIFO,
> SCHED_RR, the desired static priority [1,99], see sched(7).
>
> sched_attr::sched_runtime in nanoseconds,
> sched_attr::sched_deadline in nanoseconds,
> sched_attr::sched_period in nanoseconds, should only be set for
> SCHED_DEADLINE and are the traditional sporadic task model
> parameters, see sched(7).
>
> The flags argument should be 0.
>
> sched_getattr() queries the scheduling policy currently applied
> to the process identified by pid.
>
> Similar to sched_setattr(), sched_getattr() replaces
> sched_getscheduler(), sched_getparam() and some of
> getpriority().
>
> If pid equals zero, the policy of the calling process will be
> retrieved.
>
> The size argument should reflect the size of struct sched_attr
> as known to userspace. The kernel fills out sched_attr::size to
> the size of its sched_attr structure. If the user provided
> structure is larger, additional fields are not touched. If the
> user provided structure is smaller, but the kernel needs to
> return values outside the provided space, the syscall will fail
> with -E2BIG.
>
> The flags argument should be 0.
>
> The other sched_attr fields are filled out as described in
> sched_setattr().
>
> RETURN VALUE
> On success, sched_setattr() and sched_getattr() return 0. On
> error, -1 is returned, and errno is set appropriately.
>
> ERRORS
> EINVAL The scheduling policy is not one of the recognized policies,
> param is NULL, or param does not make sense for the selected
> policy.
>
> EPERM The calling process does not have appropriate privileges.
>
> ESRCH The process whose ID is pid could not be found.
>
> E2BIG The provided storage for struct sched_attr is either too
> big, see sched_setattr(), or too small, see sched_getattr().
>
> EBUSY SCHED_DEADLINE admission control failure, see sched(7).
>
> NOTES
> While the text above (and in sched_setscheduler(2)) talks about
> processes, in actual fact these system calls are thread specific.
>
> While the SCHED_DEADLINE parameters are in nanoseconds, current
> kernels truncate the lower 10 bits and we get an effective
> microsecond resolution.
>
>> [2] A piece of text describing the SCHED_DEADLINE policy, which I can
>> drop into sched(7).
>
> SCHED_DEADLINE: Sporadic task model deadline scheduling
> SCHED_DEADLINE is currently implemented using GEDF (Global
> Earliest Deadline First) with additional CBS (Constant Bandwidth
> Server).
>
> A sporadic task is one that has a sequence of jobs, where each
> job is activated at most once per period. Each job has also a
> relative deadline, before which it should finish execution, and a
> computation time, that is the time necessary for executing the
> job without interruption. The instant of time when a task wakes
> up, because a new job has to be executed, is called arrival time
> (and it is also referred to as request time or release time).
> Start time is instead the time at which a task starts its
> execution. The absolute deadline is thus obtained adding the
> relative deadline to the arrival time.
>
> The following diagram clarifies these terms:
>
> arrival/wakeup absolute deadline
> | start time |
> v v v
> -------x--------xoooooooooooo-------x--------x-----
> |<- comp. ->|
> |<---------- rel. deadline ->|
> |<---------- period ----------------->|
>
> SCHED_DEADLINE allows the user to specify three parameters (see
> sched_setattr(2)): Runtime [ns], Deadline [ns] and Period [ns].
> Such parameters has not necessarily to correspond to the
> aforementioned terms, while usual practise is to set Runtime to
> something bigger than the average computation time (or worst-case
> execution time for hard real-time tasks), Deadline to the
> relative deadline and Period to the period of the task. With such
> a setting we would have:
>
> arrival/wakeup absolute deadline
> | start time |
> v v v
> -------x--------xoooooooooooo-------x--------x-----
> |<- Runtime -->|
> |<---------- Deadline ------>|
> |<---------- Period ----------------->|
>
> It is checked that: Runtime <= Deadline <= Period.
>
> The CBS guarantees non-interference between tasks, by throttling
> tasks that attempt to over-run their specified Runtime.
>
> In general the set of all SCHED_DEADLINE tasks is not
> feasible/schedulable within the given constraints. To guarantee
> some degree of timeliness we must do an admittance test on
> setting/changing SCHED_DEADLINE policy/attributes.
>
> This admission test calculates that the task set is
> feasible/schedulable, failing this, sched_setattr() will return
> -EBUSY.
>
> For example, it is required (but not necessarily sufficient) for
> the total utilization to be less or equal to the total amount of
> CPUs available, where, since each task can maximally run for
> Runtime per Period, that task's utilization is its
> Runtime/Period.
>
> Because we must be able to calculate admittance SCHED_DEADLINE
> tasks are the highest priority (user controllable) tasks in the
> system, if any SCHED_DEADLINE task is runnable it will preempt
> any FIFO/RR/OTHER/BATCH/IDLE task.
>
> SCHED_DEADLINE tasks will fail fork(2) with -EAGAIN, except when
> the forking task has SCHED_FLAG_RESET_ON_FORK set.
>
> A SCHED_DEADLINE task calling sched_yield() will 'yield' the
> current job and wait for a new period to begin.
>
--
Michael Kerrisk
Linux man-pages maintainer; http://www.kernel.org/doc/man-pages/
Linux/UNIX System Programming Training: http://man7.org/training/
next prev parent reply other threads:[~2014-05-09 8:23 UTC|newest]
Thread overview: 71+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-12-17 12:27 [PATCH 00/13] sched, deadline: patches Peter Zijlstra
2013-12-17 12:27 ` [PATCH 01/13] sched: Add 3 new scheduler syscalls to support an extended scheduling parameters ABI Peter Zijlstra
2014-01-21 14:36 ` Michael Kerrisk
2014-01-21 15:38 ` Peter Zijlstra
2014-01-21 15:46 ` Peter Zijlstra
2014-01-21 16:02 ` Steven Rostedt
2014-01-21 16:06 ` Peter Zijlstra
2014-01-21 16:46 ` Juri Lelli
2014-02-14 14:13 ` Michael Kerrisk (man-pages)
2014-02-14 16:19 ` Peter Zijlstra
2014-02-15 12:52 ` Ingo Molnar
2014-02-17 13:20 ` Michael Kerrisk (man-pages)
2014-04-09 9:25 ` sched_{set,get}attr() manpage Peter Zijlstra
2014-04-09 15:19 ` Henrik Austad
2014-04-09 15:42 ` Peter Zijlstra
2014-04-10 7:47 ` Juri Lelli
2014-04-10 9:59 ` Claudio Scordino
2014-04-27 15:47 ` Michael Kerrisk (man-pages)
2014-04-27 19:34 ` Peter Zijlstra
2014-04-27 19:45 ` Steven Rostedt
2014-04-28 7:39 ` Juri Lelli
2014-04-28 8:18 ` Peter Zijlstra
2014-04-29 13:08 ` Michael Kerrisk (man-pages)
2014-04-29 14:22 ` Peter Zijlstra
2014-04-29 16:04 ` Peter Zijlstra
2014-04-30 11:09 ` Michael Kerrisk (man-pages)
2014-04-30 12:35 ` Peter Zijlstra
2014-04-30 13:09 ` Peter Zijlstra
2014-05-03 10:43 ` Juri Lelli
2014-05-05 6:55 ` Michael Kerrisk (man-pages)
2014-05-05 7:21 ` Peter Zijlstra
2014-05-05 7:41 ` Michael Kerrisk (man-pages)
2014-05-05 7:47 ` Peter Zijlstra
2014-05-05 9:53 ` Michael Kerrisk (man-pages)
2014-05-06 8:16 ` Peter Zijlstra
2014-05-09 8:23 ` Michael Kerrisk (man-pages) [this message]
2014-05-09 8:53 ` Peter Zijlstra
2014-05-09 9:26 ` Michael Kerrisk (man-pages)
2014-05-19 13:06 ` [tip:sched/core] sched: Disallow sched_attr::sched_policy < 0 tip-bot for Peter Zijlstra
2014-05-22 12:25 ` tip-bot for Peter Zijlstra
2014-02-21 20:32 ` [tip:sched/urgent] sched: Add 'flags' argument to sched_{set, get}attr() syscalls tip-bot for Peter Zijlstra
2014-01-26 9:48 ` [PATCH 01/13] sched: Add 3 new scheduler syscalls to support an extended scheduling parameters ABI Geert Uytterhoeven
2013-12-17 12:27 ` [PATCH 02/13] sched: SCHED_DEADLINE structures & implementation Peter Zijlstra
2013-12-17 12:27 ` [PATCH 03/13] sched: SCHED_DEADLINE SMP-related data structures & logic Peter Zijlstra
2013-12-17 12:27 ` [PATCH 04/13] [PATCH 05/13] sched: SCHED_DEADLINE avg_update accounting Peter Zijlstra
2013-12-17 12:27 ` [PATCH 05/13] sched: Add period support for -deadline tasks Peter Zijlstra
2013-12-17 12:27 ` [PATCH 06/13] [PATCH 07/13] sched: Add latency tracing " Peter Zijlstra
2013-12-17 12:27 ` [PATCH 07/13] rtmutex: Turn the plist into an rb-tree Peter Zijlstra
2013-12-17 12:27 ` [PATCH 08/13] sched: Drafted deadline inheritance logic Peter Zijlstra
2013-12-17 12:27 ` [PATCH 09/13] sched: Add bandwidth management for sched_dl Peter Zijlstra
2013-12-18 16:55 ` Peter Zijlstra
2013-12-20 17:13 ` Peter Zijlstra
2013-12-20 17:37 ` Steven Rostedt
2013-12-20 17:42 ` Peter Zijlstra
2013-12-20 18:23 ` Steven Rostedt
2013-12-20 18:26 ` Steven Rostedt
2013-12-20 21:44 ` Peter Zijlstra
2013-12-20 23:29 ` Steven Rostedt
2013-12-21 10:05 ` Peter Zijlstra
2013-12-21 17:26 ` Peter Zijlstra
2014-01-13 15:55 ` [tip:sched/core] sched/deadline: Fix hotplug admission control tip-bot for Peter Zijlstra
2013-12-17 12:27 ` [PATCH 10/13] sched: speed up -dl pushes with a push-heap Peter Zijlstra
2013-12-17 12:27 ` [PATCH 11/13] sched: Remove sched_setscheduler2() Peter Zijlstra
2013-12-17 12:27 ` [PATCH 12/13] sched, deadline: Fixup the smp-affinity mask tests Peter Zijlstra
2013-12-17 12:27 ` [PATCH 13/13] sched, deadline: Remove the sysctl_sched_dl knobs Peter Zijlstra
2013-12-17 20:17 ` [PATCH] sched, deadline: Properly initialize def_dl_bandwidth lock Steven Rostedt
2013-12-18 10:01 ` Peter Zijlstra
2013-12-20 13:51 ` [PATCH 00/13] sched, deadline: patches Juri Lelli
2013-12-20 14:28 ` Steven Rostedt
2013-12-20 14:51 ` Peter Zijlstra
2013-12-20 15:19 ` Steven Rostedt
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=536C907A.1040205@gmail.com \
--to=mtk.manpages@gmail.com \
--cc=claudio@evidence.eu.com \
--cc=darren@dvhart.com \
--cc=dhaval.giani@gmail.com \
--cc=fchecconi@gmail.com \
--cc=fweisbec@gmail.com \
--cc=hgu1972@gmail.com \
--cc=insop.song@gmail.com \
--cc=jkacur@redhat.com \
--cc=johan.eker@ericsson.com \
--cc=juri.lelli@gmail.com \
--cc=liming.wang@windriver.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-man@vger.kernel.org \
--cc=luca.abeni@unitn.it \
--cc=michael@amarulasolutions.com \
--cc=mingo@redhat.com \
--cc=nicola.manica@disi.unitn.it \
--cc=oleg@redhat.com \
--cc=p.faure@akatech.ch \
--cc=paulmck@linux.vnet.ibm.com \
--cc=peterz@infradead.org \
--cc=raistlin@linux.it \
--cc=rostedt@goodmis.org \
--cc=tglx@linutronix.de \
--cc=tommaso.cucinotta@sssup.it \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).