public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* [GIT PULL] sched_ext: Changes for v6.13
@ 2024-11-18 16:45 Tejun Heo
  2024-11-20 18:32 ` pr-tracker-bot
  0 siblings, 1 reply; 2+ messages in thread
From: Tejun Heo @ 2024-11-18 16:45 UTC (permalink / raw)
  To: Linus Torvalds; +Cc: linux-kernel, David Vernet

The following changes since commit a759bf0dfc4db3cb6556fc79c7c98da3a46b2b80:

  sched_ext: Update scx_show_state.py to match scx_ops_bypass_depth's new type (2024-11-05 11:45:27 -1000)

are available in the Git repository at:

  git://git.kernel.org/pub/scm/linux/kernel/git/tj/sched_ext.git/ tags/sched_ext-for-6.13

for you to fetch changes up to 6b8950ef993bcf198d4a80cde0b2da805b75ed70:

  sched_ext: Replace scx_next_task_picked() with switch_class() in comment (2024-11-14 15:30:24 -1000)

----------------------------------------------------------------
sched_ext: Change for v6.13

- Improve the default select_cpu() implementation making it topology aware
  and handle WAKE_SYNC better.

- set_arg_maybe_null() was used to inform the verifier which ops args could
  be NULL in a rather hackish way. Use the new __nullable CFI stub tags
  instead.

- On Sapphire Rapids multi-socket systems, a BPF scheduler, by hammering on
  the same queue across sockets, could live-lock the system to the point
  where the system couldn't make reasonable forward progress. This could
  lead to soft-lockup triggered resets or stalling out bypass mode switch
  and thus BPF scheduler ejection for tens of minutes if not hours. After
  trying a number of mitigations, the following set worked reliably:

  - Injecting artificial cpu_relax() loops in two places while sched_ext is
    trying to turn on the bypass mode.

  - Triggering scheduler ejection when soft-lockup detection is imminent (a
    quarter of threshold left).

  While not the prettiest, the impact both in terms of code complexity and
  overhead is minimal.

- A common complaint on the API is the overuse of the word "dispatch" and
  the confusion around "consume". This is due to how the dispatch queues
  became more generic over time. Rename the affected kfuncs for clarity.
  Thanks to BPF's compatibility features, this change can be made in a way
  that's both forward and backward compatible. The compatibility code will
  be dropped in a few releases.

- Pull sched_ext/for-6.12-fixes to receive a prerequisite change. Other misc
  changes.

----------------------------------------------------------------
Andrea Righi (6):
      sched_ext: improve WAKE_SYNC behavior for default idle CPU selection
      sched_ext: Clarify ops.select_cpu() for single-CPU tasks
      sched_ext: Introduce LLC awareness to the default idle selection policy
      sched_ext: Introduce NUMA awareness to the default idle selection policy
      sched_ext: Do not enable LLC/NUMA optimizations when domains overlap
      sched_ext: Fix incorrect use of bitwise AND

Changwoo Min (2):
      sched_ext: Clarify sched_ext_ops table for userland scheduler
      sched_ext: add a missing rcu_read_lock/unlock pair at scx_select_cpu_dfl()

Tejun Heo (10):
      sched_ext: Factor out move_task_between_dsqs() from scx_dispatch_from_dsq()
      sched_ext: Rename CFI stubs to names that are recognized by BPF
      sched_ext: Replace set_arg_maybe_null() with __nullable CFI stub tags
      Merge branch 'for-6.12-fixes' into for-6.13
      sched_ext: Avoid live-locking bypass mode switching
      sched_ext: Enable the ops breather and eject BPF scheduler on softlockup
      sched_ext: scx_bpf_dispatch_from_dsq_set_*() are allowed from unlocked context
      sched_ext: Rename scx_bpf_dispatch[_vtime]() to scx_bpf_dsq_insert[_vtime]()
      sched_ext: Rename scx_bpf_consume() to scx_bpf_dsq_move_to_local()
      sched_ext: Rename scx_bpf_dispatch[_vtime]_from_dsq*() -> scx_bpf_dsq_move[_vtime]*()

Tianchen Ding (1):
      sched_ext: Use btf_ids to resolve task_struct

Zhang Qiao (2):
      sched/ext: Fix unmatch trailing comment of CONFIG_EXT_GROUP_SCHED
      sched/ext: Use tg_cgroup() to elieminate duplicate code

Zhao Mengmeng (1):
      sched_ext: Replace scx_next_task_picked() with switch_class() in comment

 Documentation/scheduler/sched-ext.rst    |  71 ++-
 include/linux/sched/ext.h                |   2 +
 kernel/sched/ext.c                       | 968 +++++++++++++++++++++----------
 kernel/watchdog.c                        |   8 +
 tools/sched_ext/include/scx/common.bpf.h |  18 +-
 tools/sched_ext/include/scx/compat.bpf.h | 117 +++-
 tools/sched_ext/scx_central.bpf.c        |  14 +-
 tools/sched_ext/scx_flatcg.bpf.c         |  20 +-
 tools/sched_ext/scx_qmap.bpf.c           |  36 +-
 tools/sched_ext/scx_show_state.py        |   2 +
 tools/sched_ext/scx_simple.bpf.c         |  16 +-
 11 files changed, 878 insertions(+), 394 deletions(-)

-- 
tejun

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: [GIT PULL] sched_ext: Changes for v6.13
  2024-11-18 16:45 [GIT PULL] sched_ext: Changes for v6.13 Tejun Heo
@ 2024-11-20 18:32 ` pr-tracker-bot
  0 siblings, 0 replies; 2+ messages in thread
From: pr-tracker-bot @ 2024-11-20 18:32 UTC (permalink / raw)
  To: Tejun Heo; +Cc: Linus Torvalds, linux-kernel, David Vernet

The pull request you sent on Mon, 18 Nov 2024 06:45:19 -1000:

> git://git.kernel.org/pub/scm/linux/kernel/git/tj/sched_ext.git/ tags/sched_ext-for-6.13

has been merged into torvalds/linux.git:
https://git.kernel.org/torvalds/c/8f7c8b88bda4988f44e595a760438febf51c92c8

Thank you!

-- 
Deet-doot-dot, I am a bot.
https://korg.docs.kernel.org/prtracker.html

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2024-11-20 18:32 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-11-18 16:45 [GIT PULL] sched_ext: Changes for v6.13 Tejun Heo
2024-11-20 18:32 ` pr-tracker-bot

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox