* [PATCH 0/2] s390/gmap/pgtable improve handling of keyed KVM guests
@ 2022-05-30 9:27 Christian Borntraeger
2022-05-30 9:27 ` [PATCH 1/2] s390/gmap: voluntarily schedule during key setting Christian Borntraeger
` (2 more replies)
0 siblings, 3 replies; 7+ messages in thread
From: Christian Borntraeger @ 2022-05-30 9:27 UTC (permalink / raw)
To: KVM
Cc: Christian Borntraeger, Janosch Frank, David Hildenbrand,
linux-s390, Thomas Huth, Claudio Imbrenda, Heiko Carstens,
Vasily Gorbik, Alexander Gordeev, Janis Schoetterl-Glausch
These two patches try to address stalls/timeouts that we have seen when
switching many guests to enable storage keys.
Christian Borntraeger (2):
s390/gmap: voluntarily schedule during key setting
s390/pgtable: use non-quiescing sske for KVM switch to keyed
arch/s390/mm/gmap.c | 14 ++++++++++++++
arch/s390/mm/pgtable.c | 2 +-
2 files changed, 15 insertions(+), 1 deletion(-)
--
2.35.1
^ permalink raw reply [flat|nested] 7+ messages in thread
* [PATCH 1/2] s390/gmap: voluntarily schedule during key setting
2022-05-30 9:27 [PATCH 0/2] s390/gmap/pgtable improve handling of keyed KVM guests Christian Borntraeger
@ 2022-05-30 9:27 ` Christian Borntraeger
2022-05-30 9:56 ` Claudio Imbrenda
2022-05-31 7:23 ` Alexander Gordeev
2022-05-30 9:27 ` [PATCH 2/2] s390/pgtable: use non-quiescing sske for KVM switch to keyed Christian Borntraeger
2022-05-31 8:28 ` [PATCH 0/2] s390/gmap/pgtable improve handling of keyed KVM guests Christian Borntraeger
2 siblings, 2 replies; 7+ messages in thread
From: Christian Borntraeger @ 2022-05-30 9:27 UTC (permalink / raw)
To: KVM
Cc: Christian Borntraeger, Janosch Frank, David Hildenbrand,
linux-s390, Thomas Huth, Claudio Imbrenda, Heiko Carstens,
Vasily Gorbik, Alexander Gordeev, Janis Schoetterl-Glausch
With large and many guest with storage keys it is possible to create
large latencies or stalls during initial key setting:
rcu: INFO: rcu_sched self-detected stall on CPU
rcu: 18-....: (2099 ticks this GP) idle=54e/1/0x4000000000000002 softirq=35598716/35598716 fqs=998
(t=2100 jiffies g=155867385 q=20879)
Task dump for CPU 18:
CPU 1/KVM R running task 0 1030947 256019 0x06000004
Call Trace:
sched_show_task
rcu_dump_cpu_stacks
rcu_sched_clock_irq
update_process_times
tick_sched_handle
tick_sched_timer
__hrtimer_run_queues
hrtimer_interrupt
do_IRQ
ext_int_handler
ptep_zap_key
The mmap lock is held during the page walking but since this is a
semaphore scheduling is still possible. Same for the kvm srcu.
To minimize overhead do this on every segment table entry or large page.
Signed-off-by: Christian Borntraeger <borntraeger@linux.ibm.com>
---
arch/s390/mm/gmap.c | 14 ++++++++++++++
1 file changed, 14 insertions(+)
diff --git a/arch/s390/mm/gmap.c b/arch/s390/mm/gmap.c
index 1ac73917a8d3..b8ae4a4aa2ba 100644
--- a/arch/s390/mm/gmap.c
+++ b/arch/s390/mm/gmap.c
@@ -2608,6 +2608,18 @@ static int __s390_enable_skey_pte(pte_t *pte, unsigned long addr,
return 0;
}
+/*
+ * Give a chance to schedule after setting a key to 256 pages.
+ * We only hold the mm lock, which is a rwsem and the kvm srcu.
+ * Both can sleep.
+ */
+static int __s390_enable_skey_pmd(pmd_t *pmd, unsigned long addr,
+ unsigned long next, struct mm_walk *walk)
+{
+ cond_resched();
+ return 0;
+}
+
static int __s390_enable_skey_hugetlb(pte_t *pte, unsigned long addr,
unsigned long hmask, unsigned long next,
struct mm_walk *walk)
@@ -2630,12 +2642,14 @@ static int __s390_enable_skey_hugetlb(pte_t *pte, unsigned long addr,
end = start + HPAGE_SIZE - 1;
__storage_key_init_range(start, end);
set_bit(PG_arch_1, &page->flags);
+ cond_resched();
return 0;
}
static const struct mm_walk_ops enable_skey_walk_ops = {
.hugetlb_entry = __s390_enable_skey_hugetlb,
.pte_entry = __s390_enable_skey_pte,
+ .pmd_entry = __s390_enable_skey_pmd,
};
int s390_enable_skey(void)
--
2.35.1
^ permalink raw reply related [flat|nested] 7+ messages in thread
* [PATCH 2/2] s390/pgtable: use non-quiescing sske for KVM switch to keyed
2022-05-30 9:27 [PATCH 0/2] s390/gmap/pgtable improve handling of keyed KVM guests Christian Borntraeger
2022-05-30 9:27 ` [PATCH 1/2] s390/gmap: voluntarily schedule during key setting Christian Borntraeger
@ 2022-05-30 9:27 ` Christian Borntraeger
2022-05-30 9:56 ` Claudio Imbrenda
2022-05-31 8:28 ` [PATCH 0/2] s390/gmap/pgtable improve handling of keyed KVM guests Christian Borntraeger
2 siblings, 1 reply; 7+ messages in thread
From: Christian Borntraeger @ 2022-05-30 9:27 UTC (permalink / raw)
To: KVM
Cc: Christian Borntraeger, Janosch Frank, David Hildenbrand,
linux-s390, Thomas Huth, Claudio Imbrenda, Heiko Carstens,
Vasily Gorbik, Alexander Gordeev, Janis Schoetterl-Glausch
The switch to a keyed guest does not require a classic sske as the other
guest CPUs are not accessing the key before the switch is complete.
By using the NQ SSKE things are faster especially with multiple guests.
Signed-off-by: Christian Borntraeger <borntraeger@linux.ibm.com>
Suggested-by: Janis Schoetterl-Glausch <scgl@linux.ibm.com>
---
arch/s390/mm/pgtable.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/arch/s390/mm/pgtable.c b/arch/s390/mm/pgtable.c
index 697df02362af..4909dcd762e8 100644
--- a/arch/s390/mm/pgtable.c
+++ b/arch/s390/mm/pgtable.c
@@ -748,7 +748,7 @@ void ptep_zap_key(struct mm_struct *mm, unsigned long addr, pte_t *ptep)
pgste_val(pgste) |= PGSTE_GR_BIT | PGSTE_GC_BIT;
ptev = pte_val(*ptep);
if (!(ptev & _PAGE_INVALID) && (ptev & _PAGE_WRITE))
- page_set_storage_key(ptev & PAGE_MASK, PAGE_DEFAULT_KEY, 1);
+ page_set_storage_key(ptev & PAGE_MASK, PAGE_DEFAULT_KEY, 0);
pgste_set_unlock(ptep, pgste);
preempt_enable();
}
--
2.35.1
^ permalink raw reply related [flat|nested] 7+ messages in thread
* Re: [PATCH 1/2] s390/gmap: voluntarily schedule during key setting
2022-05-30 9:27 ` [PATCH 1/2] s390/gmap: voluntarily schedule during key setting Christian Borntraeger
@ 2022-05-30 9:56 ` Claudio Imbrenda
2022-05-31 7:23 ` Alexander Gordeev
1 sibling, 0 replies; 7+ messages in thread
From: Claudio Imbrenda @ 2022-05-30 9:56 UTC (permalink / raw)
To: Christian Borntraeger
Cc: KVM, Janosch Frank, David Hildenbrand, linux-s390, Thomas Huth,
Heiko Carstens, Vasily Gorbik, Alexander Gordeev,
Janis Schoetterl-Glausch
On Mon, 30 May 2022 11:27:05 +0200
Christian Borntraeger <borntraeger@linux.ibm.com> wrote:
> With large and many guest with storage keys it is possible to create
> large latencies or stalls during initial key setting:
>
> rcu: INFO: rcu_sched self-detected stall on CPU
> rcu: 18-....: (2099 ticks this GP) idle=54e/1/0x4000000000000002 softirq=35598716/35598716 fqs=998
> (t=2100 jiffies g=155867385 q=20879)
> Task dump for CPU 18:
> CPU 1/KVM R running task 0 1030947 256019 0x06000004
> Call Trace:
> sched_show_task
> rcu_dump_cpu_stacks
> rcu_sched_clock_irq
> update_process_times
> tick_sched_handle
> tick_sched_timer
> __hrtimer_run_queues
> hrtimer_interrupt
> do_IRQ
> ext_int_handler
> ptep_zap_key
>
> The mmap lock is held during the page walking but since this is a
> semaphore scheduling is still possible. Same for the kvm srcu.
> To minimize overhead do this on every segment table entry or large page.
>
> Signed-off-by: Christian Borntraeger <borntraeger@linux.ibm.com>
Reviewed-by: Claudio Imbrenda <imbrenda@linux.ibm.com>
> ---
> arch/s390/mm/gmap.c | 14 ++++++++++++++
> 1 file changed, 14 insertions(+)
>
> diff --git a/arch/s390/mm/gmap.c b/arch/s390/mm/gmap.c
> index 1ac73917a8d3..b8ae4a4aa2ba 100644
> --- a/arch/s390/mm/gmap.c
> +++ b/arch/s390/mm/gmap.c
> @@ -2608,6 +2608,18 @@ static int __s390_enable_skey_pte(pte_t *pte, unsigned long addr,
> return 0;
> }
>
> +/*
> + * Give a chance to schedule after setting a key to 256 pages.
> + * We only hold the mm lock, which is a rwsem and the kvm srcu.
> + * Both can sleep.
> + */
> +static int __s390_enable_skey_pmd(pmd_t *pmd, unsigned long addr,
> + unsigned long next, struct mm_walk *walk)
> +{
> + cond_resched();
> + return 0;
> +}
> +
> static int __s390_enable_skey_hugetlb(pte_t *pte, unsigned long addr,
> unsigned long hmask, unsigned long next,
> struct mm_walk *walk)
> @@ -2630,12 +2642,14 @@ static int __s390_enable_skey_hugetlb(pte_t *pte, unsigned long addr,
> end = start + HPAGE_SIZE - 1;
> __storage_key_init_range(start, end);
> set_bit(PG_arch_1, &page->flags);
> + cond_resched();
> return 0;
> }
>
> static const struct mm_walk_ops enable_skey_walk_ops = {
> .hugetlb_entry = __s390_enable_skey_hugetlb,
> .pte_entry = __s390_enable_skey_pte,
> + .pmd_entry = __s390_enable_skey_pmd,
> };
>
> int s390_enable_skey(void)
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH 2/2] s390/pgtable: use non-quiescing sske for KVM switch to keyed
2022-05-30 9:27 ` [PATCH 2/2] s390/pgtable: use non-quiescing sske for KVM switch to keyed Christian Borntraeger
@ 2022-05-30 9:56 ` Claudio Imbrenda
0 siblings, 0 replies; 7+ messages in thread
From: Claudio Imbrenda @ 2022-05-30 9:56 UTC (permalink / raw)
To: Christian Borntraeger
Cc: KVM, Janosch Frank, David Hildenbrand, linux-s390, Thomas Huth,
Heiko Carstens, Vasily Gorbik, Alexander Gordeev,
Janis Schoetterl-Glausch
On Mon, 30 May 2022 11:27:06 +0200
Christian Borntraeger <borntraeger@linux.ibm.com> wrote:
> The switch to a keyed guest does not require a classic sske as the other
> guest CPUs are not accessing the key before the switch is complete.
> By using the NQ SSKE things are faster especially with multiple guests.
>
> Signed-off-by: Christian Borntraeger <borntraeger@linux.ibm.com>
> Suggested-by: Janis Schoetterl-Glausch <scgl@linux.ibm.com>
Reviewed-by: Claudio Imbrenda <imbrenda@linux.ibm.com>
> ---
> arch/s390/mm/pgtable.c | 2 +-
> 1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/arch/s390/mm/pgtable.c b/arch/s390/mm/pgtable.c
> index 697df02362af..4909dcd762e8 100644
> --- a/arch/s390/mm/pgtable.c
> +++ b/arch/s390/mm/pgtable.c
> @@ -748,7 +748,7 @@ void ptep_zap_key(struct mm_struct *mm, unsigned long addr, pte_t *ptep)
> pgste_val(pgste) |= PGSTE_GR_BIT | PGSTE_GC_BIT;
> ptev = pte_val(*ptep);
> if (!(ptev & _PAGE_INVALID) && (ptev & _PAGE_WRITE))
> - page_set_storage_key(ptev & PAGE_MASK, PAGE_DEFAULT_KEY, 1);
> + page_set_storage_key(ptev & PAGE_MASK, PAGE_DEFAULT_KEY, 0);
> pgste_set_unlock(ptep, pgste);
> preempt_enable();
> }
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH 1/2] s390/gmap: voluntarily schedule during key setting
2022-05-30 9:27 ` [PATCH 1/2] s390/gmap: voluntarily schedule during key setting Christian Borntraeger
2022-05-30 9:56 ` Claudio Imbrenda
@ 2022-05-31 7:23 ` Alexander Gordeev
1 sibling, 0 replies; 7+ messages in thread
From: Alexander Gordeev @ 2022-05-31 7:23 UTC (permalink / raw)
To: Christian Borntraeger
Cc: KVM, Janosch Frank, David Hildenbrand, linux-s390, Thomas Huth,
Claudio Imbrenda, Heiko Carstens, Vasily Gorbik,
Janis Schoetterl-Glausch
On Mon, May 30, 2022 at 11:27:05AM +0200, Christian Borntraeger wrote:
> With large and many guest with storage keys it is possible to create
> large latencies or stalls during initial key setting:
>
> rcu: INFO: rcu_sched self-detected stall on CPU
> rcu: 18-....: (2099 ticks this GP) idle=54e/1/0x4000000000000002 softirq=35598716/35598716 fqs=998
> (t=2100 jiffies g=155867385 q=20879)
> Task dump for CPU 18:
> CPU 1/KVM R running task 0 1030947 256019 0x06000004
> Call Trace:
> sched_show_task
> rcu_dump_cpu_stacks
> rcu_sched_clock_irq
> update_process_times
> tick_sched_handle
> tick_sched_timer
> __hrtimer_run_queues
> hrtimer_interrupt
> do_IRQ
> ext_int_handler
> ptep_zap_key
>
> The mmap lock is held during the page walking but since this is a
> semaphore scheduling is still possible. Same for the kvm srcu.
> To minimize overhead do this on every segment table entry or large page.
>
> Signed-off-by: Christian Borntraeger <borntraeger@linux.ibm.com>
Reviewed-by: Alexander Gordeev <agordeev@linux.ibm.com>
> ---
> arch/s390/mm/gmap.c | 14 ++++++++++++++
> 1 file changed, 14 insertions(+)
>
> diff --git a/arch/s390/mm/gmap.c b/arch/s390/mm/gmap.c
> index 1ac73917a8d3..b8ae4a4aa2ba 100644
> --- a/arch/s390/mm/gmap.c
> +++ b/arch/s390/mm/gmap.c
> @@ -2608,6 +2608,18 @@ static int __s390_enable_skey_pte(pte_t *pte, unsigned long addr,
> return 0;
> }
>
> +/*
> + * Give a chance to schedule after setting a key to 256 pages.
> + * We only hold the mm lock, which is a rwsem and the kvm srcu.
> + * Both can sleep.
> + */
> +static int __s390_enable_skey_pmd(pmd_t *pmd, unsigned long addr,
> + unsigned long next, struct mm_walk *walk)
> +{
> + cond_resched();
> + return 0;
> +}
> +
> static int __s390_enable_skey_hugetlb(pte_t *pte, unsigned long addr,
> unsigned long hmask, unsigned long next,
> struct mm_walk *walk)
> @@ -2630,12 +2642,14 @@ static int __s390_enable_skey_hugetlb(pte_t *pte, unsigned long addr,
> end = start + HPAGE_SIZE - 1;
> __storage_key_init_range(start, end);
> set_bit(PG_arch_1, &page->flags);
> + cond_resched();
> return 0;
> }
>
> static const struct mm_walk_ops enable_skey_walk_ops = {
> .hugetlb_entry = __s390_enable_skey_hugetlb,
> .pte_entry = __s390_enable_skey_pte,
> + .pmd_entry = __s390_enable_skey_pmd,
> };
>
> int s390_enable_skey(void)
> --
> 2.35.1
>
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH 0/2] s390/gmap/pgtable improve handling of keyed KVM guests
2022-05-30 9:27 [PATCH 0/2] s390/gmap/pgtable improve handling of keyed KVM guests Christian Borntraeger
2022-05-30 9:27 ` [PATCH 1/2] s390/gmap: voluntarily schedule during key setting Christian Borntraeger
2022-05-30 9:27 ` [PATCH 2/2] s390/pgtable: use non-quiescing sske for KVM switch to keyed Christian Borntraeger
@ 2022-05-31 8:28 ` Christian Borntraeger
2 siblings, 0 replies; 7+ messages in thread
From: Christian Borntraeger @ 2022-05-31 8:28 UTC (permalink / raw)
To: KVM
Cc: Janosch Frank, David Hildenbrand, linux-s390, Thomas Huth,
Claudio Imbrenda, Heiko Carstens, Vasily Gorbik,
Alexander Gordeev, Janis Schoetterl-Glausch
Am 30.05.22 um 11:27 schrieb Christian Borntraeger:
> These two patches try to address stalls/timeouts that we have seen when
> switching many guests to enable storage keys.
>
> Christian Borntraeger (2):
> s390/gmap: voluntarily schedule during key setting
> s390/pgtable: use non-quiescing sske for KVM switch to keyed
>
> arch/s390/mm/gmap.c | 14 ++++++++++++++
> arch/s390/mm/pgtable.c | 2 +-
> 2 files changed, 15 insertions(+), 1 deletion(-)
queued.
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2022-05-31 8:28 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2022-05-30 9:27 [PATCH 0/2] s390/gmap/pgtable improve handling of keyed KVM guests Christian Borntraeger
2022-05-30 9:27 ` [PATCH 1/2] s390/gmap: voluntarily schedule during key setting Christian Borntraeger
2022-05-30 9:56 ` Claudio Imbrenda
2022-05-31 7:23 ` Alexander Gordeev
2022-05-30 9:27 ` [PATCH 2/2] s390/pgtable: use non-quiescing sske for KVM switch to keyed Christian Borntraeger
2022-05-30 9:56 ` Claudio Imbrenda
2022-05-31 8:28 ` [PATCH 0/2] s390/gmap/pgtable improve handling of keyed KVM guests Christian Borntraeger
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox