From: Oleksii Kurochko <oleksii.kurochko@gmail.com>
To: Jan Beulich <jbeulich@suse.com>
Cc: "Alistair Francis" <alistair.francis@wdc.com>,
"Connor Davis" <connojdavis@gmail.com>,
"Andrew Cooper" <andrew.cooper3@citrix.com>,
"Anthony PERARD" <anthony.perard@vates.tech>,
"Michal Orzel" <michal.orzel@amd.com>,
"Julien Grall" <julien@xen.org>,
"Roger Pau Monné" <roger.pau@citrix.com>,
"Stefano Stabellini" <sstabellini@kernel.org>,
"Romain Caritey" <Romain.Caritey@microchip.com>,
xen-devel@lists.xenproject.org
Subject: Re: [PATCH v2 05/16] xen/riscv: introduce tracking of pending vCPU interrupts, part 2
Date: Mon, 2 Feb 2026 11:50:23 +0100 [thread overview]
Message-ID: <5fd2bbce-7d27-4f54-8eed-5bee0d2a6dd2@gmail.com> (raw)
In-Reply-To: <1bd0726d-20d8-4506-bb8e-849fd8b091a7@suse.com>
On 1/29/26 6:01 PM, Jan Beulich wrote:
> On 22.01.2026 17:47, Oleksii Kurochko wrote:
>> This patch is based on Linux kernel 6.16.0.
>>
>> Add the consumer side (vcpu_flush_interrupts()) of the lockless pending
>> interrupt tracking introduced in part 1 (for producers). According, to the
>> design only one consumer is possible, and it is vCPU itself.
>> vcpu_flush_interrupts() is expected to be ran (as guests aren't ran now due
>> to the lack of functionality) before the hypervisor returns control to the
>> guest.
>>
>> Producers may set bits in irqs_pending_mask without a lock. Clearing bits in
>> irqs_pending_mask is performed only by the consumer via xchg() (with aquire &
>> release semantics). The consumer must not write to irqs_pending and must not
>> act on bits that are not set in the mask. Otherwise, extra synchronization
>> should be provided.
>> The worst thing which could happen with such approach is that a new pending
>> bit will be set to irqs_pending bitmap during update of hvip variable in
>> vcpu_flush_interrupt() but it isn't problem as the new pending bit won't
>> be lost and just be proceded during the next flush.
>>
>> It is possible a guest could have pending bit not result in the hardware
>> register without to be marked pending in irq_pending bitmap as:
>> According to the RISC-V ISA specification:
>> Bits hip.VSSIP and hie.VSSIE are the interrupt-pending and
>> interrupt-enable bits for VS-level software interrupts. VSSIP in hip
>> is an alias (writable) of the same bit in hvip.
>> Additionally:
>> When bit 2 of hideleg is zero, vsip.SSIP and vsie.SSIE are read-only
>> zeros. Else, vsip.SSIP and vsie.SSIE are aliases of hip.VSSIP and
>> hie.VSSIE.
>> This means the guest may modify vsip.SSIP, which implicitly updates
>> hip.VSSIP and the bit being writable with 1 would also trigger an interrupt
>> as according to the RISC-V spec:
>> These conditions for an interrupt trap to occur must be evaluated in a
>> bounded amount of time from when an interrupt becomes, or ceases to be,
>> pending in sip, and must also be evaluated immediately following the
>> execution of an SRET instruction or an explicit write to a CSR on which
>> these interrupt trap conditions expressly depend (including sip, sie and
>> sstatus).
>> What means that IRQ_VS_SOFT must be synchronized separately, what is done
>> in vcpu_sync_interrupts().
> And this function is going to be used from where? Exit from guest into the
> hypervisor? Whereas vcpu_flush_interrupt() is to be called ahead of re-
> entering the guest?
Both of them are called before returning control to a guest (missed to mention
that in the commit message) in do_trap() at the end:
static void check_for_pcpu_work(void)
{
...
vcpu_flush_interrupts(current);
vcpu_sync_interrupts(current);
}
void do_trap(struct cpu_user_regs *cpu_regs)
{
...
if ( cpu_regs->hstatus & HSTATUS_SPV )
check_for_pcpu_work();
}
>
> I ask because vcpu_sync_interrupts() very much looks like a producer to me,
> yet the patch here supposedly is the consumer side.
Yes, vcpu_sync_interrupts() should be in producer side, I'll move it to the prev.
patch.
>
>> --- a/xen/arch/riscv/domain.c
>> +++ b/xen/arch/riscv/domain.c
>> @@ -171,3 +171,68 @@ int vcpu_unset_interrupt(struct vcpu *v, unsigned int irq)
>>
>> return 0;
>> }
>> +
>> +static void vcpu_update_hvip(struct vcpu *v)
> Pointer-to-const?
>
>> +{
>> + csr_write(CSR_HVIP, v->arch.hvip);
>> +}
>> +
>> +void vcpu_flush_interrupts(struct vcpu *v)
>> +{
>> + register_t *hvip = &v->arch.hvip;
>> +
>> + unsigned long mask, val;
> These are used ...
>
>> + if ( ACCESS_ONCE(v->arch.irqs_pending_mask[0]) )
>> + {
>> + mask = xchg(&v->arch.irqs_pending_mask[0], 0UL);
>> + val = ACCESS_ONCE(v->arch.irqs_pending[0]) & mask;
>> +
>> + *hvip &= ~mask;
>> + *hvip |= val;
> ... solely in this more narrow scope.
I'll declare them inside the if().
>
>> + }
>> +
>> + /*
>> + * Flush AIA high interrupts.
>> + *
>> + * It is necessary to do only for CONFIG_RISCV_32 which isn't supported
>> + * now.
>> + */
>> +#ifdef CONFIG_RISCV_32
>> +# error "Update hviph"
>> +#endif
>> +
>> + vcpu_update_hvip(v);
> Why would bits for which the mask bit wasn't set be written here?
This function inside uses only v->arch.hvip which is updated above according to
the mask.
>
>> +void vcpu_sync_interrupts(struct vcpu *v)
>> +{
>> + unsigned long hvip;
>> +
>> + /* Read current HVIP and VSIE CSRs */
>> + v->arch.vsie = csr_read(CSR_VSIE);
>> +
>> + /* Sync-up HVIP.VSSIP bit changes does by Guest */
> Nit: s/does/done/ ?
>
>> + hvip = csr_read(CSR_HVIP);
>> + if ( (v->arch.hvip ^ hvip) & BIT(IRQ_VS_SOFT, UL) )
>> + {
>> + if ( !test_and_set_bit(IRQ_VS_SOFT,
>> + &v->arch.irqs_pending_mask) )
> Why two separate, nested if()s?
Do you mean that it could be:
if ( !test_and_set_bit(IRQ_VS_SOFT, &v->arch.irqs_pending_mask) && (hvip & BIT(IRQ_VS_SOFT, UL))
?
>
>> + {
>> + if ( hvip & BIT(IRQ_VS_SOFT, UL) )
>> + set_bit(IRQ_VS_SOFT, &v->arch.irqs_pending);
>> + else
>> + clear_bit(IRQ_VS_SOFT, &v->arch.irqs_pending);
>> + }
> In the previous patch you set forth strict ordering rules, with a barrier in
> the middle. All of this is violated here.
It still respects the rule that the producer (|vcpu_sync_interrupts()| which
should be in the producer path) never clears the mask and only writes to
|irqs_pending| if it is the one that flipped the corresponding mask bit from 0
to 1.
Considering that the consumer cannot be called concurrently in this case
(since|vcpu_flush_interrupts()| and|vcpu_sync_interrupts()| are only invoked
sequentially in|check_for_pcpu_work()|, as mentioned above), nothing can
clear a bit in the mask in between. Therefore, I think it is acceptable to
slightly bend (and it should be explained in the comment above the
function or in the commit message) the rule that the|irqs_pending| bit must
be written first, followed by updating the corresponding bit in
|irqs_pending_mask() specifically for |vcpu_sync_interrupts().
>
>> + }
>> +
>> + /*
>> + * Sync-up AIA high interrupts.
>> + *
>> + * It is necessary to do only for CONFIG_RISCV_32 which isn't supported
>> + * now.
>> + */
>> +#ifdef CONFIG_RISCV_32
>> +# error "Update vsieh"
>> +#endif
> Here you mean the register or the struct vcpu field? It may be helpful to
> disambiguate; assuming it's the latter, simply spell out v->arch.vsieh?
> (Same then for the similar code in vcpu_flush_interrupts().)
Agree, it would be better.
Thanks.
~ Oleksii
next prev parent reply other threads:[~2026-02-02 10:50 UTC|newest]
Thread overview: 65+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-01-22 16:47 [PATCH v2 00/16] xen/riscv: introduce vtimer related things Oleksii Kurochko
2026-01-22 16:47 ` [PATCH v2 01/16] xen/riscv: introduce struct arch_vcpu Oleksii Kurochko
2026-01-26 11:41 ` Jan Beulich
2026-01-26 12:30 ` Oleksii Kurochko
2026-01-26 12:53 ` Jan Beulich
2026-01-26 14:22 ` Oleksii Kurochko
2026-01-22 16:47 ` [PATCH v2 02/16] xen/riscv: implement arch_vcpu_{create,destroy}() Oleksii Kurochko
2026-01-23 11:30 ` Teddy Astie
2026-01-26 9:00 ` Oleksii Kurochko
2026-01-26 11:47 ` Jan Beulich
2026-01-26 12:07 ` Oleksii Kurochko
2026-01-22 16:47 ` [PATCH v2 03/16] xen/riscv: implement vcpu_csr_init() Oleksii Kurochko
2026-01-24 22:44 ` Teddy Astie
2026-01-26 8:36 ` Jan Beulich
2026-01-26 9:47 ` Oleksii Kurochko
2026-01-26 9:54 ` Jan Beulich
2026-01-26 9:39 ` Oleksii Kurochko
2026-01-22 16:47 ` [PATCH v2 04/16] xen/riscv: introduce tracking of pending vCPU interrupts, part 1 Oleksii Kurochko
2026-01-29 16:44 ` Jan Beulich
2026-02-02 10:16 ` Oleksii Kurochko
2026-01-22 16:47 ` [PATCH v2 05/16] xen/riscv: introduce tracking of pending vCPU interrupts, part 2 Oleksii Kurochko
2026-01-29 17:01 ` Jan Beulich
2026-02-02 10:50 ` Oleksii Kurochko [this message]
2026-02-02 14:22 ` Jan Beulich
2026-01-22 16:47 ` [PATCH v2 06/16] xen/time: move ticks<->ns helpers to common code Oleksii Kurochko
2026-01-29 8:48 ` Jan Beulich
2026-02-04 8:13 ` Jan Beulich
2026-02-04 9:00 ` Oleksii Kurochko
2026-01-22 16:47 ` [PATCH v2 07/16] xen/riscv: introduce basic vtimer infrastructure for guests Oleksii Kurochko
2026-02-03 15:47 ` Jan Beulich
2026-02-03 16:55 ` Oleksii Kurochko
2026-02-03 17:04 ` Jan Beulich
2026-01-22 16:47 ` [PATCH v2 08/16] xen/riscv: add temporary stub for smp_send_event_check_mask() Oleksii Kurochko
2026-01-29 16:32 ` Jan Beulich
2026-01-29 16:46 ` Oleksii Kurochko
2026-01-22 16:47 ` [PATCH v2 09/16] xen/riscv: introduce vcpu_kick() implementation Oleksii Kurochko
2026-02-06 16:36 ` Oleksii Kurochko
2026-02-09 9:07 ` Jan Beulich
2026-02-09 9:40 ` Oleksii Kurochko
2026-02-09 9:51 ` Jan Beulich
2026-02-09 12:35 ` Oleksii Kurochko
2026-02-09 12:54 ` Jan Beulich
2026-01-22 16:47 ` [PATCH v2 10/16] xen/riscv: add vtimer context switch helpers Oleksii Kurochko
2026-02-03 16:43 ` Jan Beulich
2026-02-03 16:56 ` Oleksii Kurochko
2026-01-22 16:47 ` [PATCH v2 11/16] xen/riscv: implement SBI legacy SET_TIMER support for guests Oleksii Kurochko
2026-01-22 16:47 ` [PATCH v2 12/16] xen/riscv: introduce sbi_set_timer() Oleksii Kurochko
2026-02-03 17:02 ` Jan Beulich
2026-02-04 9:29 ` Oleksii Kurochko
2026-02-04 10:11 ` Jan Beulich
2026-02-04 6:50 ` Jan Beulich
2026-02-04 9:36 ` Oleksii Kurochko
2026-01-22 16:47 ` [PATCH v2 13/16] xen/riscv: implement reprogram_timer() via SBI Oleksii Kurochko
2026-01-24 23:13 ` Teddy Astie
2026-01-26 10:20 ` Oleksii Kurochko
2026-02-04 10:39 ` Jan Beulich
2026-02-05 16:25 ` Oleksii Kurochko
2026-01-22 16:47 ` [PATCH v2 14/16] xen/riscv: handle hypervisor timer interrupts Oleksii Kurochko
2026-02-04 10:42 ` Jan Beulich
2026-01-22 16:47 ` [PATCH v2 15/16] xen/riscv: init tasklet subsystem Oleksii Kurochko
2026-01-29 16:33 ` Jan Beulich
2026-01-22 16:47 ` [PATCH v2 16/16] xen/riscv: implement sync_vcpu_execstate() Oleksii Kurochko
2026-02-04 10:49 ` Jan Beulich
2026-02-05 16:51 ` Oleksii Kurochko
2026-02-05 16:53 ` Jan Beulich
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5fd2bbce-7d27-4f54-8eed-5bee0d2a6dd2@gmail.com \
--to=oleksii.kurochko@gmail.com \
--cc=Romain.Caritey@microchip.com \
--cc=alistair.francis@wdc.com \
--cc=andrew.cooper3@citrix.com \
--cc=anthony.perard@vates.tech \
--cc=connojdavis@gmail.com \
--cc=jbeulich@suse.com \
--cc=julien@xen.org \
--cc=michal.orzel@amd.com \
--cc=roger.pau@citrix.com \
--cc=sstabellini@kernel.org \
--cc=xen-devel@lists.xenproject.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.