From: Christoffer Dall <christoffer.dall@linaro.org>
To: Marc Zyngier <marc.zyngier@arm.com>
Cc: Andre Przywara <andre.przywara@arm.com>,
kvm@vger.kernel.org, linux-arm-kernel@lists.infradead.org,
kvmarm@lists.cs.columbia.edu
Subject: Re: [PATCH] KVM: arm/arm64: vgic: Prevent VGIC_ADDR_TO_INTID from emiting divisions
Date: Tue, 1 Nov 2016 16:28:49 +0100 [thread overview]
Message-ID: <20161101152849.GC13677@cbox> (raw)
In-Reply-To: <20161029111901.16668-1-marc.zyngier@arm.com>
On Sat, Oct 29, 2016 at 12:19:01PM +0100, Marc Zyngier wrote:
> Using non-constant number of bits for VGIC_ADDR_TO_INTID() leads
> to gcc 6.1 emiting calls to __aeabi_uldivmod, which the kernel
> does not implement.
>
> As we really don't want to implement complex division in the kernel,
> the only other option is to prove to the compiler that there is only
> a few values that are possible for the number of bits per IRQ, and
> that they are all power of 2.
>
> We turn the VGIC_ADDR_TO_INTID macro into a switch that looks for
> the supported set of values (1, 2, 8, 64), and perform the computation
> accordingly. When "bits" is a constant, the compiler optimizes
> away the other cases. If not, we end-up with a small number of cases
> that GCC optimises reasonably well. Out of range values are detected
> both at build time (constants) and at run time (variables).
>
> Signed-off-by: Marc Zyngier <marc.zyngier@arm.com>
> ---
> This should be applied *before* Andre's patch fixing out of bound SPIs.
>
> virt/kvm/arm/vgic/vgic-mmio.h | 33 ++++++++++++++++++++++++++++++++-
> 1 file changed, 32 insertions(+), 1 deletion(-)
>
> diff --git a/virt/kvm/arm/vgic/vgic-mmio.h b/virt/kvm/arm/vgic/vgic-mmio.h
> index 4c34d39..a457282 100644
> --- a/virt/kvm/arm/vgic/vgic-mmio.h
> +++ b/virt/kvm/arm/vgic/vgic-mmio.h
> @@ -57,10 +57,41 @@ extern struct kvm_io_device_ops kvm_io_gic_ops;
> * multiplication with the inverted fraction, and scale up both the
> * numerator and denominator with 8 to support at most 64 bits per IRQ:
> */
> -#define VGIC_ADDR_TO_INTID(addr, bits) (((addr) & VGIC_ADDR_IRQ_MASK(bits)) * \
> +#define __VGIC_ADDR_INTID(addr, bits) (((addr) & VGIC_ADDR_IRQ_MASK(bits)) * \
> 64 / (bits) / 8)
>
> /*
> + * Perform the same computation, but also handle non-constant number
> + * of bits. We only care about the few cases that are required by
> + * GICv2/v3.
> + */
> +#define VGIC_ADDR_TO_INTID(addr, bits) \
> + ({ \
> + u32 __v; \
> + switch((bits)) { \
> + case 1: \
> + __v = __VGIC_ADDR_INTID((addr), 1); \
> + break; \
> + case 2: \
> + __v = __VGIC_ADDR_INTID((addr), 2); \
> + break; \
> + case 8: \
> + __v = __VGIC_ADDR_INTID((addr), 8); \
> + break; \
> + case 64: \
> + __v = __VGIC_ADDR_INTID((addr), 64); \
> + break; \
> + default: \
> + if (__builtin_constant_p((bits))) \
> + BUILD_BUG(); \
> + else \
> + BUG(); \
> + } \
> + \
> + __v; \
> + })
> +
> +/*
> * Some VGIC registers store per-IRQ information, with a different number
> * of bits per IRQ. For those registers this macro is used.
> * The _WITH_LENGTH version instantiates registers with a fixed length
> --
> 2.9.3
>
Looks functionally correct, just wondering if it's cleaner to turn the
whole thing into a static inline, or if it can be rewritten to use
shifts with any benefit.
In any case, if you like this version:
Acked-by: Christoffer Dall <christoffer.dall@linaro.org>
WARNING: multiple messages have this Message-ID (diff)
From: christoffer.dall@linaro.org (Christoffer Dall)
To: linux-arm-kernel@lists.infradead.org
Subject: [PATCH] KVM: arm/arm64: vgic: Prevent VGIC_ADDR_TO_INTID from emiting divisions
Date: Tue, 1 Nov 2016 16:28:49 +0100 [thread overview]
Message-ID: <20161101152849.GC13677@cbox> (raw)
In-Reply-To: <20161029111901.16668-1-marc.zyngier@arm.com>
On Sat, Oct 29, 2016 at 12:19:01PM +0100, Marc Zyngier wrote:
> Using non-constant number of bits for VGIC_ADDR_TO_INTID() leads
> to gcc 6.1 emiting calls to __aeabi_uldivmod, which the kernel
> does not implement.
>
> As we really don't want to implement complex division in the kernel,
> the only other option is to prove to the compiler that there is only
> a few values that are possible for the number of bits per IRQ, and
> that they are all power of 2.
>
> We turn the VGIC_ADDR_TO_INTID macro into a switch that looks for
> the supported set of values (1, 2, 8, 64), and perform the computation
> accordingly. When "bits" is a constant, the compiler optimizes
> away the other cases. If not, we end-up with a small number of cases
> that GCC optimises reasonably well. Out of range values are detected
> both at build time (constants) and at run time (variables).
>
> Signed-off-by: Marc Zyngier <marc.zyngier@arm.com>
> ---
> This should be applied *before* Andre's patch fixing out of bound SPIs.
>
> virt/kvm/arm/vgic/vgic-mmio.h | 33 ++++++++++++++++++++++++++++++++-
> 1 file changed, 32 insertions(+), 1 deletion(-)
>
> diff --git a/virt/kvm/arm/vgic/vgic-mmio.h b/virt/kvm/arm/vgic/vgic-mmio.h
> index 4c34d39..a457282 100644
> --- a/virt/kvm/arm/vgic/vgic-mmio.h
> +++ b/virt/kvm/arm/vgic/vgic-mmio.h
> @@ -57,10 +57,41 @@ extern struct kvm_io_device_ops kvm_io_gic_ops;
> * multiplication with the inverted fraction, and scale up both the
> * numerator and denominator with 8 to support at most 64 bits per IRQ:
> */
> -#define VGIC_ADDR_TO_INTID(addr, bits) (((addr) & VGIC_ADDR_IRQ_MASK(bits)) * \
> +#define __VGIC_ADDR_INTID(addr, bits) (((addr) & VGIC_ADDR_IRQ_MASK(bits)) * \
> 64 / (bits) / 8)
>
> /*
> + * Perform the same computation, but also handle non-constant number
> + * of bits. We only care about the few cases that are required by
> + * GICv2/v3.
> + */
> +#define VGIC_ADDR_TO_INTID(addr, bits) \
> + ({ \
> + u32 __v; \
> + switch((bits)) { \
> + case 1: \
> + __v = __VGIC_ADDR_INTID((addr), 1); \
> + break; \
> + case 2: \
> + __v = __VGIC_ADDR_INTID((addr), 2); \
> + break; \
> + case 8: \
> + __v = __VGIC_ADDR_INTID((addr), 8); \
> + break; \
> + case 64: \
> + __v = __VGIC_ADDR_INTID((addr), 64); \
> + break; \
> + default: \
> + if (__builtin_constant_p((bits))) \
> + BUILD_BUG(); \
> + else \
> + BUG(); \
> + } \
> + \
> + __v; \
> + })
> +
> +/*
> * Some VGIC registers store per-IRQ information, with a different number
> * of bits per IRQ. For those registers this macro is used.
> * The _WITH_LENGTH version instantiates registers with a fixed length
> --
> 2.9.3
>
Looks functionally correct, just wondering if it's cleaner to turn the
whole thing into a static inline, or if it can be rewritten to use
shifts with any benefit.
In any case, if you like this version:
Acked-by: Christoffer Dall <christoffer.dall@linaro.org>
next prev parent reply other threads:[~2016-11-01 15:28 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-10-29 11:19 [PATCH] KVM: arm/arm64: vgic: Prevent VGIC_ADDR_TO_INTID from emiting divisions Marc Zyngier
2016-10-29 11:19 ` Marc Zyngier
2016-11-01 15:28 ` Christoffer Dall [this message]
2016-11-01 15:28 ` Christoffer Dall
2016-11-01 16:50 ` Andre Przywara
2016-11-01 16:50 ` Andre Przywara
2016-11-01 19:31 ` Christoffer Dall
2016-11-01 19:31 ` Christoffer Dall
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20161101152849.GC13677@cbox \
--to=christoffer.dall@linaro.org \
--cc=andre.przywara@arm.com \
--cc=kvm@vger.kernel.org \
--cc=kvmarm@lists.cs.columbia.edu \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=marc.zyngier@arm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.