* [PATCH] x86/xen/mmu: Increase MAX_CONTIG_ORDER @ 2024-12-04 17:14 Thierry Escande 2024-12-09 10:04 ` Jan Beulich 0 siblings, 1 reply; 10+ messages in thread From: Thierry Escande @ 2024-12-04 17:14 UTC (permalink / raw) To: Juergen Gross, linux-kernel Cc: sstabellini, oleksandr_tyshchenko, jbeulich, xen-devel With change 9f40ec84a797 (xen/swiotlb: add alignment check for dma buffers), the driver mpt3sas fails to load because it cannot allocate its DMA pool for an allocation size of ~2,3 MBytes. This is because the alignement check added by 9f40ec84a797 fails and xen_swiotlb_alloc_coherent() ends up calling xen_create_contiguous_region() with a size order of 10 which is too high for the current max value. This patch increases the MAX_CONTIG_ORDER from 9 to 10 (4MB) to allow such allocations. Signed-off-by: Thierry Escande <thierry.escande@vates.tech> --- arch/x86/xen/mmu_pv.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/arch/x86/xen/mmu_pv.c b/arch/x86/xen/mmu_pv.c index 55a4996d0c04..7f110740e1a2 100644 --- a/arch/x86/xen/mmu_pv.c +++ b/arch/x86/xen/mmu_pv.c @@ -2200,7 +2200,7 @@ void __init xen_init_mmu_ops(void) } /* Protected by xen_reservation_lock. */ -#define MAX_CONTIG_ORDER 9 /* 2MB */ +#define MAX_CONTIG_ORDER 10 /* 4MB */ static unsigned long discontig_frames[1<<MAX_CONTIG_ORDER]; #define VOID_PTE (mfn_pte(0, __pgprot(0))) -- 2.45.2 ^ permalink raw reply related [flat|nested] 10+ messages in thread
* Re: [PATCH] x86/xen/mmu: Increase MAX_CONTIG_ORDER 2024-12-04 17:14 [PATCH] x86/xen/mmu: Increase MAX_CONTIG_ORDER Thierry Escande @ 2024-12-09 10:04 ` Jan Beulich 2024-12-11 18:20 ` Thierry Escande 0 siblings, 1 reply; 10+ messages in thread From: Jan Beulich @ 2024-12-09 10:04 UTC (permalink / raw) To: Thierry Escande Cc: sstabellini, oleksandr_tyshchenko, xen-devel, Juergen Gross, linux-kernel On 04.12.2024 18:14, Thierry Escande wrote: > With change 9f40ec84a797 (xen/swiotlb: add alignment check for dma > buffers), the driver mpt3sas fails to load because it cannot allocate > its DMA pool for an allocation size of ~2,3 MBytes. This is because the > alignement check added by 9f40ec84a797 fails and > xen_swiotlb_alloc_coherent() ends up calling > xen_create_contiguous_region() with a size order of 10 which is too high > for the current max value. > > This patch increases the MAX_CONTIG_ORDER from 9 to 10 (4MB) to allow > such allocations. > > Signed-off-by: Thierry Escande <thierry.escande@vates.tech> > --- > arch/x86/xen/mmu_pv.c | 2 +- > 1 file changed, 1 insertion(+), 1 deletion(-) > > diff --git a/arch/x86/xen/mmu_pv.c b/arch/x86/xen/mmu_pv.c > index 55a4996d0c04..7f110740e1a2 100644 > --- a/arch/x86/xen/mmu_pv.c > +++ b/arch/x86/xen/mmu_pv.c > @@ -2200,7 +2200,7 @@ void __init xen_init_mmu_ops(void) > } > > /* Protected by xen_reservation_lock. */ > -#define MAX_CONTIG_ORDER 9 /* 2MB */ > +#define MAX_CONTIG_ORDER 10 /* 4MB */ > static unsigned long discontig_frames[1<<MAX_CONTIG_ORDER]; While lacking respective commentary, bumping this value imo also needs to take into account Xen itself, at least commit-message-wise. The bumping is fine for Dom0 in any event. It is also fine for DomU-s with the defaults built into the hypervisor (orders 12 and 10 respectively for x86 and Arm), yet especially for Arm (and in the future PPC and RISC-V) any further bumping would be less straightforward. However - does the driver really need this big a contiguous chunk? It would seem far more desirable to me to break that up some, if possible. Jan ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PATCH] x86/xen/mmu: Increase MAX_CONTIG_ORDER 2024-12-09 10:04 ` Jan Beulich @ 2024-12-11 18:20 ` Thierry Escande 0 siblings, 0 replies; 10+ messages in thread From: Thierry Escande @ 2024-12-11 18:20 UTC (permalink / raw) To: Jan Beulich Cc: sstabellini, oleksandr_tyshchenko, xen-devel, Juergen Gross, linux-kernel Hi Jan, On 09/12/2024 11:04, Jan Beulich wrote: > On 04.12.2024 18:14, Thierry Escande wrote: >> With change 9f40ec84a797 (xen/swiotlb: add alignment check for dma >> buffers), the driver mpt3sas fails to load because it cannot allocate >> its DMA pool for an allocation size of ~2,3 MBytes. This is because the >> alignement check added by 9f40ec84a797 fails and >> xen_swiotlb_alloc_coherent() ends up calling >> xen_create_contiguous_region() with a size order of 10 which is too high >> for the current max value. >> >> This patch increases the MAX_CONTIG_ORDER from 9 to 10 (4MB) to allow >> such allocations. >> >> Signed-off-by: Thierry Escande <thierry.escande@vates.tech> >> --- >> arch/x86/xen/mmu_pv.c | 2 +- >> 1 file changed, 1 insertion(+), 1 deletion(-) >> >> diff --git a/arch/x86/xen/mmu_pv.c b/arch/x86/xen/mmu_pv.c >> index 55a4996d0c04..7f110740e1a2 100644 >> --- a/arch/x86/xen/mmu_pv.c >> +++ b/arch/x86/xen/mmu_pv.c >> @@ -2200,7 +2200,7 @@ void __init xen_init_mmu_ops(void) >> } >> >> /* Protected by xen_reservation_lock. */ >> -#define MAX_CONTIG_ORDER 9 /* 2MB */ >> +#define MAX_CONTIG_ORDER 10 /* 4MB */ >> static unsigned long discontig_frames[1<<MAX_CONTIG_ORDER]; > > While lacking respective commentary, bumping this value imo also needs to > take into account Xen itself, at least commit-message-wise. The bumping is > fine for Dom0 in any event. It is also fine for DomU-s with the defaults > built into the hypervisor (orders 12 and 10 respectively for x86 and Arm), > yet especially for Arm (and in the future PPC and RISC-V) any further > bumping would be less straightforward. Thanks for pointing this out. On the Xen side, CONFIG_CTLDOM_MAX_ORDER and CONFIG_HWDOM_MAX_ORDER seem big enough on all architectures. But I see CONFIG_DOMU_MAX_ORDER set to 9 (also all archs). Won't that be a problem for drivers trying to allocate more than that from a domU ? > However - does the driver really need this big a contiguous chunk? It > would seem far more desirable to me to break that up some, if possible. Since this works on bare metal I'm afraid the driver maintainer (mpt fusion driver) will just tell me to fix Xen. Regards, Thierry > > Jan ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PATCH] x86/xen/mmu: Increase MAX_CONTIG_ORDER @ 2024-12-11 18:20 ` Thierry Escande 0 siblings, 0 replies; 10+ messages in thread From: Thierry Escande @ 2024-12-11 18:20 UTC (permalink / raw) To: Jan Beulich Cc: sstabellini, oleksandr_tyshchenko, xen-devel, Juergen Gross, linux-kernel Hi Jan, On 09/12/2024 11:04, Jan Beulich wrote: > On 04.12.2024 18:14, Thierry Escande wrote: >> With change 9f40ec84a797 (xen/swiotlb: add alignment check for dma >> buffers), the driver mpt3sas fails to load because it cannot allocate >> its DMA pool for an allocation size of ~2,3 MBytes. This is because the >> alignement check added by 9f40ec84a797 fails and >> xen_swiotlb_alloc_coherent() ends up calling >> xen_create_contiguous_region() with a size order of 10 which is too high >> for the current max value. >> >> This patch increases the MAX_CONTIG_ORDER from 9 to 10 (4MB) to allow >> such allocations. >> >> Signed-off-by: Thierry Escande <thierry.escande@vates.tech> >> --- >> arch/x86/xen/mmu_pv.c | 2 +- >> 1 file changed, 1 insertion(+), 1 deletion(-) >> >> diff --git a/arch/x86/xen/mmu_pv.c b/arch/x86/xen/mmu_pv.c >> index 55a4996d0c04..7f110740e1a2 100644 >> --- a/arch/x86/xen/mmu_pv.c >> +++ b/arch/x86/xen/mmu_pv.c >> @@ -2200,7 +2200,7 @@ void __init xen_init_mmu_ops(void) >> } >> >> /* Protected by xen_reservation_lock. */ >> -#define MAX_CONTIG_ORDER 9 /* 2MB */ >> +#define MAX_CONTIG_ORDER 10 /* 4MB */ >> static unsigned long discontig_frames[1<<MAX_CONTIG_ORDER]; > > While lacking respective commentary, bumping this value imo also needs to > take into account Xen itself, at least commit-message-wise. The bumping is > fine for Dom0 in any event. It is also fine for DomU-s with the defaults > built into the hypervisor (orders 12 and 10 respectively for x86 and Arm), > yet especially for Arm (and in the future PPC and RISC-V) any further > bumping would be less straightforward. Thanks for pointing this out. On the Xen side, CONFIG_CTLDOM_MAX_ORDER and CONFIG_HWDOM_MAX_ORDER seem big enough on all architectures. But I see CONFIG_DOMU_MAX_ORDER set to 9 (also all archs). Won't that be a problem for drivers trying to allocate more than that from a domU ? > However - does the driver really need this big a contiguous chunk? It > would seem far more desirable to me to break that up some, if possible. Since this works on bare metal I'm afraid the driver maintainer (mpt fusion driver) will just tell me to fix Xen. Regards, Thierry > > Jan ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PATCH] x86/xen/mmu: Increase MAX_CONTIG_ORDER 2024-12-11 18:20 ` Thierry Escande (?) @ 2024-12-12 10:22 ` Jan Beulich 2024-12-12 11:09 ` Juergen Gross -1 siblings, 1 reply; 10+ messages in thread From: Jan Beulich @ 2024-12-12 10:22 UTC (permalink / raw) To: Thierry Escande Cc: sstabellini, oleksandr_tyshchenko, xen-devel, Juergen Gross, linux-kernel On 11.12.2024 19:20, Thierry Escande wrote: > Hi Jan, > > On 09/12/2024 11:04, Jan Beulich wrote: >> On 04.12.2024 18:14, Thierry Escande wrote: >>> With change 9f40ec84a797 (xen/swiotlb: add alignment check for dma >>> buffers), the driver mpt3sas fails to load because it cannot allocate >>> its DMA pool for an allocation size of ~2,3 MBytes. This is because the >>> alignement check added by 9f40ec84a797 fails and >>> xen_swiotlb_alloc_coherent() ends up calling >>> xen_create_contiguous_region() with a size order of 10 which is too high >>> for the current max value. >>> >>> This patch increases the MAX_CONTIG_ORDER from 9 to 10 (4MB) to allow >>> such allocations. >>> >>> Signed-off-by: Thierry Escande <thierry.escande@vates.tech> >>> --- >>> arch/x86/xen/mmu_pv.c | 2 +- >>> 1 file changed, 1 insertion(+), 1 deletion(-) >>> >>> diff --git a/arch/x86/xen/mmu_pv.c b/arch/x86/xen/mmu_pv.c >>> index 55a4996d0c04..7f110740e1a2 100644 >>> --- a/arch/x86/xen/mmu_pv.c >>> +++ b/arch/x86/xen/mmu_pv.c >>> @@ -2200,7 +2200,7 @@ void __init xen_init_mmu_ops(void) >>> } >>> >>> /* Protected by xen_reservation_lock. */ >>> -#define MAX_CONTIG_ORDER 9 /* 2MB */ >>> +#define MAX_CONTIG_ORDER 10 /* 4MB */ >>> static unsigned long discontig_frames[1<<MAX_CONTIG_ORDER]; >> >> While lacking respective commentary, bumping this value imo also needs to >> take into account Xen itself, at least commit-message-wise. The bumping is >> fine for Dom0 in any event. It is also fine for DomU-s with the defaults >> built into the hypervisor (orders 12 and 10 respectively for x86 and Arm), >> yet especially for Arm (and in the future PPC and RISC-V) any further >> bumping would be less straightforward. > > Thanks for pointing this out. On the Xen side, CONFIG_CTLDOM_MAX_ORDER > and CONFIG_HWDOM_MAX_ORDER seem big enough on all architectures. But I > see CONFIG_DOMU_MAX_ORDER set to 9 (also all archs). Won't that be a > problem for drivers trying to allocate more than that from a domU ? A driver assumes a (physical) device to be in the DomU, at which point it is CONFIG_PTDOM_MAX_ORDER which applies (PT standing for pass-through). >> However - does the driver really need this big a contiguous chunk? It >> would seem far more desirable to me to break that up some, if possible. > > Since this works on bare metal I'm afraid the driver maintainer (mpt > fusion driver) will just tell me to fix Xen. Well. The bigger such allocations, the larger the risk that on systems that have been up for a while such allocations can't be fulfilled anymore even in the bare metal case. Jan ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PATCH] x86/xen/mmu: Increase MAX_CONTIG_ORDER 2024-12-12 10:22 ` Jan Beulich @ 2024-12-12 11:09 ` Juergen Gross 2024-12-18 11:11 ` Thierry Escande 0 siblings, 1 reply; 10+ messages in thread From: Juergen Gross @ 2024-12-12 11:09 UTC (permalink / raw) To: Jan Beulich, Thierry Escande Cc: sstabellini, oleksandr_tyshchenko, xen-devel, linux-kernel [-- Attachment #1.1.1: Type: text/plain, Size: 2957 bytes --] On 12.12.24 11:22, Jan Beulich wrote: > On 11.12.2024 19:20, Thierry Escande wrote: >> Hi Jan, >> >> On 09/12/2024 11:04, Jan Beulich wrote: >>> On 04.12.2024 18:14, Thierry Escande wrote: >>>> With change 9f40ec84a797 (xen/swiotlb: add alignment check for dma >>>> buffers), the driver mpt3sas fails to load because it cannot allocate >>>> its DMA pool for an allocation size of ~2,3 MBytes. This is because the >>>> alignement check added by 9f40ec84a797 fails and >>>> xen_swiotlb_alloc_coherent() ends up calling >>>> xen_create_contiguous_region() with a size order of 10 which is too high >>>> for the current max value. >>>> >>>> This patch increases the MAX_CONTIG_ORDER from 9 to 10 (4MB) to allow >>>> such allocations. >>>> >>>> Signed-off-by: Thierry Escande <thierry.escande@vates.tech> >>>> --- >>>> arch/x86/xen/mmu_pv.c | 2 +- >>>> 1 file changed, 1 insertion(+), 1 deletion(-) >>>> >>>> diff --git a/arch/x86/xen/mmu_pv.c b/arch/x86/xen/mmu_pv.c >>>> index 55a4996d0c04..7f110740e1a2 100644 >>>> --- a/arch/x86/xen/mmu_pv.c >>>> +++ b/arch/x86/xen/mmu_pv.c >>>> @@ -2200,7 +2200,7 @@ void __init xen_init_mmu_ops(void) >>>> } >>>> >>>> /* Protected by xen_reservation_lock. */ >>>> -#define MAX_CONTIG_ORDER 9 /* 2MB */ >>>> +#define MAX_CONTIG_ORDER 10 /* 4MB */ >>>> static unsigned long discontig_frames[1<<MAX_CONTIG_ORDER]; >>> >>> While lacking respective commentary, bumping this value imo also needs to >>> take into account Xen itself, at least commit-message-wise. The bumping is >>> fine for Dom0 in any event. It is also fine for DomU-s with the defaults >>> built into the hypervisor (orders 12 and 10 respectively for x86 and Arm), >>> yet especially for Arm (and in the future PPC and RISC-V) any further >>> bumping would be less straightforward. >> >> Thanks for pointing this out. On the Xen side, CONFIG_CTLDOM_MAX_ORDER >> and CONFIG_HWDOM_MAX_ORDER seem big enough on all architectures. But I >> see CONFIG_DOMU_MAX_ORDER set to 9 (also all archs). Won't that be a >> problem for drivers trying to allocate more than that from a domU ? > > A driver assumes a (physical) device to be in the DomU, at which point it > is CONFIG_PTDOM_MAX_ORDER which applies (PT standing for pass-through). > >>> However - does the driver really need this big a contiguous chunk? It >>> would seem far more desirable to me to break that up some, if possible. >> >> Since this works on bare metal I'm afraid the driver maintainer (mpt >> fusion driver) will just tell me to fix Xen. > > Well. The bigger such allocations, the larger the risk that on systems > that have been up for a while such allocations can't be fulfilled anymore > even in the bare metal case. Yes. I don't think we should just work around this issue without having even tried to get the driver fixed. In case they refuse to change it, we can still increase MAX_CONTIG_ORDER. Juergen [-- Attachment #1.1.2: OpenPGP public key --] [-- Type: application/pgp-keys, Size: 3743 bytes --] [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 495 bytes --] ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PATCH] x86/xen/mmu: Increase MAX_CONTIG_ORDER 2024-12-12 11:09 ` Juergen Gross @ 2024-12-18 11:11 ` Thierry Escande 2024-12-18 11:24 ` Jürgen Groß 0 siblings, 1 reply; 10+ messages in thread From: Thierry Escande @ 2024-12-18 11:11 UTC (permalink / raw) To: Juergen Gross, Jan Beulich Cc: sstabellini, oleksandr_tyshchenko, xen-devel, linux-kernel On 12/12/2024 12:09, Juergen Gross wrote: > On 12.12.24 11:22, Jan Beulich wrote: >> On 11.12.2024 19:20, Thierry Escande wrote: >>> Hi Jan, >>> >>> On 09/12/2024 11:04, Jan Beulich wrote: >>>> On 04.12.2024 18:14, Thierry Escande wrote: >>>>> With change 9f40ec84a797 (xen/swiotlb: add alignment check for dma >>>>> buffers), the driver mpt3sas fails to load because it cannot allocate >>>>> its DMA pool for an allocation size of ~2,3 MBytes. This is because >>>>> the >>>>> alignement check added by 9f40ec84a797 fails and >>>>> xen_swiotlb_alloc_coherent() ends up calling >>>>> xen_create_contiguous_region() with a size order of 10 which is too >>>>> high >>>>> for the current max value. >>>>> >>>>> This patch increases the MAX_CONTIG_ORDER from 9 to 10 (4MB) to allow >>>>> such allocations. >>>>> >>>>> Signed-off-by: Thierry Escande <thierry.escande@vates.tech> >>>>> --- >>>>> arch/x86/xen/mmu_pv.c | 2 +- >>>>> 1 file changed, 1 insertion(+), 1 deletion(-) >>>>> >>>>> diff --git a/arch/x86/xen/mmu_pv.c b/arch/x86/xen/mmu_pv.c >>>>> index 55a4996d0c04..7f110740e1a2 100644 >>>>> --- a/arch/x86/xen/mmu_pv.c >>>>> +++ b/arch/x86/xen/mmu_pv.c >>>>> @@ -2200,7 +2200,7 @@ void __init xen_init_mmu_ops(void) >>>>> } >>>>> /* Protected by xen_reservation_lock. */ >>>>> -#define MAX_CONTIG_ORDER 9 /* 2MB */ >>>>> +#define MAX_CONTIG_ORDER 10 /* 4MB */ >>>>> static unsigned long discontig_frames[1<<MAX_CONTIG_ORDER]; >>>> >>>> While lacking respective commentary, bumping this value imo also >>>> needs to >>>> take into account Xen itself, at least commit-message-wise. The >>>> bumping is >>>> fine for Dom0 in any event. It is also fine for DomU-s with the >>>> defaults >>>> built into the hypervisor (orders 12 and 10 respectively for x86 and >>>> Arm), >>>> yet especially for Arm (and in the future PPC and RISC-V) any further >>>> bumping would be less straightforward. >>> >>> Thanks for pointing this out. On the Xen side, CONFIG_CTLDOM_MAX_ORDER >>> and CONFIG_HWDOM_MAX_ORDER seem big enough on all architectures. But I >>> see CONFIG_DOMU_MAX_ORDER set to 9 (also all archs). Won't that be a >>> problem for drivers trying to allocate more than that from a domU ? >> >> A driver assumes a (physical) device to be in the DomU, at which point it >> is CONFIG_PTDOM_MAX_ORDER which applies (PT standing for pass-through). >> >>>> However - does the driver really need this big a contiguous chunk? It >>>> would seem far more desirable to me to break that up some, if possible. >>> >>> Since this works on bare metal I'm afraid the driver maintainer (mpt >>> fusion driver) will just tell me to fix Xen. >> >> Well. The bigger such allocations, the larger the risk that on systems >> that have been up for a while such allocations can't be fulfilled anymore >> even in the bare metal case. > > Yes. I don't think we should just work around this issue without having > even tried to get the driver fixed. In case they refuse to change it, we > can still increase MAX_CONTIG_ORDER. Thanks for the feedback. I'll try to have a look at the driver if I have time to do so. Regards, Thierry ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PATCH] x86/xen/mmu: Increase MAX_CONTIG_ORDER 2024-12-18 11:11 ` Thierry Escande @ 2024-12-18 11:24 ` Jürgen Groß 2024-12-19 7:12 ` Jan Beulich 0 siblings, 1 reply; 10+ messages in thread From: Jürgen Groß @ 2024-12-18 11:24 UTC (permalink / raw) To: Thierry Escande, Jan Beulich Cc: sstabellini, oleksandr_tyshchenko, xen-devel, linux-kernel [-- Attachment #1.1.1: Type: text/plain, Size: 3645 bytes --] On 18.12.24 12:11, Thierry Escande wrote: > > > On 12/12/2024 12:09, Juergen Gross wrote: >> On 12.12.24 11:22, Jan Beulich wrote: >>> On 11.12.2024 19:20, Thierry Escande wrote: >>>> Hi Jan, >>>> >>>> On 09/12/2024 11:04, Jan Beulich wrote: >>>>> On 04.12.2024 18:14, Thierry Escande wrote: >>>>>> With change 9f40ec84a797 (xen/swiotlb: add alignment check for dma >>>>>> buffers), the driver mpt3sas fails to load because it cannot allocate >>>>>> its DMA pool for an allocation size of ~2,3 MBytes. This is because >>>>>> the >>>>>> alignement check added by 9f40ec84a797 fails and >>>>>> xen_swiotlb_alloc_coherent() ends up calling >>>>>> xen_create_contiguous_region() with a size order of 10 which is too >>>>>> high >>>>>> for the current max value. >>>>>> >>>>>> This patch increases the MAX_CONTIG_ORDER from 9 to 10 (4MB) to allow >>>>>> such allocations. >>>>>> >>>>>> Signed-off-by: Thierry Escande <thierry.escande@vates.tech> >>>>>> --- >>>>>> arch/x86/xen/mmu_pv.c | 2 +- >>>>>> 1 file changed, 1 insertion(+), 1 deletion(-) >>>>>> >>>>>> diff --git a/arch/x86/xen/mmu_pv.c b/arch/x86/xen/mmu_pv.c >>>>>> index 55a4996d0c04..7f110740e1a2 100644 >>>>>> --- a/arch/x86/xen/mmu_pv.c >>>>>> +++ b/arch/x86/xen/mmu_pv.c >>>>>> @@ -2200,7 +2200,7 @@ void __init xen_init_mmu_ops(void) >>>>>> } >>>>>> /* Protected by xen_reservation_lock. */ >>>>>> -#define MAX_CONTIG_ORDER 9 /* 2MB */ >>>>>> +#define MAX_CONTIG_ORDER 10 /* 4MB */ >>>>>> static unsigned long discontig_frames[1<<MAX_CONTIG_ORDER]; >>>>> >>>>> While lacking respective commentary, bumping this value imo also >>>>> needs to >>>>> take into account Xen itself, at least commit-message-wise. The >>>>> bumping is >>>>> fine for Dom0 in any event. It is also fine for DomU-s with the >>>>> defaults >>>>> built into the hypervisor (orders 12 and 10 respectively for x86 and >>>>> Arm), >>>>> yet especially for Arm (and in the future PPC and RISC-V) any further >>>>> bumping would be less straightforward. >>>> >>>> Thanks for pointing this out. On the Xen side, CONFIG_CTLDOM_MAX_ORDER >>>> and CONFIG_HWDOM_MAX_ORDER seem big enough on all architectures. But I >>>> see CONFIG_DOMU_MAX_ORDER set to 9 (also all archs). Won't that be a >>>> problem for drivers trying to allocate more than that from a domU ? >>> >>> A driver assumes a (physical) device to be in the DomU, at which point it >>> is CONFIG_PTDOM_MAX_ORDER which applies (PT standing for pass-through). >>> >>>>> However - does the driver really need this big a contiguous chunk? It >>>>> would seem far more desirable to me to break that up some, if possible. >>>> >>>> Since this works on bare metal I'm afraid the driver maintainer (mpt >>>> fusion driver) will just tell me to fix Xen. >>> >>> Well. The bigger such allocations, the larger the risk that on systems >>> that have been up for a while such allocations can't be fulfilled anymore >>> even in the bare metal case. >> >> Yes. I don't think we should just work around this issue without having >> even tried to get the driver fixed. In case they refuse to change it, we >> can still increase MAX_CONTIG_ORDER. > > Thanks for the feedback. I'll try to have a look at the driver if I have > time to do so. Another thought would be to change the generic DMA allocation to not require alignment based on the rounded up size, but on the largest power-of-2 chunk fitting into the requested size. I don't see why a 2.3 MB memory allocation would need to be 4 MB aligned. It should be perfectly fine to align it to 2 MB only. Juergen [-- Attachment #1.1.2: OpenPGP public key --] [-- Type: application/pgp-keys, Size: 3743 bytes --] [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 495 bytes --] ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PATCH] x86/xen/mmu: Increase MAX_CONTIG_ORDER 2024-12-18 11:24 ` Jürgen Groß @ 2024-12-19 7:12 ` Jan Beulich 2025-01-04 18:50 ` Jürgen Groß 0 siblings, 1 reply; 10+ messages in thread From: Jan Beulich @ 2024-12-19 7:12 UTC (permalink / raw) To: Jürgen Groß Cc: sstabellini, oleksandr_tyshchenko, xen-devel, linux-kernel, Thierry Escande On 18.12.2024 12:24, Jürgen Groß wrote: > On 18.12.24 12:11, Thierry Escande wrote: >> >> >> On 12/12/2024 12:09, Juergen Gross wrote: >>> On 12.12.24 11:22, Jan Beulich wrote: >>>> On 11.12.2024 19:20, Thierry Escande wrote: >>>>> Hi Jan, >>>>> >>>>> On 09/12/2024 11:04, Jan Beulich wrote: >>>>>> On 04.12.2024 18:14, Thierry Escande wrote: >>>>>>> With change 9f40ec84a797 (xen/swiotlb: add alignment check for dma >>>>>>> buffers), the driver mpt3sas fails to load because it cannot allocate >>>>>>> its DMA pool for an allocation size of ~2,3 MBytes. This is because >>>>>>> the >>>>>>> alignement check added by 9f40ec84a797 fails and >>>>>>> xen_swiotlb_alloc_coherent() ends up calling >>>>>>> xen_create_contiguous_region() with a size order of 10 which is too >>>>>>> high >>>>>>> for the current max value. >>>>>>> >>>>>>> This patch increases the MAX_CONTIG_ORDER from 9 to 10 (4MB) to allow >>>>>>> such allocations. >>>>>>> >>>>>>> Signed-off-by: Thierry Escande <thierry.escande@vates.tech> >>>>>>> --- >>>>>>> arch/x86/xen/mmu_pv.c | 2 +- >>>>>>> 1 file changed, 1 insertion(+), 1 deletion(-) >>>>>>> >>>>>>> diff --git a/arch/x86/xen/mmu_pv.c b/arch/x86/xen/mmu_pv.c >>>>>>> index 55a4996d0c04..7f110740e1a2 100644 >>>>>>> --- a/arch/x86/xen/mmu_pv.c >>>>>>> +++ b/arch/x86/xen/mmu_pv.c >>>>>>> @@ -2200,7 +2200,7 @@ void __init xen_init_mmu_ops(void) >>>>>>> } >>>>>>> /* Protected by xen_reservation_lock. */ >>>>>>> -#define MAX_CONTIG_ORDER 9 /* 2MB */ >>>>>>> +#define MAX_CONTIG_ORDER 10 /* 4MB */ >>>>>>> static unsigned long discontig_frames[1<<MAX_CONTIG_ORDER]; >>>>>> >>>>>> While lacking respective commentary, bumping this value imo also >>>>>> needs to >>>>>> take into account Xen itself, at least commit-message-wise. The >>>>>> bumping is >>>>>> fine for Dom0 in any event. It is also fine for DomU-s with the >>>>>> defaults >>>>>> built into the hypervisor (orders 12 and 10 respectively for x86 and >>>>>> Arm), >>>>>> yet especially for Arm (and in the future PPC and RISC-V) any further >>>>>> bumping would be less straightforward. >>>>> >>>>> Thanks for pointing this out. On the Xen side, CONFIG_CTLDOM_MAX_ORDER >>>>> and CONFIG_HWDOM_MAX_ORDER seem big enough on all architectures. But I >>>>> see CONFIG_DOMU_MAX_ORDER set to 9 (also all archs). Won't that be a >>>>> problem for drivers trying to allocate more than that from a domU ? >>>> >>>> A driver assumes a (physical) device to be in the DomU, at which point it >>>> is CONFIG_PTDOM_MAX_ORDER which applies (PT standing for pass-through). >>>> >>>>>> However - does the driver really need this big a contiguous chunk? It >>>>>> would seem far more desirable to me to break that up some, if possible. >>>>> >>>>> Since this works on bare metal I'm afraid the driver maintainer (mpt >>>>> fusion driver) will just tell me to fix Xen. >>>> >>>> Well. The bigger such allocations, the larger the risk that on systems >>>> that have been up for a while such allocations can't be fulfilled anymore >>>> even in the bare metal case. >>> >>> Yes. I don't think we should just work around this issue without having >>> even tried to get the driver fixed. In case they refuse to change it, we >>> can still increase MAX_CONTIG_ORDER. >> >> Thanks for the feedback. I'll try to have a look at the driver if I have >> time to do so. > > Another thought would be to change the generic DMA allocation to not require > alignment based on the rounded up size, but on the largest power-of-2 chunk > fitting into the requested size. > > I don't see why a 2.3 MB memory allocation would need to be 4 MB aligned. It > should be perfectly fine to align it to 2 MB only. Yet that wouldn't make a difference here, would it? We'd still need a 4M chunk of contiguous space, just with less alignment. Jan ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PATCH] x86/xen/mmu: Increase MAX_CONTIG_ORDER 2024-12-19 7:12 ` Jan Beulich @ 2025-01-04 18:50 ` Jürgen Groß 0 siblings, 0 replies; 10+ messages in thread From: Jürgen Groß @ 2025-01-04 18:50 UTC (permalink / raw) To: Jan Beulich Cc: sstabellini, oleksandr_tyshchenko, xen-devel, linux-kernel, Thierry Escande [-- Attachment #1.1.1: Type: text/plain, Size: 4203 bytes --] On 19.12.24 08:12, Jan Beulich wrote: > On 18.12.2024 12:24, Jürgen Groß wrote: >> On 18.12.24 12:11, Thierry Escande wrote: >>> >>> >>> On 12/12/2024 12:09, Juergen Gross wrote: >>>> On 12.12.24 11:22, Jan Beulich wrote: >>>>> On 11.12.2024 19:20, Thierry Escande wrote: >>>>>> Hi Jan, >>>>>> >>>>>> On 09/12/2024 11:04, Jan Beulich wrote: >>>>>>> On 04.12.2024 18:14, Thierry Escande wrote: >>>>>>>> With change 9f40ec84a797 (xen/swiotlb: add alignment check for dma >>>>>>>> buffers), the driver mpt3sas fails to load because it cannot allocate >>>>>>>> its DMA pool for an allocation size of ~2,3 MBytes. This is because >>>>>>>> the >>>>>>>> alignement check added by 9f40ec84a797 fails and >>>>>>>> xen_swiotlb_alloc_coherent() ends up calling >>>>>>>> xen_create_contiguous_region() with a size order of 10 which is too >>>>>>>> high >>>>>>>> for the current max value. >>>>>>>> >>>>>>>> This patch increases the MAX_CONTIG_ORDER from 9 to 10 (4MB) to allow >>>>>>>> such allocations. >>>>>>>> >>>>>>>> Signed-off-by: Thierry Escande <thierry.escande@vates.tech> >>>>>>>> --- >>>>>>>> arch/x86/xen/mmu_pv.c | 2 +- >>>>>>>> 1 file changed, 1 insertion(+), 1 deletion(-) >>>>>>>> >>>>>>>> diff --git a/arch/x86/xen/mmu_pv.c b/arch/x86/xen/mmu_pv.c >>>>>>>> index 55a4996d0c04..7f110740e1a2 100644 >>>>>>>> --- a/arch/x86/xen/mmu_pv.c >>>>>>>> +++ b/arch/x86/xen/mmu_pv.c >>>>>>>> @@ -2200,7 +2200,7 @@ void __init xen_init_mmu_ops(void) >>>>>>>> } >>>>>>>> /* Protected by xen_reservation_lock. */ >>>>>>>> -#define MAX_CONTIG_ORDER 9 /* 2MB */ >>>>>>>> +#define MAX_CONTIG_ORDER 10 /* 4MB */ >>>>>>>> static unsigned long discontig_frames[1<<MAX_CONTIG_ORDER]; >>>>>>> >>>>>>> While lacking respective commentary, bumping this value imo also >>>>>>> needs to >>>>>>> take into account Xen itself, at least commit-message-wise. The >>>>>>> bumping is >>>>>>> fine for Dom0 in any event. It is also fine for DomU-s with the >>>>>>> defaults >>>>>>> built into the hypervisor (orders 12 and 10 respectively for x86 and >>>>>>> Arm), >>>>>>> yet especially for Arm (and in the future PPC and RISC-V) any further >>>>>>> bumping would be less straightforward. >>>>>> >>>>>> Thanks for pointing this out. On the Xen side, CONFIG_CTLDOM_MAX_ORDER >>>>>> and CONFIG_HWDOM_MAX_ORDER seem big enough on all architectures. But I >>>>>> see CONFIG_DOMU_MAX_ORDER set to 9 (also all archs). Won't that be a >>>>>> problem for drivers trying to allocate more than that from a domU ? >>>>> >>>>> A driver assumes a (physical) device to be in the DomU, at which point it >>>>> is CONFIG_PTDOM_MAX_ORDER which applies (PT standing for pass-through). >>>>> >>>>>>> However - does the driver really need this big a contiguous chunk? It >>>>>>> would seem far more desirable to me to break that up some, if possible. >>>>>> >>>>>> Since this works on bare metal I'm afraid the driver maintainer (mpt >>>>>> fusion driver) will just tell me to fix Xen. >>>>> >>>>> Well. The bigger such allocations, the larger the risk that on systems >>>>> that have been up for a while such allocations can't be fulfilled anymore >>>>> even in the bare metal case. >>>> >>>> Yes. I don't think we should just work around this issue without having >>>> even tried to get the driver fixed. In case they refuse to change it, we >>>> can still increase MAX_CONTIG_ORDER. >>> >>> Thanks for the feedback. I'll try to have a look at the driver if I have >>> time to do so. >> >> Another thought would be to change the generic DMA allocation to not require >> alignment based on the rounded up size, but on the largest power-of-2 chunk >> fitting into the requested size. >> >> I don't see why a 2.3 MB memory allocation would need to be 4 MB aligned. It >> should be perfectly fine to align it to 2 MB only. > > Yet that wouldn't make a difference here, would it? We'd still need a 4M > chunk of contiguous space, just with less alignment. Thierry stated that the driver failed to load due to the added alignment check introduced with commit 9f40ec84a797. I was targeting this reasoning with my remark. Juergen [-- Attachment #1.1.2: OpenPGP public key --] [-- Type: application/pgp-keys, Size: 3743 bytes --] [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 495 bytes --] ^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2025-01-04 18:50 UTC | newest] Thread overview: 10+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2024-12-04 17:14 [PATCH] x86/xen/mmu: Increase MAX_CONTIG_ORDER Thierry Escande 2024-12-09 10:04 ` Jan Beulich 2024-12-11 18:20 ` Thierry Escande 2024-12-11 18:20 ` Thierry Escande 2024-12-12 10:22 ` Jan Beulich 2024-12-12 11:09 ` Juergen Gross 2024-12-18 11:11 ` Thierry Escande 2024-12-18 11:24 ` Jürgen Groß 2024-12-19 7:12 ` Jan Beulich 2025-01-04 18:50 ` Jürgen Groß
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.