From: Tomasz Wroblewski <tomasz.wroblewski@gmail.com>
To: Jan Beulich <JBeulich@suse.com>
Cc: xen-devel@lists.xenproject.org
Subject: Re: GPU passthrough performance regression in >4GB vms due to XSA-60 changes
Date: Mon, 19 May 2014 12:42:12 +0200 [thread overview]
Message-ID: <5379E004.1080107@gmail.com> (raw)
In-Reply-To: <5379DD1A.6050106@gmail.com>
On 05/19/2014 12:29 PM, Tomasz Wroblewski wrote:
>
> On 05/16/2014 04:36 PM, Jan Beulich wrote:
>>>>> On 16.05.14 at 13:38, <JBeulich@suse.com> wrote:
>>>>>> On 16.05.14 at 13:18, <tomasz.wroblewski@gmail.com> wrote:
>>>>> If I coded up a patch to deal with this on -unstable, would you be
>>>>> able to test that?
>>>> Willing to give it a go (xen major version updates are often
>>>> problematic
>>>> to do though so can't promise success). What would your patch be
>>>> doing?
>>>> Adding entries to MTRR for the relocated regions?
>>> This and properly declare the region in ACPI's _CRS. For starters I'll
>>> probably try keeping the WB default overlaid with UC variable ranges,
>>> as that's going to be the less intrusive change.
>> Okay here are two patches - the first to deal with the above mentioned
>> items, and the second to further increase correctness and at once
>> shrink the number of MTRR regions needed.
>>
>> Afaict they apply equally well to stable-4.3, master, and staging.
>>
>> But to be honest I don't expect any performance improvement, all
>> I'd expect is that BARs relocated above 4Gb would now get treated
>> equally to such below 4Gb - UC in all cases.
> Thanks Jan. I've tried the patches and you're correct, putting UC in
> MTRR for the relocated region didn't help the issue. However, I had to
> hack that manually - the codepaths to do that in your hvmloader patch
> were not activating. The hvmloader is not programming guest pci bars
> to 64bit regions at all, rather still programming them with 32 bit
> regions... upon a look this seems because using_64bar conditon, as
> well as bar64_relocate in hvmloader/pci.c is always false.
>
> So bar relocation to 64bit is not happening, but ram relocation as per
> the code tagged as /* Relocate RAM that overlaps PCI space (in
> 64k-page chunks). */ is happening. This maybe is correct (?), although
> I think the fact that RAM is relocated but not the BAR causes the
> tools (i.e. qemu) to lose sight of what memory is used for mmio and as
> you mentioned in one of the previous posts, the calls which would set
> it to mmio_direct in p2m table are not happening. Our qemu is pretty
> ancient and doesn't support 64bit bars so its not super trivial to
> verify whether relocating bars to 64bit would help. Trying to make
> sense out of this..
>
>
Actually seems to be like the plausible explanation for the performance
issues we see could be that
- some region of guest space has been relocated by hvmloader out to
64bit memory to enlarge pci mmio hole (which stays in 32bit space)
- BUT the caching on that relocated region is UC since at the time of
the relocation MTRR was disabled and that caused the EPT entry to get UC
type.
- however since this is just some region of guest memory not actually
used for mmio, just relocated out of mmio hole, the caching should be WB
- guest doesn't use that region for mmio but for some other tasks,
access to that is slow and slows the guest down.
- as you mentioned it might already be fixed on unstable since EPTs are
updated there when mtrr is enabled.
That would explain why retaining old loop fixed by XSA-60 fixes the perf
issue, since it runs at the time mtrr is enabled, it reverts the
relocated region to WB (which is correct I guess for the non mmio-regions)
>
>
>
next prev parent reply other threads:[~2014-05-19 10:42 UTC|newest]
Thread overview: 31+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-05-15 9:11 GPU passthrough performance regression in >4GB vms due to XSA-60 changes Tomasz Wroblewski
2014-05-15 12:32 ` Jan Beulich
2014-05-15 12:10 ` Tomasz Wroblewski
2014-05-15 13:23 ` Jan Beulich
2014-05-15 13:39 ` Tomasz Wroblewski
2014-05-15 14:34 ` Tomasz Wroblewski
2014-05-15 14:56 ` Tomasz Wroblewski
2014-05-15 16:07 ` Jan Beulich
2014-05-15 15:39 ` Tomasz Wroblewski
2014-05-16 6:33 ` Jan Beulich
2014-05-16 11:18 ` Tomasz Wroblewski
2014-05-16 11:38 ` Jan Beulich
2014-05-16 14:36 ` Jan Beulich
2014-05-19 10:29 ` Tomasz Wroblewski
2014-05-19 10:38 ` Jan Beulich
2014-05-19 10:47 ` Tomasz Wroblewski
2014-05-19 11:07 ` Jan Beulich
2014-05-19 11:32 ` Tomasz Wroblewski
2014-05-19 12:06 ` Jan Beulich
2014-05-19 12:17 ` Tomasz Wroblewski
2014-05-19 12:44 ` Jan Beulich
2014-05-19 14:20 ` Tomasz Wroblewski
2014-05-19 15:24 ` Jan Beulich
2014-05-19 15:48 ` Tomasz Wroblewski
2014-05-19 17:36 ` Tim Deegan
2014-05-20 6:31 ` Jan Beulich
2014-05-19 10:42 ` Tomasz Wroblewski [this message]
2014-05-19 11:01 ` Jan Beulich
2014-05-19 11:09 ` Tomasz Wroblewski
2014-05-19 11:19 ` Jan Beulich
2014-05-15 16:01 ` Jan Beulich
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5379E004.1080107@gmail.com \
--to=tomasz.wroblewski@gmail.com \
--cc=JBeulich@suse.com \
--cc=xen-devel@lists.xenproject.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).