From: Andrew Cooper <andrew.cooper3@citrix.com>
To: Jan Beulich <jbeulich@suse.com>, daniel.kiper@oracle.com
Cc: xen-devel@lists.xenproject.org, keir@xen.org
Subject: Re: [PATCH v2 03/23] x86: zero BSS using stosl instead of stosb
Date: Wed, 22 Jul 2015 09:42:55 +0100 [thread overview]
Message-ID: <55AF578F.4070403@citrix.com> (raw)
In-Reply-To: <55AF35A502000078000D4F8E@prv-mh.provo.novell.com>
On 22/07/2015 06:18, Jan Beulich wrote:
>>>> Daniel Kiper <daniel.kiper@oracle.com> 07/21/15 8:23 PM >>>
>> On Tue, Jul 21, 2015 at 03:37:48AM -0600, Jan Beulich wrote:
>>>>>> On 20.07.15 at 16:28, <daniel.kiper@oracle.com> wrote:
>>> ... because of ??? Nowadays - with X86_FEATURE_ERMS - rep stosb
>>> is expected to be faster than rep stosl.
>> OK, I did not know about that. However, as I know this feature
>> was introduced in 2012 with Ivy Bridge. So, I suppose that there
>> are still a lot of machines in the wild which does not support it.
>> Anyway, because this code is not performance critical I am not going
>> to insist on one or another solution. However, Andrew suggested that
>> thing, so, please agree with him in which direction we should go.
>> I will do what you agree.
> ISTR having seen a similar patch from him(?), maybe in another area
> of code, before (or was it v1 of this one?), which I responded to with the
> same as above.
Indeed you have, several in fact. I had not had chance to delve into
the optimisation manuals, but have taken a peek now. (Section 3.7.6)
In the case of having aligned source and destination on a 16-byte
boundary (which we can trivially arrange), then ERMSB (to give it its
Intel name) and rep stosl differ only in the setup cost; they still
scale at the same rate for changes in length.
Therefore, assuming we arrange for 16-byte alignment, using rep stosl
would appear to be a single 60ish cycle hit over using ERMSB, but would
be substantially more efficient than using rep stosb on a non-ERMSB system.
Overall, I think 16 byte alignment and rep stosl is the best compromise.
~Andrew
next prev parent reply other threads:[~2015-07-22 8:42 UTC|newest]
Thread overview: 201+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-07-20 14:28 [PATCH v2 00/23] x86: multiboot2 protocol support Daniel Kiper
2015-07-20 14:28 ` [PATCH v2 01/23] x86/boot: remove unneeded instruction Daniel Kiper
2015-07-24 16:22 ` Konrad Rzeszutek Wilk
2015-07-27 19:46 ` Daniel Kiper
2015-07-27 19:46 ` Daniel Kiper
2015-08-10 16:07 ` Konrad Rzeszutek Wilk
2015-08-10 16:07 ` Konrad Rzeszutek Wilk
2015-07-24 16:22 ` Konrad Rzeszutek Wilk
2015-07-20 14:28 ` Daniel Kiper
2015-07-20 14:28 ` [PATCH v2 02/23] x86/boot: copy only text section from *.lnk file to *.bin file Daniel Kiper
2015-07-20 14:28 ` Daniel Kiper
2015-07-21 9:35 ` Jan Beulich
2015-07-21 17:23 ` Daniel Kiper
2015-07-22 5:14 ` Jan Beulich
2015-07-22 8:02 ` Jan Beulich
2015-07-22 13:31 ` Daniel Kiper
2015-07-22 14:07 ` Jan Beulich
2015-07-20 14:28 ` [PATCH v2 03/23] x86: zero BSS using stosl instead of stosb Daniel Kiper
2015-07-21 9:37 ` Jan Beulich
2015-07-21 18:23 ` Daniel Kiper
2015-07-22 5:18 ` Jan Beulich
2015-07-22 8:42 ` Andrew Cooper [this message]
2015-07-22 10:04 ` Jan Beulich
2015-07-22 11:22 ` Andrew Cooper
2015-07-22 11:48 ` Jan Beulich
2015-07-20 14:28 ` Daniel Kiper
2015-07-20 14:28 ` [PATCH v2 04/23] x86/boot: call reloc() using cdecl calling convention Daniel Kiper
2015-07-20 14:28 ` Daniel Kiper
2015-08-10 16:33 ` Konrad Rzeszutek Wilk
2015-08-10 16:33 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-08-17 15:44 ` Jan Beulich
2015-08-17 15:44 ` Jan Beulich
2015-07-20 14:29 ` [PATCH v2 05/23] x86/boot/reloc: create generic alloc and copy functions Daniel Kiper
2015-08-17 15:51 ` Jan Beulich
2015-08-17 22:03 ` Daniel Kiper
2015-08-17 15:51 ` Jan Beulich
2015-07-20 14:29 ` Daniel Kiper
2015-07-20 14:29 ` [PATCH v2 06/23] x86/boot: use %ecx instead of %eax Daniel Kiper
2015-07-20 14:29 ` Daniel Kiper
2015-08-10 16:36 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-08-10 16:36 ` Konrad Rzeszutek Wilk
2015-07-20 14:29 ` [PATCH v2 07/23] x86/boot/reloc: Rename some variables and rearrange code a bit Daniel Kiper
2015-07-20 14:29 ` Daniel Kiper
2015-08-10 16:40 ` Konrad Rzeszutek Wilk
2015-08-10 16:40 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-08-17 15:55 ` Jan Beulich
2015-08-17 15:55 ` Jan Beulich
2015-07-20 14:29 ` [PATCH v2 08/23] x86: add multiboot2 protocol support Daniel Kiper
2015-08-10 19:17 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-08-13 19:22 ` Daniel Kiper
2015-08-14 10:03 ` Jan Beulich
2015-08-14 10:03 ` [Xen-devel] " Jan Beulich
2015-08-15 6:00 ` Andrew Cooper
2015-08-15 6:00 ` Andrew Cooper
2015-08-13 19:22 ` Daniel Kiper
2015-08-10 19:17 ` Konrad Rzeszutek Wilk
2015-08-18 8:12 ` Jan Beulich
2015-08-18 12:00 ` Daniel Kiper
2015-08-18 14:20 ` Jan Beulich
2015-07-20 14:29 ` [PATCH v2 09/23] efi: create efi_enabled() Daniel Kiper
2015-08-10 19:20 ` Konrad Rzeszutek Wilk
2015-08-10 19:20 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-08-20 15:18 ` Jan Beulich
2015-08-22 12:33 ` Daniel Kiper
2015-08-24 11:29 ` Jan Beulich
2015-08-22 12:33 ` Daniel Kiper
2015-07-20 14:29 ` [PATCH v2 10/23] efi: build xen.gz with EFI code Daniel Kiper
2015-07-20 14:29 ` Daniel Kiper
2015-08-10 19:24 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-08-10 19:24 ` Konrad Rzeszutek Wilk
2015-08-20 15:39 ` Jan Beulich
2015-08-22 13:59 ` Daniel Kiper
2015-08-22 13:59 ` Daniel Kiper
2015-08-24 11:35 ` Jan Beulich
2015-08-24 20:54 ` Daniel Kiper
2015-08-25 10:50 ` Andrew Cooper
2015-08-25 10:50 ` Andrew Cooper
2015-08-25 15:39 ` Daniel Kiper
2015-08-25 15:39 ` Daniel Kiper
2015-08-25 12:09 ` Jan Beulich
2015-08-25 12:09 ` Jan Beulich
2015-08-25 16:31 ` Daniel Kiper
2015-08-26 6:46 ` Jan Beulich
2015-08-26 12:33 ` Daniel Kiper
2015-08-26 12:40 ` Jan Beulich
2015-08-26 12:40 ` Jan Beulich
2015-08-26 12:58 ` Daniel Kiper
2015-08-26 12:58 ` Daniel Kiper
2015-08-26 12:33 ` Daniel Kiper
2015-08-26 6:46 ` Jan Beulich
2015-08-25 16:31 ` Daniel Kiper
2015-08-24 20:54 ` Daniel Kiper
2015-07-20 14:29 ` [PATCH v2 11/23] efi: split out efi_init() Daniel Kiper
2015-08-10 19:25 ` Konrad Rzeszutek Wilk
2015-08-10 19:25 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-07-20 14:29 ` [PATCH v2 12/23] efi: split out efi_console_set_mode() Daniel Kiper
2015-08-10 19:25 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-08-10 19:25 ` Konrad Rzeszutek Wilk
2015-07-20 14:29 ` Daniel Kiper
2015-07-20 14:29 ` [PATCH v2 13/23] efi: split out efi_get_gop() Daniel Kiper
2015-07-20 14:29 ` Daniel Kiper
2015-08-10 19:27 ` Konrad Rzeszutek Wilk
2015-08-10 19:27 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-07-20 14:29 ` [PATCH v2 14/23] efi: split out efi_find_gop_mode() Daniel Kiper
2015-08-10 19:31 ` Konrad Rzeszutek Wilk
2015-08-10 19:31 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-08-20 15:48 ` Jan Beulich
2015-07-20 14:29 ` [PATCH v2 15/23] efi: split out efi_tables() Daniel Kiper
2015-08-10 19:32 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-08-10 19:32 ` Konrad Rzeszutek Wilk
2015-07-20 14:29 ` [PATCH v2 16/23] efi: split out efi_variables() Daniel Kiper
2015-08-10 19:34 ` Konrad Rzeszutek Wilk
2015-08-10 19:34 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-07-20 14:29 ` [PATCH v2 17/23] efi: split out efi_set_gop_mode() Daniel Kiper
2015-08-10 19:34 ` Konrad Rzeszutek Wilk
2015-08-10 19:34 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-07-20 14:29 ` [PATCH v2 18/23] efi: split out efi_exit_boot() Daniel Kiper
2015-07-20 14:29 ` Daniel Kiper
2015-08-10 19:36 ` Konrad Rzeszutek Wilk
2015-08-10 19:36 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-07-20 14:29 ` [PATCH v2 19/23] x86/efi: create new early memory allocator Daniel Kiper
2015-08-10 19:49 ` Konrad Rzeszutek Wilk
2015-08-10 19:49 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-08-27 11:23 ` Jan Beulich
2015-08-27 11:23 ` Jan Beulich
2015-07-20 14:29 ` Daniel Kiper
2015-07-20 14:29 ` [PATCH v2 20/23] x86: add multiboot2 protocol support for EFI platforms Daniel Kiper
2015-08-10 20:07 ` Konrad Rzeszutek Wilk
2015-08-10 20:07 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-08-11 15:23 ` Konrad Rzeszutek Wilk
2015-08-11 15:23 ` Konrad Rzeszutek Wilk
2015-08-27 12:01 ` Jan Beulich
2015-08-27 12:01 ` Jan Beulich
2015-09-22 15:21 ` Daniel Kiper
2015-09-22 15:58 ` Jan Beulich
2015-09-22 15:58 ` Jan Beulich
2015-07-20 14:29 ` [PATCH v2 21/23] x86/boot: implement early command line parser in C Daniel Kiper
2015-07-20 14:29 ` Daniel Kiper
2015-08-10 20:31 ` Konrad Rzeszutek Wilk
2015-08-10 20:31 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-08-11 14:43 ` Konrad Rzeszutek Wilk
2015-08-11 14:43 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-08-27 12:43 ` Jan Beulich
2015-08-27 12:43 ` Jan Beulich
2015-09-22 17:03 ` Daniel Kiper
2015-09-23 7:25 ` Jan Beulich
2015-09-23 7:25 ` Jan Beulich
2015-09-22 17:03 ` Daniel Kiper
2015-07-20 14:29 ` [PATCH v2 22/23] x86: make Xen early boot code relocatable Daniel Kiper
2015-08-11 16:48 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-08-14 11:52 ` Daniel Kiper
2015-08-14 12:49 ` Jan Beulich
2015-08-14 13:59 ` Daniel Kiper
2015-08-14 14:32 ` Jan Beulich
2015-08-14 14:37 ` Daniel Kiper
2015-08-14 15:12 ` Jan Beulich
2015-08-14 15:12 ` Jan Beulich
2015-08-14 14:37 ` Daniel Kiper
2015-08-14 14:32 ` Jan Beulich
2015-08-14 13:59 ` Daniel Kiper
2015-08-14 12:49 ` Jan Beulich
2015-08-14 15:20 ` Konrad Rzeszutek Wilk
2015-08-14 15:20 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-08-14 11:52 ` Daniel Kiper
2015-08-11 16:48 ` Konrad Rzeszutek Wilk
2015-08-27 13:12 ` Jan Beulich
2015-08-27 15:10 ` Daniel Kiper
2015-08-27 15:29 ` Jan Beulich
2015-08-27 17:56 ` Ben Hildred
2015-08-28 8:22 ` Jan Beulich
2015-08-28 8:22 ` [Xen-devel] " Jan Beulich
2015-08-28 13:42 ` Konrad Rzeszutek Wilk
2015-08-28 13:42 ` [Xen-devel] " Konrad Rzeszutek Wilk
2015-08-28 14:16 ` Jan Beulich
2015-08-28 14:16 ` Jan Beulich
2015-08-31 19:49 ` Daniel Kiper
2015-08-31 19:49 ` [Xen-devel] " Daniel Kiper
2015-09-01 6:59 ` Jan Beulich
2015-09-01 6:59 ` [Xen-devel] " Jan Beulich
2015-08-27 17:56 ` Ben Hildred
2015-08-27 18:04 ` Andrew Cooper
2015-08-28 6:54 ` Jan Beulich
2015-08-28 11:59 ` Andrew Cooper
2015-08-28 11:59 ` Andrew Cooper
2015-08-28 6:54 ` Jan Beulich
2015-08-27 18:04 ` Andrew Cooper
2015-08-28 14:24 ` Jan Beulich
2015-08-28 14:24 ` [Xen-devel] " Jan Beulich
2015-08-27 15:29 ` Jan Beulich
2015-08-27 15:10 ` Daniel Kiper
2015-08-27 13:12 ` Jan Beulich
2015-07-20 14:29 ` Daniel Kiper
2015-07-20 14:29 ` [PATCH v2 23/23] x86: add multiboot2 protocol support for relocatable images Daniel Kiper
2015-08-11 16:56 ` Konrad Rzeszutek Wilk
2015-08-11 16:56 ` Konrad Rzeszutek Wilk
2015-08-14 11:57 ` Daniel Kiper
2015-08-14 13:43 ` Konrad Rzeszutek Wilk
2015-08-14 13:43 ` Konrad Rzeszutek Wilk
2015-08-14 11:57 ` Daniel Kiper
2015-07-21 9:39 ` [PATCH v2 00/23] x86: multiboot2 protocol support Jan Beulich
2015-07-21 9:39 ` Jan Beulich
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=55AF578F.4070403@citrix.com \
--to=andrew.cooper3@citrix.com \
--cc=daniel.kiper@oracle.com \
--cc=jbeulich@suse.com \
--cc=keir@xen.org \
--cc=xen-devel@lists.xenproject.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.