From: Zachary Amsden <zamsden@redhat.com>
To: "H. Peter Anvin" <hpa@zytor.com>
Cc: Linus Torvalds <torvalds@linux-foundation.org>,
linux-kernel@vger.kernel.org,
Thomas Gleixner <tglx@linutronix.de>,
Ingo Molnar <mingo@redhat.com>,
x86@kernel.org, Avi Kivity <avi@redhat.com>
Subject: Re: [PATCH] x86 rwsem optimization extreme
Date: Wed, 17 Feb 2010 18:25:51 -1000 [thread overview]
Message-ID: <4B7CC14F.7000802@redhat.com> (raw)
In-Reply-To: <4B7C9F0A.1080708@zytor.com>
>
> On 02/17/2010 05:53 PM, Linus Torvalds wrote:
>
>> - but adc _throughput_ is also typically much higher, which indicates
>> that even if you do flag renaming, the 'adc' quite likely only
>> schedules in a single ALU unit.
>>
>> For example, on a Pentium, adc/sbb can only go in the U pipe, and I think
>> the same is true of 'stc'. Now, nobody likely cares about Pentiums any
>> more, but the point is, 'adc' does often have constraints that a regular
>> 'add' does not, and there's an example of a 'stc+adc' pair would at the
>> very least have to be scheduled with an instruction in between.
>>
> No doubt. I doubt it much matters in this context, but either way I
> think the patch is probably a bad idea... much for the same as my incl
> hack was - since the code isn't actually inline, saving a handful bytes
> is not the right tradeoff.
>
> -hpa
>
>
Incidentally, the cost of putting all the rwsem code inline, using the
straightforward approach, for git-tip, using defconfig on x86_64 is 3565
bytes / 20971778 bytes total, or 0.0168%, using gcc 4.4.3.
That's small enough to actually consider it.
Even smaller if you leave trylock as a function... actually no, that
didn't work, size increased. I'm guessing many call sites also end up
calling the explicit form as a fallback.
If you inline only read_lock functions and write release, nope, that
didn't work either.
If you inline only read_lock functions, that still isn't it. Many other
permutations are possible, but I've wasted enough time.
Although, with a more clever inline implementation, if some of the
constraints to %rdx go away...
Zach
next prev parent reply other threads:[~2010-02-18 4:26 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-02-17 21:58 [PATCH] x86 rwsem optimization extreme Zachary Amsden
2010-02-17 22:10 ` Linus Torvalds
2010-02-17 22:29 ` H. Peter Anvin
2010-02-17 23:29 ` H. Peter Anvin
2010-02-18 1:03 ` Zachary Amsden
2010-02-18 1:53 ` Linus Torvalds
2010-02-18 1:59 ` H. Peter Anvin
2010-02-18 4:25 ` Zachary Amsden [this message]
2010-02-18 8:12 ` Andi Kleen
2010-02-18 8:24 ` Zachary Amsden
2010-02-18 9:29 ` Andi Kleen
2010-02-18 10:55 ` Ingo Molnar
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4B7CC14F.7000802@redhat.com \
--to=zamsden@redhat.com \
--cc=avi@redhat.com \
--cc=hpa@zytor.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@redhat.com \
--cc=tglx@linutronix.de \
--cc=torvalds@linux-foundation.org \
--cc=x86@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.