From: William Breathitt Gray <vilhelm.gray@gmail.com>
To: Lukas Wunner <lukas@wunner.de>
Cc: Syed Nayyar Waris <syednwaris@gmail.com>,
akpm@linux-foundation.org, andriy.shevchenko@linux.intel.com,
arnd@arndb.de, Linus Walleij <linus.walleij@linaro.org>,
linux-arch@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH 1/6] bitops: Introduce the the for_each_set_clump macro
Date: Fri, 24 Apr 2020 11:09:26 -0400 [thread overview]
Message-ID: <20200424150828.GA5034@icarus> (raw)
In-Reply-To: <20200424150058.xadjxaga3csh3br6@wunner.de>
[-- Attachment #1: Type: text/plain, Size: 3870 bytes --]
On Fri, Apr 24, 2020 at 05:00:58PM +0200, Lukas Wunner wrote:
> On Fri, Apr 24, 2020 at 08:22:38PM +0530, Syed Nayyar Waris wrote:
> > On Fri, Apr 24, 2020 at 7:40 PM Lukas Wunner <lukas@wunner.de> wrote:
> > >
> > > On Fri, Apr 24, 2020 at 05:55:21PM +0530, Syed Nayyar Waris wrote:
> > > > +static inline void bitmap_set_value(unsigned long *map,
> > > > + unsigned long value,
> > > > + unsigned long start, unsigned long nbits)
> > > > +{
> > > > + const size_t index = BIT_WORD(start);
> > > > + const unsigned long offset = start % BITS_PER_LONG;
> > > > + const unsigned long ceiling = roundup(start + 1, BITS_PER_LONG);
> > > > + const unsigned long space = ceiling - start;
> > > > +
> > > > + value &= GENMASK(nbits - 1, 0);
> > > > +
> > > > + if (space >= nbits) {
> > > > + map[index] &= ~(GENMASK(nbits + offset - 1, offset));
> > > > + map[index] |= value << offset;
> > > > + } else {
> > > > + map[index] &= ~BITMAP_FIRST_WORD_MASK(start);
> > > > + map[index] |= value << offset;
> > > > + map[index + 1] &= ~BITMAP_LAST_WORD_MASK(start + nbits);
> > > > + map[index + 1] |= (value >> space);
> > > > + }
> > > > +}
> > >
> > > Sorry but what's the advantage of using this complicated function
> > > as a replacement for the much simpler bitmap_set_value8()?
> > >
> > > The drivers calling bitmap_set_value8() *know* that 8-bit accesses
> > > are possible and take advantage of that knowledge by using a small,
> > > speed-optimized function. Replacing that with a more complicated
> > > (potentially less performant) function doesn't seem to be a step
> > > forward.
> >
> > Actually this generic function can work with n-bits of any size (less
> > than equal to BITS_PER_LONG), while the earlier bitmap_set_value8
> > worked with n-bits having size of 8 bits only.
> >
> > In the case when n-bits is 8-bits, this new bitmap_set_value()
> > function would behave very similar to the earlier bitmap_set_value8()
> > function. For example, in case of n-bits being 8-bits it will always
> > execute the 'if' condition and not the 'else' condition, hence
> > offering the same performance (because of encountering similar code
> > statements) as earlier bitmap_set_value8() function, most probably.
> >
> > There is an additional advantage (this can happen when n-bits is not 8
> > bits): during setting value of n-bit in bitmap, if a situation arise
> > that the width of next n-bit is exceeding the word boundary, then it
> > will divide itself such that some portion of it is stored in that
> > word, while the remaining portion is stored in the next higher word.
> >
> > So, this function preserves the behaviour of earlier
> > bitmap_set_value8() function and also adds extra functionality to
> > that.
>
> Please leave drivers as is which use exclusively 8-bit accesses,
> e.g. gpio-max3191x.c and gpio-74x164.c. I'm fearing a performance
> regression if your new generic variant is used. They work perfectly
> fine the way they are and I don't see any benefit this series may have
> for them.
>
> If there are other drivers which benefit from the flexibility of your
> generic variant then I'm not opposed to changing those.
>
> Thanks,
>
> Lukas
We can leave of course bitmap_set_value8 alone, but for 8-bit values the
difference in latency I suspect is primarily due to the conditional test
for the word boundaries. This latency is surely overshadowed by the I/O
latency of the GPIO drivers, so I don't think there's much harm in
changing those to use the generic function when the bottleneck will not
be due to the bitmap_set_value/bitmap_get_value operations.
William Breathitt Gray
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 833 bytes --]
next prev parent reply other threads:[~2020-04-24 15:09 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-04-24 12:25 [PATCH 1/6] bitops: Introduce the the for_each_set_clump macro Syed Nayyar Waris
2020-04-24 14:10 ` Lukas Wunner
2020-04-24 14:52 ` Syed Nayyar Waris
2020-04-24 15:00 ` Lukas Wunner
2020-04-24 15:09 ` William Breathitt Gray [this message]
2020-04-24 16:34 ` Andy Shevchenko
2020-04-24 16:42 ` William Breathitt Gray
2020-04-24 17:59 ` Lukas Wunner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20200424150828.GA5034@icarus \
--to=vilhelm.gray@gmail.com \
--cc=akpm@linux-foundation.org \
--cc=andriy.shevchenko@linux.intel.com \
--cc=arnd@arndb.de \
--cc=linus.walleij@linaro.org \
--cc=linux-arch@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=lukas@wunner.de \
--cc=syednwaris@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.