Re: coding style lesson: iwlwifi vs. endianness

All of lore.kernel.org
 help / color / mirror / Atom feed

From: Dan Williams <dcbw@redhat.com>
To: Tomas Winkler <tomasw@gmail.com>
Cc: Johannes Berg <johannes@sipsolutions.net>,
	linux-wireless <linux-wireless@vger.kernel.org>,
	Zhu Yi <yi.zhu@intel.com>,
	Reinette Chatre <reinette.chatre@intel.com>
Subject: Re: coding style lesson: iwlwifi vs. endianness
Date: Wed, 28 Nov 2007 16:43:12 -0500	[thread overview]
Message-ID: <1196286192.4967.29.camel@localhost.localdomain> (raw)
In-Reply-To: <1ba2fa240711281020n6052980cl610f6c790788d2da@mail.gmail.com>

On Wed, 2007-11-28 at 20:20 +0200, Tomas Winkler wrote:
> If you would compose your email in less arrogant tone I would answer
> you why your assumptions are wrong.
> I know it is tempting to teach BIG Intel, but please try to keep good
> spirit on this mailing list as it was so far.
> Thanks
> Tomas.

Ignoring the email tones and focusing on the problem, could you
elaborate your reasons?  Doing endian conversions at the boundaries is
quite a bit simpler and does lead to cleaner, more readable code.  The
bulk of the work being done with a softmac card is in the driver +
stack, so the clearer those are, the better for everyone.

Dan

> On Nov 27, 2007 8:44 PM, Johannes Berg <johannes@sipsolutions.net> wrote:
> > Today's lesson is brought to you by the motivation to teach you how to
> > write simpler code since apparently the only way you found so far to get
> > drivers working on big endian platforms is to use a big endian
> > conversion hammer (ambiguity intentional :) ). Which is not at all
> > necessary. Maybe you don't want to do this as it's a lot of work to
> > start with, but the current way you do things more than suboptimal.
> >
> > But let me explain. And since examples are always good, I picked one at
> > random from the iwl4965 driver.
> >
> > Consider the phy_flags field in the iwl4965_rx_phy_res structure which
> > is defined as follows:
> >         __le16 phy_flags;       /* general phy flags: band, modulation, ... */
> >
> > along with these definitions:
> >
> > #define RX_RES_PHY_FLAGS_BAND_24_MSK            __constant_cpu_to_le16(1 << 0)
> > #define RX_RES_PHY_FLAGS_MOD_CCK_MSK            __constant_cpu_to_le16(1 << 1)
> > #define RX_RES_PHY_FLAGS_SHORT_PREAMBLE_MSK     __constant_cpu_to_le16(1 << 2)
> > #define RX_RES_PHY_FLAGS_NARROW_BAND_MSK        __constant_cpu_to_le16(1 << 3)
> > #define RX_RES_PHY_FLAGS_ANTENNA_MSK            __constant_cpu_to_le16(0xf0)    [1]
> >
> > This is very complex. Why? Because you continually have to use __le16
> > for any phy flags field. To extract the antenna, here's what you need to
> > do:
> >
> > __le16 phy_flags_hw = phy_res->phy_flags;
> > [...]
> >
> > antenna = le16_to_cpu(phy_flags_hw & ANTENNA_MSK) >> 4;
> >
> > Also, on a big-endian architecture that expands to this beast:
> > rt_antenna =
> >   (__builtin_constant_p((__u16)(( __u16)(__le16)(phy_flags_hw &
> > (( __le16)((__u16)( (((__u16)((0xf0)) & (__u16)0x00ffU) << 8) |
> > (((__u16)((0xf0)) & (__u16)0xff00U) >> 8) )))))) ?
> > ((__u16)( (((__u16)((( __u16)(__le16)(phy_flags_hw &
> > (( __le16)((__u16)( (((__u16)((0xf0)) & (__u16)0x00ffU) << 8) |
> > (((__u16)((0xf0)) & (__u16)0xff00U) >> 8) )))))) & (__u16)0x00ffU) << 8)
> > | (((__u16)((( __u16)(__le16)(phy_flags_hw &
> > (( __le16)((__u16)( (((__u16)((0xf0)) & (__u16)0x00ffU) << 8) |
> > (((__u16)((0xf0)) & (__u16)0xff00U) >> 8) )))))) & (__u16)0xff00U) >>
> > 8) )) : __fswab16((( __u16)(__le16)(phy_flags_hw &
> > (( __le16)((__u16)( (((__u16)((0xf0)) & (__u16)0x00ffU) << 8) |
> > (((__u16)((0xf0)) & (__u16)0xff00U) >> 8) ))))))) >> 4;
> >
> > It also results in a compiler warning:
> > [...] warning: integer overflow in expression
> >
> > although I have to admit that right now I do not know where in that
> > beast the compiler thinks it got an overflow. Twice, in fact. In any
> > case, it's pretty hard on the compiler.
> >
> > Additionally, doing it this way means that programmers continually need
> > to think about endianness *all over the code*. Literally *everywhere*
> > that touches values coming from hardware/going to hardware, which is
> > pretty much everywhere in a driver.
> >
> > Now, here's what I want to teach you to do instead.
> >
> > Keep the phy_flags field defined as it was:
> >         __le16 phy_flags;       /* general phy flags: band, modulation, ... */
> >
> > but do it like everybody else and define the values as they are in the
> > phy_flags field without thinking about endianness at all:
> >
> > #define RX_RES_PHY_FLAGS_BAND_24_MSK            (1 << 0)
> > #define RX_RES_PHY_FLAGS_MOD_CCK_MSK            (1 << 1)
> > #define RX_RES_PHY_FLAGS_SHORT_PREAMBLE_MSK     (1 << 2)
> > #define RX_RES_PHY_FLAGS_NARROW_BAND_MSK        (1 << 3)
> > #define RX_RES_PHY_FLAGS_ANTENNA_MSK            0xf0
> >
> > This is easier for the person writing the definitions and looks much
> > cleaner to boot.
> >
> > Then, when it comes to using a value, simply do:
> >
> > u16 phy_flags = le16_to_cpu(phy_res->phy_flags);
> >
> > (and if you get it wrong, sparse will warn here.)
> >
> > Then, you can do the easy:
> > antenna = (phy_flags & ANTENNA_MASK) >> 4;
> >
> > without having to think about endianness. And if you use the field again
> > and again you never have any conversion functions.
> >
> > Presto. As long as you think in terms of
> >         this is the phy_flags field that contains X, Y and Z at
> >         positions x, y, z
> >
> > rather than
> >
> >         this is 16 bits that are laid out in such and such way
> >
> > (which I think everybody except hardware designers does), you win.
> >
> > As a bonus, your code is much easier to read and much smaller (C/header
> > file size I mean). Also, it's a lot more efficient for fields that
> > contain multiple fields like the antenna field (with more than one bit)
> > because you do the conversion only once.
> >
> > Think of it this way:
> >
> > Your current style of taking care of endianness requires thinking about
> > it everywhere, is thus prone to errors and looks ugly.
> >
> > The way I'm suggesting is to convert all data to native (CPU) endianness
> > _as it enters the system_, i.e. at the driver/hardware boundary and then
> > think in terms of the "field" after that, never bothering to think about
> > endianness again. In fact, you shouldn't ever need to use cpu_to_le*()
> > in the RX path, only in any control/TX paths where you need to push a
> > value down to the hardware.
> >
> > The iwlwifi code is a big mess this way. I'm willing to help with the
> > conversion, it should be a pretty mechanical removal of many many
> > cpu_to_le16/32 calls and some fixups and will definitely make the code
> > much easier to maintain. In fact, it would probably have been easier if
> > you'd written the code without respect for endianness and then we'd
> > simply annotated all structures that are shared with the hardware and
> > fixed the sparse warnings at the system boundaries.
> >
> > I hope this helps. Let me know if you have any questions.
> >
> > I have one for you: Whatever gave you the idea of doing it this way?
> > Isn't it common sense that converting data at system boundaries is much
> > easier and less prone to errors than trying to pull it through the whole
> > second system in a non-native format? Whether it's endianness, character
> > set conversion, ...
> >
> > johannes
> >
> > [1] By the way, using __constant_cpu_to_le16() is not useful because
> > cpu_to_le16() already checks whether the argument is a constant. You
> > should only use it if absolutely necessary for some reason. That's why
> > it has two underscores.
> >
> -
> To unsubscribe from this list: send the line "unsubscribe linux-wireless" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

next prev parent reply	other threads:[~2007-11-28 21:48 UTC|newest]

Thread overview: 19+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-11-27 18:44 coding style lesson: iwlwifi vs. endianness Johannes Berg
2007-11-28 18:20 ` Tomas Winkler
2007-11-28 18:50   ` Johannes Berg
2007-11-28 21:43   ` Dan Williams [this message]
2007-11-29  0:58     ` John W. Linville
2007-11-29 23:02       ` Tomas Winkler
2007-12-10 11:42         ` Johannes Berg
2007-12-10 14:18           ` Tomas Winkler
2007-12-10 15:18             ` Johannes Berg
2007-12-10 15:30               ` Michael Buesch
2007-12-10 16:18                 ` Johannes Berg
2007-12-10 16:48                   ` Michael Buesch
2007-12-10 16:21                 ` Tomas Winkler
2007-12-10 16:25                   ` Johannes Berg
2007-12-10 21:18                     ` Tomas Winkler
2007-12-10 16:10               ` Tomas Winkler
2007-12-10 16:17                 ` Johannes Berg
2007-12-10 16:23                 ` Johannes Berg
2007-11-29  9:03   ` Holger Schurig

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1196286192.4967.29.camel@localhost.localdomain \
    --to=dcbw@redhat.com \
    --cc=johannes@sipsolutions.net \
    --cc=linux-wireless@vger.kernel.org \
    --cc=reinette.chatre@intel.com \
    --cc=tomasw@gmail.com \
    --cc=yi.zhu@intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.