From: Stephen Hemminger <shemminger@vyatta.com>
To: Daniel Lezcano <daniel.lezcano@free.fr>
Cc: David Miller <davem@davemloft.net>,
eric.dumazet@gmail.com, kaber@trash.net, nightnord@gmail.com,
netdev@vger.kernel.org
Subject: Re: [PATCH][v3] dev : fix mtu check when TSO is enabled
Date: Wed, 16 Mar 2011 09:45:35 -0700 [thread overview]
Message-ID: <20110316094535.5ac6acd8@nehalam> (raw)
In-Reply-To: <4D80E302.9010806@free.fr>
On Wed, 16 Mar 2011 17:19:14 +0100
Daniel Lezcano <daniel.lezcano@free.fr> wrote:
> On 03/16/2011 04:35 PM, Stephen Hemminger wrote:
> > On Wed, 16 Mar 2011 14:56:09 +0100
> > Daniel Lezcano<daniel.lezcano@free.fr> wrote:
> >
> >> On 03/15/2011 07:17 PM, Stephen Hemminger wrote:
> >>> On Tue, 15 Mar 2011 14:57:40 +0100
> >>> Daniel Lezcano<daniel.lezcano@free.fr> wrote:
> >>>
> >>>> On 03/15/2011 12:59 AM, David Miller wrote:
> >>>>> From: Daniel Lezcano<daniel.lezcano@free.fr>
> >>>>> Date: Mon, 14 Mar 2011 21:39:50 +0100
> >>>>>
> >>>>>> + len = dev->mtu + dev->hard_header_len + VLAN_HLEN;
> >>>>>> + if (skb->len< len)
> >>>>>> + return true;
> >>>>> This is not a correct translation of the original test:
> >>>>>
> >>>>>> - (skb->len> (dev->mtu + dev->hard_header_len + VLAN_HLEN)))) {
> >>>>> You need to use "<=" in your version, which currently rejects all
> >>>>> full sized frames. :-)
> >>>> Right, thanks.
> >>>>
> >>>>>> +
> >>>>>> + /* if TSO is enabled, we don't care about the length as the packet
> >>>>>> + * could be forwarded without being segmented before
> >>>>>> + */
> >>>>>> + if (skb->dev&& skb->dev->features& NETIF_F_TSO)
> >>>>>> + return true;
> >>>>> I am trying to understand why you aren't simply checking also if this
> >>>>> is a segmented frame? Perhaps skb_is_gso()&& device has NETIF_F_TSO
> >>>>> set?
> >>>> Maybe I am misunderstanding but the packet was forwarded by another device.
> >>>> In our case from macvlan:
> >>>>
> >>>> macvlan_start_xmit
> >>>> macvlan_queue_xmit
> >>>> dest->forward
> >>>> dev_skb_forward
> >>>>
> >>>> When we reached dev_skb_forward, that means we passed through
> >>>> dev_hard_start_xmit where the packet was already segmented so we should
> >>>> exit at the first test (skb->len< len). I don't see the point of adding
> >>>> the skb_is_gso.
> >>>> But maybe I am missing something, can you explain ?
> >>> The macvlan device only has one downstream device (slave).
> >>> If kernel is working properly, macvlan device should have a subset
> >>> of the features of the underlying device
> >> Right, dev->features = lowerdev->features& MACVLAN_FEATURES
> >>
> >>> and macvlan device should
> >>> have same MTU as underlying device.
> >> Right,
> >>
> >> ...
> >>
> >> if (!tb[IFLA_MTU])
> >> dev->mtu = lowerdev->mtu;
> >>
> >> ...
> >>> If the feature/MTU flags
> >>> were correct, then the path calling macvlan should be respecting
> >>> the MTU.
> >> But if the TSO is enabled on the macvlan (inherited from eg e1000), the
> >> packet won't be fragmented to the mtu size no ?
> > That is the responsiblity of the hardware that receives the packet.
> > Macvlan should be passing it through to the lowerdev and since the hardware
> > supports TSO, it will fragment it.
>
> Ok, but in the case the macvlan is in bridge mode, the dev_skb_forward
> function will forward the packet (which is not fragmented) to to another
> macvlan port without going through the hardware driver. In this
> function, the packet length is checked against the mtu size and of
> course the packet is dropped in case the lower device support the TSO
> (if the packet is larger than the mtu size). Dave suggested to check
> skb_is_gso and against the TSO feature of the macvlan but I don't
> understand why we should check skb_is_gso too.
>
> if (skb_is_gso(skb)&& (skb->dev&& skb->dev->features& NETIF_F_TSO))
> return true;
>
>
Then it is up to macvlan to do the same thing as bridge code.
static inline unsigned packet_length(const struct sk_buff *skb)
{
return skb->len - (skb->protocol == htons(ETH_P_8021Q) ? VLAN_HLEN : 0);
}
int br_dev_queue_push_xmit(struct sk_buff *skb)
{
/* ip_fragment doesn't copy the MAC header */
if (nf_bridge_maybe_copy_header(skb) ||
(packet_length(skb) > skb->dev->mtu && !skb_is_gso(skb))) {
kfree_skb(skb);
} else {
skb_push(skb, ETH_HLEN);
dev_queue_xmit(skb);
}
return 0;
}
Ps: not sure adding macvlan bridge mode was a good idea, we already have
a working bridge code. And there are too many missing pieces in the macvlan
implementation of bridging
--
next prev parent reply other threads:[~2011-03-16 16:45 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-03-14 20:39 [PATCH][v3] dev : fix mtu check when TSO is enabled Daniel Lezcano
2011-03-14 23:59 ` David Miller
2011-03-15 13:57 ` Daniel Lezcano
2011-03-15 18:17 ` Stephen Hemminger
2011-03-16 13:56 ` Daniel Lezcano
2011-03-16 15:35 ` Stephen Hemminger
2011-03-16 16:19 ` Daniel Lezcano
2011-03-16 16:45 ` Stephen Hemminger [this message]
2011-03-21 22:58 ` Eric W. Biederman
2011-03-22 2:31 ` Jesse Gross
2011-03-22 3:02 ` Eric W. Biederman
2011-03-22 0:05 ` Michał Mirosław
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20110316094535.5ac6acd8@nehalam \
--to=shemminger@vyatta.com \
--cc=daniel.lezcano@free.fr \
--cc=davem@davemloft.net \
--cc=eric.dumazet@gmail.com \
--cc=kaber@trash.net \
--cc=netdev@vger.kernel.org \
--cc=nightnord@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).