From: Vincent Blut <vincent.debian@free.fr>
To: Stephen Hemminger <shemminger@vyatta.com>
Cc: netdev@vger.kernel.org,
Debian Bug Tracking System <609994@bugs.debian.org>
Subject: Re: sky2: hw csum failure
Date: Mon, 28 Nov 2011 19:20:45 +0000 [thread overview]
Message-ID: <4ED3DF0D.8030904@free.fr> (raw)
In-Reply-To: <20111128085030.248ebcb7@nehalam.linuxnetplumber.net>
Le 28/11/2011 16:50, Stephen Hemminger a écrit :
> On Mon, 28 Nov 2011 12:10:20 +0000
> Vincent Blut <vincent.debian@free.fr> wrote:
>
>
>> Hi,
>>
>> [reference: http://bugs.debian.org/609994]
>>
>> I have a Marvell ethernet controller which presents some failures when
>> 'rx checksumming' is enabled,
>> here is the model:
>>
>> $ lspci -vvs 03:00.0
>> 03:00.0 Ethernet controller: Marvell Technology Group Ltd. 88E8053 PCI-E
>> Gigabit Ethernet Controller (rev 15)
>> Subsystem: Micro-Star International Co., Ltd. Marvell 88E8053
>> Gigabit Ethernet Controller (MSI)
>> Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop-
>> ParErr- Stepping- SERR- FastB2B- DisINTx+
>> Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort-
>> <TAbort- <MAbort- >SERR- <PERR- INTx-
>> Latency: 0, Cache Line Size: 32 bytes
>> Interrupt: pin A routed to IRQ 44
>> Region 0: Memory at fdbfc000 (64-bit, non-prefetchable) [size=16K]
>> Region 2: I/O ports at 7c00 [size=256]
>> [virtual] Expansion ROM at fda00000 [disabled] [size=128K]
>> Capabilities: <access denied>
>> Kernel driver in use: sky2
>>
>> At first I thought it was due to the MTU size, so I tested different
>> values but unfortunately without positive effect.
>> Overall this issue appears randomly when the incoming traffic is high. I
>> tested 2.6.32, 3.1.1, and 3.2-rc3, sadly
>> all are affected. Finally, the only way to avoid those failures is to
>> disabled 'rx checksumming' (ethtool -K ethX rx off).
>>
>> Here is the stack trace:
>>
>> [ 14.615648] sky2 0000:03:00.0: eth1: enabling interface
>> [ 14.616452] ADDRCONF(NETDEV_UP): eth1: link is not ready
>> [ 17.094194] sky2 0000:03:00.0: eth1: Link is up at 1000 Mbps, full
>> duplex, flow control both
>> [ 17.094887] ADDRCONF(NETDEV_CHANGE): eth1: link becomes ready
>> [ 28.080018] eth1: no IPv6 routers present
>> [ 563.816032] sky2 0000:03:00.0: eth1: hung mac 124:22 fifo 195 (150:145)
>> [ 563.816036] sky2 0000:03:00.0: eth1: receiver hang detected
>> [ 567.005422] sky2 0000:03:00.0: eth1: Link is up at 1000 Mbps, full
>> duplex, flow control both
>> [ 1040.816314] sky2 0000:03:00.0: eth1: rx error, status 0x7ffc0001
>> length 1004
>> [ 2097.401616] sky2 0000:03:00.0: eth1: rx error, status 0x39a339a3 length 0
>>
> This isn't really a hardware checksum failure.
> Your problem is deeper than that. The internal parts of the chip are not
> communicating correctly. The "hung mac" is a problem only occurs if the PCI
> is really stuck. There may be a timing issue on your motherboard, or the BIOS
> isn't setting up the device properly. The timing then gets messed up between
> the end of frame status and the PCI shared memory region. Turning checksum
> off masks the problem, but the status is probably still corrupt.
>
>
IMO, turning checksum off does more than masking the problem because it
prevents the
connection to stop working or even freeze the system.
> In either case the problem is beyond the ability of the driver to fix or workaround.
> Your best bet is to see if there is a BIOS update, or replace the hardware.
>
Sadly the last BIOS update was in 2006, so I don't think I can rely on
the manufacturer.
But again, deactivate the capability of checksum by NIC hardware makes
the interface stable since March.
Regards,
Vincent
next prev parent reply other threads:[~2011-11-28 18:20 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-11-28 12:10 sky2: hw csum failure Vincent Blut
2011-11-28 16:50 ` Stephen Hemminger
2011-11-28 19:20 ` Vincent Blut [this message]
-- strict thread matches above, loose matches on Subject: below --
2011-11-15 9:05 sky2 " Martin Volf
2011-11-15 10:28 ` Yan, Zheng
2011-11-15 17:45 ` Stephen Hemminger
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4ED3DF0D.8030904@free.fr \
--to=vincent.debian@free.fr \
--cc=609994@bugs.debian.org \
--cc=netdev@vger.kernel.org \
--cc=shemminger@vyatta.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.