Netdev List
* SW csum errors
@ 2013-10-14 20:13 Kyle Hubert
  2013-10-14 20:40 ` Eric Dumazet
  2013-10-14 20:58 ` Stephen Hemminger
  0 siblings, 2 replies; 6+ messages in thread
From: Kyle Hubert @ 2013-10-14 20:13 UTC (permalink / raw)
  To: netdev

My problem is rather specific. I am working on an RDMA device, and we
have full end-to-end reliability. However, one of the initial spins of
our chip had some errors, since fixed, where the csum was unreliable.
So, we did exactly what Dave Miller warned not to do in the linked
message: we ran outgoing IP packets through the SKB checksum
function. Unfortunately, we occasionally saw NFS csum errors on full
MTU packets.

Here is his response:

http://marc.info/?l=linux-netdev&m=128286758300676&w=2

Relevant portion:

"
Paged SKBs can have references to page cache pages and similar.  These
can be updated asynchronously to the transmit, there is no locking at
all to freeze the contents, and therefore full checksum offload is
required to support SG correctly.

So don't get the idea to do the checksum in software in the infiniband
layer, and advertize hw checksumming support, to get around this :-)
"

Now that those chips are long gone, I am left wondering about these
packets that were "corrupted" before the device transferred them. Can
I get more information about these paged SKBs with asynchronous
modifications? How does NFS use them?

Thanks for your time,
-Kyle
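
The race described in the quoted passage can be illustrated with a small
user-space sketch. This is not kernel code and makes several assumptions:
a bytearray stands in for a page cache page, a memoryview stands in for a
paged SKB fragment that references it without copying, and the names
internet_checksum, transmit and receiver_accepts are hypothetical helpers
invented for this illustration. It only aims to show why a checksum
computed in software before the device reads the data can go stale, while
a checksum computed over the bytes actually sent cannot.

```python
def internet_checksum(data) -> int:
    """16-bit one's-complement checksum in the style of RFC 1071."""
    if len(data) % 2:
        data = bytes(data) + b"\x00"  # pad odd-length input with a zero byte
    total = 0
    for i in range(0, len(data), 2):
        total += (data[i] << 8) | data[i + 1]
    while total >> 16:  # fold carries back into the low 16 bits
        total = (total & 0xFFFF) + (total >> 16)
    return ~total & 0xFFFF


def transmit(page: bytearray, sw_checksum: bool, modify_before_dma: bool):
    """Model one transmit of a fragment that references `page` without copying."""
    frag = memoryview(page)  # zero-copy reference, like a paged fragment
    csum = internet_checksum(frag) if sw_checksum else None

    if modify_before_dma:
        # Stand-in for an asynchronous update of the page after the software
        # checksum was taken. Nothing freezes the contents.
        page[0] ^= 0xFF

    wire = bytes(frag)  # stand-in for the device reading the page at DMA time

    if csum is None:
        # Stand-in for full checksum offload: the checksum covers exactly
        # the bytes that were read for transmission.
        csum = internet_checksum(wire)
    return wire, csum


def receiver_accepts(wire: bytes, csum: int) -> bool:
    """The receiver recomputes the checksum over what it actually got."""
    return internet_checksum(wire) == csum


if __name__ == "__main__":
    payload = b"NFS write payload " * 4
    cases = [
        ("software csum, page changed", True, True),
        ("software csum, page stable", True, False),
        ("offloaded csum, page changed", False, True),
    ]
    for label, sw, modify in cases:
        wire, csum = transmit(bytearray(payload), sw, modify)
        print(label, "->", "accepted" if receiver_accepts(wire, csum) else "csum error")
```

In this model only the first case fails verification, which matches the
symptom of occasional checksum errors when the checksum is computed in
software over data that can still change before the device reads it.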


end of thread, other threads:[~2013-10-16 15:58 UTC | newest]

Thread overview: 6+ messages
2013-10-14 20:13 SW csum errors Kyle Hubert
2013-10-14 20:40 ` Eric Dumazet
2013-10-14 20:58 ` Stephen Hemminger
2013-10-16 15:10   ` Kyle Hubert
2013-10-16 15:24     ` Eric Dumazet
2013-10-16 15:58       ` Kyle Hubert
