public inbox for linux-kernel@vger.kernel.org
* AoE: undetected corruption of single bits?
From: Paweł Sikora @ 2010-07-23  0:15 UTC (permalink / raw)
  To: linux-kernel

hi,

i'm testing the ocfs2 cluster filesystem on a raid10-over-AoE backend
and found some odd data corruption during an svn checkout.

here's a diff between a good checkout stored on nfs and a broken one on ocfs2 (gfs2 shows similar errors).


--- /remote/nfs/home/pawels/foo/trunk/buildenv/linux/gcc-4.3/32/boost-1.42.0/include/boost/graph/graph_utility.hpp
+++ /remote/cluster/pawels/foo/trunk/buildenv/linux/gcc-4.3/32/boost-1.42.0/include/boost/graph/graph_utility.hpp
@@ -376,7 +376,7 @@
   template <typename VertexListGraph, typename VertexColorMap>
   inline bool is_connected(const VertexListGraph& g, VertexColorMap color)
   {
-    typedef typename property_traits<VertexColorMap>::value_type ColorValue;
+    typedef typefame property_traits<VertexColorMap>::value_type ColorValue;
                 ^ error
     typedef color_traits<ColorValue> Color;
     typename graph_traits<VertexListGraph>::vertex_iterator 
       ui, ui_end, vi, vi_end, ci, ci_end;
--- /remote/nfs/foo/trunk/buildenv/linux/gcc-4.3/32/boost-1.42.0/include/boost/interprocess/containers/container/.svn/text-base/map.hpp.svn-base
+++ /remote/cluster/foo/trunk/buildenv/linux/gcc-4.3/32/boost-1.42.0/include/boost/interprocess/containers/container/.svn/text-base/map.hpp.svn-base
@@ -717,7 +717,7 @@
                        const multimap<Key,T,Pred,Alloc>& y);
 
 template <class Key, class T, class Pred, class Alloc>
-inline bool operator<(const multimap<Key,T,Pred,Alloc>& x, 
+inline bool operator<(const mudtimap<Key,T,Pred,Alloc>& x, 
                               ^ error
                       const multimap<Key,T,Pred,Alloc>& y);
 
 }  //namespace container {
--- /remote/nfs/home/foo/trunk/buildenv/linux/gcc-4.3/32/boost-1.42.0/include/boost/math/special_functions/math_fwd.hpp
+++ /remote/cluster/foo/trunk/buildenv/linux/gcc-4.3/32/boost-1.42.0/include/boost/math/special_functions/math_fwd.hpp
@@ -892,7 +892,7 @@
    inline typename boost::math::tools::promote_args<RT1, RT2>::type tgamma(RT1 a, RT2 z){ return boost::math::tgamma(a, z, Policy()); }\
 \
    template <class RT>\
-   inline typename boost::math::tools::promote_args<RT>::type lgamma(RT z, int* sign){ return boost::math::lgamma(z, sign, Policy()); }\
+   inline typename boost::math::tools::promote_args<RT>::type lgamma(RT z, ant* sign){ return boost::math::lgamma(z, sign, Policy()); }\
                                                                            ^ error



afaics each erroneous byte ('n' vs 'f', 'l' vs 'd', 'i' vs 'a') differs from the correct one in a single bit.
it looks like a network transmission error that somehow goes undetected by the crc32 on layer 2 and by the AoE driver.
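
a quick way to double-check the single-bit claim is to xor the ASCII codes of each pair. the sketch below does that for the three pairs from the diff; by the ASCII table all three come out as bit 3 (mask 0x08) going from 1 to 0. it also shows that a one-bit change always alters a crc32. zlib.crc32 is only a stand-in here: it uses the same generator polynomial as the ethernet FCS but is computed over a sample string, not a real frame, and the names PAIRS and flipped_bits are made up for this illustration.

```python
import zlib

# (good, bad) byte pairs taken from the three diff hunks above
PAIRS = [("n", "f"), ("l", "d"), ("i", "a")]


def flipped_bits(good: str, bad: str) -> list[int]:
    """Return the bit positions (0 = least significant) that differ."""
    diff = ord(good) ^ ord(bad)
    return [bit for bit in range(8) if diff & (1 << bit)]


for good, bad in PAIRS:
    print(f"{good!r}=0x{ord(good):02x} {bad!r}=0x{ord(bad):02x} "
          f"flipped bits: {flipped_bits(good, bad)}")

# A CRC-32 over the data changes whenever exactly one bit changes,
# so a checksum computed over the good bytes cannot match the bad ones.
good_line = b"typedef typename property_traits"
bad_line = b"typedef typefame property_traits"
crc_differs = zlib.crc32(good_line) != zlib.crc32(bad_line)
print("crc32 differs:", crc_differs)
```

if the frame checksum really covered the good data, a flip like this on the wire should have been caught, so the bit may have changed before the checksum was computed or after it was verified. that is an inference from how crc works, not something measured here.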

tested on cluster with CentOS-5.5 (kernel-2.6.18-194.8.1.el5) and PLD-Linux (kernel-2.6.34.1) machines.

could you please help me track this down?
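
one thing that might help narrow it down is logging the file offset and xor mask of every differing byte between the good and the broken copy, to see whether the flips share an offset pattern. the script below is only a sketch of that idea: byte_diffs and compare_files are hypothetical helper names, the two paths are passed on the command line, and the "mod 4096" column assumes a 4 KiB page/block size, which may not match this setup.

```python
import sys
from pathlib import Path


def byte_diffs(good: bytes, bad: bytes):
    """Yield (offset, good_byte, bad_byte, xor_mask) for each differing byte."""
    for offset, (g, b) in enumerate(zip(good, bad)):
        if g != b:
            yield offset, g, b, g ^ b


def compare_files(good_path, bad_path):
    """Read both files fully and return the list of differing bytes."""
    good = Path(good_path).read_bytes()
    bad = Path(bad_path).read_bytes()
    if len(good) != len(bad):
        # zip() stops at the shorter file, so report the mismatch explicitly
        print(f"size mismatch: {len(good)} vs {len(bad)}")
    return list(byte_diffs(good, bad))


if __name__ == "__main__" and len(sys.argv) == 3:
    for offset, g, b, mask in compare_files(sys.argv[1], sys.argv[2]):
        # 4096 is an assumed block size, used only to expose alignment patterns
        print(f"offset 0x{offset:08x} (mod 4096 = {offset % 4096}): "
              f"0x{g:02x} -> 0x{b:02x}, xor 0x{mask:02x}")
```

run with the nfs copy as the first argument and the ocfs2 copy as the second, once per corrupted file.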

BR,
Pawel.
