From mboxrd@z Thu Jan 1 00:00:00 1970 From: Bob Kryger Subject: nfs write problems Date: Tue, 02 Oct 2007 15:56:41 -0400 Message-ID: <4702A279.3080001@panix.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" To: nfs@lists.sourceforge.net, For users of Fedora , Bob Kryger Return-path: Received: from sc8-sf-mx2-b.sourceforge.net ([10.3.1.92] helo=mail.sourceforge.net) by sc8-sf-list2-new.sourceforge.net with esmtp (Exim 4.43) id 1IcnrD-0001uF-Hb for nfs@lists.sourceforge.net; Tue, 02 Oct 2007 12:56:42 -0700 Received: from mail3.panix.com ([166.84.1.74]) by mail.sourceforge.net with esmtp (Exim 4.44) id 1IcnrH-0008NV-Cw for nfs@lists.sourceforge.net; Tue, 02 Oct 2007 12:56:44 -0700 List-Id: "Discussion of NFS under Linux development, interoperability, and testing." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: nfs-bounces@lists.sourceforge.net Errors-To: nfs-bounces@lists.sourceforge.net So, I have a relatively new system on which I am seeing strange NFS behavior. In short I am getting seemingly random errors in files written via NFS. * I do not get the errors if I write files locally. * I have no errors in the NIC, I even tried a second NIC in a PCI slot as opposed to the onboard one. There are no errors recorded on the NIC or the switch on a 1Gb port. * I see no memory errors, I ran memtest for 3 days clean. * To test I am using dd if=/dev/zero of various (large) file sizes. * Since I know that the file should be all zeros I wrote a C program to read it back and tell me where it finds non-zero bytes. The program results are confirmed with od. * The files read back always have the errors in the same place, so it is not a problem with reading the files. * There are no errors in any logs. * The problem occurs on both the RAID1 (ext3) and RAID10 (xfs) filesystems. * I've tried two clients, both FC5 one 64bit, and the other a 32 bit with the same results. This error was uncovered by users attempting to write files from other systems and other Fedora releases, so it is repeatable regardless of the client. * the server is not running anything else and spends a large portion of the time idle. loadaverages are quite low. swap is mostly unused. a large portion of RAM is allocated to file cache, but I expect that this would be normal for this amount of file IO. The server is running an up-to-date FC6, although this also occurred with FC5. I am about to try F7. Hardware is an AMD 1220 dual core 64bit, on a Tyan K8SSA S3950 with an Adaptec Raid 2230SLP and 7 Fujitsu MAU3147NC. The RAID config is that 1 disks (on diff channels) are in a Mirror for the OS, 4 are in a Raid 10 config and 1 is a hot spare. Anyone ever seen anything like this before? Suggest where I might look next? Additional tests? ------------------------------------------------------------------------- This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/ _______________________________________________ NFS maillist - NFS@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nfs