From mboxrd@z Thu Jan 1 00:00:00 1970 From: David Fix Subject: jfsCommit high CPU usage in kernel 2.6.25.1 x86_64 Date: Fri, 16 May 2008 08:56:49 -0400 Message-ID: <482D8491.7050006@mrxfx.com> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit To: linux-fsdevel@vger.kernel.org Return-path: Received: from mail.mrxfx.com ([209.82.53.180]:52971 "EHLO mrxfx.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1751796AbYEPNTq (ORCPT ); Fri, 16 May 2008 09:19:46 -0400 Received: from localhost (localhost.localdomain [127.0.0.1]) by mrxfx.com (Postfix) with ESMTP id A6787123DDD for ; Fri, 16 May 2008 08:56:53 -0400 (EDT) Received: from mrxfx.com ([127.0.0.1]) by localhost (mail.mrxfx.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id AtMZn0nXVhsV for ; Fri, 16 May 2008 08:56:49 -0400 (EDT) Received: from [192.168.3.139] (unknown [209.82.53.179]) by mrxfx.com (Postfix) with ESMTP id A9285123AF8 for ; Fri, 16 May 2008 08:56:49 -0400 (EDT) Sender: linux-fsdevel-owner@vger.kernel.org List-ID: Hey guys, I'm new to the list, but I've been using Linux and fooling around with the kernel for ages. :) I posted this to linux-kernel as well, but figured it might have been lost and not seen by the right eyes. :) I've been experiencing high CPU usage for jfsCommit on kernel 2.6.25.1 (haven't had a chance to go to 2.6.25.4, but I didn't see any JFS-specific changes between the versions yet). This is under a fair load, with about 50 machines accessing the data. This is x86_64, running CentOS 5. I built the kernel myself, and it's a modular kernel. Here's my hardware config: CPUs: Two Intel Xeons - Both are E5420 2.5GHz Quad-core RAM: 8GB RAID Controller: 3Ware 9650SE-24M8 I can't find the documentation of what motherboard I have in there, so I'll give a bit of lspci: 00:00.0 Host bridge: Intel Corporation 5000P Chipset Memory Controller Hub (rev b1) 00:1d.0 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset UHCI USB Controller #1 (rev 09) 00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d9) 01:00.0 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express Upstream Port (rev 01) 03:00.0 PCI bridge: Integrated Device Technology, Inc. Unknown device 8018 (rev 04) (prog-if 00 [Normal decode]) 05:00.0 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet Controller (rev 06) 07:00.0 Ethernet controller: Intel Corporation 80003ES2LAN Gigabit Ethernet Controller (Copper) (rev 01) 0c:00.0 RAID bus controller: 3ware Inc 9650SE SATA-II RAID (rev 01) 0f:0c.0 VGA compatible controller: ATI Technologies Inc ES1000 (rev 02) There's the salient points there. The NIC is a quad, which is trunked to the switch using LAPC. I've got a RAID-6 on the 3Ware controller of 20 TB, and when I'm running "top", I see this: --- top - 15:33:27 up 5:40, 3 users, load average: 4.33, 3.59, 3.98 Tasks: 315 total, 2 running, 313 sleeping, 0 stopped, 0 zombie Cpu(s): 0.2%us, 13.2%sy, 0.0%ni, 61.3%id, 24.9%wa, 0.1%hi, 0.2%si, 0.0%st Mem: 8194264k total, 8144500k used, 49764k free, 3884k buffers Swap: 16779884k total, 148k used, 16779736k free, 7667400k cached PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND 2669 root 15 -5 0 0 0 R 99 0.0 19:45.71 jfsCommit --- And people are complaining (and I'm seeing) very slow writes to the drives. Just wondering if anyone has any ideas. :) If you need any information, I'll provide whatever you need. Thanks in advance! Dave