* Hello and a question about high cpu usage on jfsCommit (kernel 2.6.25.1)
@ 2008-05-15 19:35 David Fix
2008-05-15 20:15 ` David Fix
2008-05-19 15:37 ` Dave Kleikamp
0 siblings, 2 replies; 6+ messages in thread
From: David Fix @ 2008-05-15 19:35 UTC (permalink / raw)
To: linux-kernel
Hey guys,
I'm new to the list, but I've been using Linux and fooling around with
the kernel for ages. :)
I've been experiencing high CPU usage for jfsCommit on kernel 2.6.25.1
(haven't had a chance to go to 2.6.25.4, but I didn't see any
JFS-specific changes between the versions yet). Here's my hardware
config, as well:
CPUs: 2x Intel Xeon E5420 2.5GHz Quad-core
RAM: 8GB
RAID Controller: 3Ware 9650SE-24M8
I can't find a mention of what motherboard I have in here, so I'll give
a bit of lspci:
00:00.0 Host bridge: Intel Corporation 5000P Chipset Memory Controller
Hub (rev b1)
00:1d.0 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset
UHCI USB Controller #1 (rev 09)
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d9)
01:00.0 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express
Upstream Port (rev 01)
03:00.0 PCI bridge: Integrated Device Technology, Inc. Unknown device
8018 (rev 04) (prog-if 00 [Normal decode])
05:00.0 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet
Controller (rev 06)
07:00.0 Ethernet controller: Intel Corporation 80003ES2LAN Gigabit
Ethernet Controller (Copper) (rev 01)
0c:00.0 RAID bus controller: 3ware Inc 9650SE SATA-II RAID (rev 01)
0f:0c.0 VGA compatible controller: ATI Technologies Inc ES1000 (rev 02)
There's the salient points there. The NIC is a quad, which is trunked
to the switch using LAPC.
I've got a RAID-6 on the 3Ware controller of 20 TB, and when I'm running
"top", I see this:
---
top - 15:33:27 up 5:40, 3 users, load average: 4.33, 3.59, 3.98
Tasks: 315 total, 2 running, 313 sleeping, 0 stopped, 0 zombie
Cpu(s): 0.2%us, 13.2%sy, 0.0%ni, 61.3%id, 24.9%wa, 0.1%hi, 0.2%si,
0.0%st
Mem: 8194264k total, 8144500k used, 49764k free, 3884k buffers
Swap: 16779884k total, 148k used, 16779736k free, 7667400k cached
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
2669 root 15 -5 0 0 0 R 99 0.0 19:45.71 jfsCommit
---
And people are complaining (and I'm seeing) very slow writes to the drives.
Just wondering if anyone has any ideas. :) If you need any
information, I'll provide whatever you need.
Thanks in advance!
Dave
^ permalink raw reply [flat|nested] 6+ messages in thread* Re: Hello and a question about high cpu usage on jfsCommit (kernel 2.6.25.1) 2008-05-15 19:35 Hello and a question about high cpu usage on jfsCommit (kernel 2.6.25.1) David Fix @ 2008-05-15 20:15 ` David Fix 2008-05-19 15:37 ` Dave Kleikamp 1 sibling, 0 replies; 6+ messages in thread From: David Fix @ 2008-05-15 20:15 UTC (permalink / raw) To: linux-kernel I'm sorry, I should have mentioned that this is a x86_64 build. David Fix wrote: > Hey guys, > > I'm new to the list, but I've been using Linux and fooling around with > the kernel for ages. :) > > I've been experiencing high CPU usage for jfsCommit on kernel 2.6.25.1 > (haven't had a chance to go to 2.6.25.4, but I didn't see any > JFS-specific changes between the versions yet). Here's my hardware > config, as well: > > <SNIP> ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: Hello and a question about high cpu usage on jfsCommit (kernel 2.6.25.1) 2008-05-15 19:35 Hello and a question about high cpu usage on jfsCommit (kernel 2.6.25.1) David Fix 2008-05-15 20:15 ` David Fix @ 2008-05-19 15:37 ` Dave Kleikamp 2008-05-20 12:19 ` [Jfs-discussion] " Christian Kujau 2008-05-20 13:06 ` David Fix 1 sibling, 2 replies; 6+ messages in thread From: Dave Kleikamp @ 2008-05-19 15:37 UTC (permalink / raw) To: David Fix; +Cc: linux-kernel, JFS Discussion I'm copying this to jfs-discussion to see if anyone has seen anything like this. On Thu, 2008-05-15 at 15:35 -0400, David Fix wrote: > Hey guys, > > I'm new to the list, but I've been using Linux and fooling around with > the kernel for ages. :) > > I've been experiencing high CPU usage for jfsCommit on kernel 2.6.25.1 > (haven't had a chance to go to 2.6.25.4, but I didn't see any > JFS-specific changes between the versions yet). In fact, there haven't been a whole lot of non-cosmetic changes to jfs at all recently. Nothing I see that suspicious. What was the previous kernel you were running before moving to 2.6.25.1? > Here's my hardware > config, as well: > > > CPUs: 2x Intel Xeon E5420 2.5GHz Quad-core > RAM: 8GB > RAID Controller: 3Ware 9650SE-24M8 For anyone seeing this for the first time on jfs-discussion, Dave followed up stating that this was an x86_64 build. > I can't find a mention of what motherboard I have in here, so I'll give > a bit of lspci: > > 00:00.0 Host bridge: Intel Corporation 5000P Chipset Memory Controller > Hub (rev b1) > 00:1d.0 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset > UHCI USB Controller #1 (rev 09) > 00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d9) > 01:00.0 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express > Upstream Port (rev 01) > 03:00.0 PCI bridge: Integrated Device Technology, Inc. Unknown device > 8018 (rev 04) (prog-if 00 [Normal decode]) > 05:00.0 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet > Controller (rev 06) > 07:00.0 Ethernet controller: Intel Corporation 80003ES2LAN Gigabit > Ethernet Controller (Copper) (rev 01) > 0c:00.0 RAID bus controller: 3ware Inc 9650SE SATA-II RAID (rev 01) > 0f:0c.0 VGA compatible controller: ATI Technologies Inc ES1000 (rev 02) > > There's the salient points there. The NIC is a quad, which is trunked > to the switch using LAPC. > > I've got a RAID-6 on the 3Ware controller of 20 TB, and when I'm running > "top", I see this: > > --- > top - 15:33:27 up 5:40, 3 users, load average: 4.33, 3.59, 3.98 > Tasks: 315 total, 2 running, 313 sleeping, 0 stopped, 0 zombie > Cpu(s): 0.2%us, 13.2%sy, 0.0%ni, 61.3%id, 24.9%wa, 0.1%hi, 0.2%si, > 0.0%st > Mem: 8194264k total, 8144500k used, 49764k free, 3884k buffers > Swap: 16779884k total, 148k used, 16779736k free, 7667400k cached > > PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND > 2669 root 15 -5 0 0 0 R 99 0.0 19:45.71 jfsCommit This is highly unusual. Is this thread continually eating cpu at this rate, or does it happen in spurts? > And people are complaining (and I'm seeing) very slow writes to the drives. > > Just wondering if anyone has any ideas. :) If you need any > information, I'll provide whatever you need. Has this happened more than once (have you rebooted and still seen the problem)? I'm not sure if some rare bug has caused some kind of linked list corruption that puts the thread into an infinite loop, or if this is a real regression. > Thanks in advance! > > Dave Thanks, Shaggy -- David Kleikamp IBM Linux Technology Center ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [Jfs-discussion] Hello and a question about high cpu usage on jfsCommit (kernel 2.6.25.1) 2008-05-19 15:37 ` Dave Kleikamp @ 2008-05-20 12:19 ` Christian Kujau 2008-05-20 13:06 ` David Fix 1 sibling, 0 replies; 6+ messages in thread From: Christian Kujau @ 2008-05-20 12:19 UTC (permalink / raw) To: Dave Kleikamp; +Cc: David Fix, JFS Discussion, linux-kernel On Mon, May 19, 2008 17:37, Dave Kleikamp wrote: >> PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND >> 2669 root 15 -5 0 0 0 R 99 0.0 19:45.71 jfsCommit >> > This is highly unusual. Is this thread continually eating cpu at this > rate, or does it happen in spurts? Would oprofile[0] help to see what jfsCommit is doing here? C. [0] http://oprofile.sf.net -- make bzImage, not war ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: Hello and a question about high cpu usage on jfsCommit (kernel 2.6.25.1) 2008-05-19 15:37 ` Dave Kleikamp 2008-05-20 12:19 ` [Jfs-discussion] " Christian Kujau @ 2008-05-20 13:06 ` David Fix 2008-05-20 14:10 ` Dave Kleikamp 1 sibling, 1 reply; 6+ messages in thread From: David Fix @ 2008-05-20 13:06 UTC (permalink / raw) To: Dave Kleikamp; +Cc: linux-kernel, JFS Discussion Hey Dave, Thanks for following up on this... The previous kernel that I was running was 2.6.18.1... From CentOS 5.1. It appears that the thread eating up that much CPU isn't a continuous happening, only when there's a fair amount of activity going on. It's hard to nail down exactly when it happens, but the next time it does, I'll definitely let you all know! I haven't been able to reboot this machine, as it's a production unit, but if I do get the chance, I'll do so. It seems to have leveled out now, with there being no high usage at all on there right now. Dave Dave Kleikamp wrote: > I'm copying this to jfs-discussion to see if anyone has seen anything > like this. > > On Thu, 2008-05-15 at 15:35 -0400, David Fix wrote: > >> Hey guys, >> >> I'm new to the list, but I've been using Linux and fooling around with >> the kernel for ages. :) >> >> I've been experiencing high CPU usage for jfsCommit on kernel 2.6.25.1 >> (haven't had a chance to go to 2.6.25.4, but I didn't see any >> JFS-specific changes between the versions yet). >> > > In fact, there haven't been a whole lot of non-cosmetic changes to jfs > at all recently. Nothing I see that suspicious. > > What was the previous kernel you were running before moving to 2.6.25.1? > > >> Here's my hardware >> config, as well: >> >> >> CPUs: 2x Intel Xeon E5420 2.5GHz Quad-core >> RAM: 8GB >> RAID Controller: 3Ware 9650SE-24M8 >> > > For anyone seeing this for the first time on jfs-discussion, Dave > followed up stating that this was an x86_64 build. > > >> I can't find a mention of what motherboard I have in here, so I'll give >> a bit of lspci: >> >> 00:00.0 Host bridge: Intel Corporation 5000P Chipset Memory Controller >> Hub (rev b1) >> 00:1d.0 USB Controller: Intel Corporation 631xESB/632xESB/3100 Chipset >> UHCI USB Controller #1 (rev 09) >> 00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev d9) >> 01:00.0 PCI bridge: Intel Corporation 6311ESB/6321ESB PCI Express >> Upstream Port (rev 01) >> 03:00.0 PCI bridge: Integrated Device Technology, Inc. Unknown device >> 8018 (rev 04) (prog-if 00 [Normal decode]) >> 05:00.0 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet >> Controller (rev 06) >> 07:00.0 Ethernet controller: Intel Corporation 80003ES2LAN Gigabit >> Ethernet Controller (Copper) (rev 01) >> 0c:00.0 RAID bus controller: 3ware Inc 9650SE SATA-II RAID (rev 01) >> 0f:0c.0 VGA compatible controller: ATI Technologies Inc ES1000 (rev 02) >> >> There's the salient points there. The NIC is a quad, which is trunked >> to the switch using LAPC. >> >> I've got a RAID-6 on the 3Ware controller of 20 TB, and when I'm running >> "top", I see this: >> >> --- >> top - 15:33:27 up 5:40, 3 users, load average: 4.33, 3.59, 3.98 >> Tasks: 315 total, 2 running, 313 sleeping, 0 stopped, 0 zombie >> Cpu(s): 0.2%us, 13.2%sy, 0.0%ni, 61.3%id, 24.9%wa, 0.1%hi, 0.2%si, >> 0.0%st >> Mem: 8194264k total, 8144500k used, 49764k free, 3884k buffers >> Swap: 16779884k total, 148k used, 16779736k free, 7667400k cached >> >> PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND >> 2669 root 15 -5 0 0 0 R 99 0.0 19:45.71 jfsCommit >> > > This is highly unusual. Is this thread continually eating cpu at this > rate, or does it happen in spurts? > > >> And people are complaining (and I'm seeing) very slow writes to the drives. >> >> Just wondering if anyone has any ideas. :) If you need any >> information, I'll provide whatever you need. >> > > Has this happened more than once (have you rebooted and still seen the > problem)? I'm not sure if some rare bug has caused some kind of linked > list corruption that puts the thread into an infinite loop, or if this > is a real regression. > > >> Thanks in advance! >> >> Dave >> > > Thanks, > Shaggy ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: Hello and a question about high cpu usage on jfsCommit (kernel 2.6.25.1) 2008-05-20 13:06 ` David Fix @ 2008-05-20 14:10 ` Dave Kleikamp 0 siblings, 0 replies; 6+ messages in thread From: Dave Kleikamp @ 2008-05-20 14:10 UTC (permalink / raw) To: David Fix; +Cc: linux-kernel, JFS Discussion, Christian Kujau On Tue, 2008-05-20 at 09:06 -0400, David Fix wrote: > Hey Dave, > > Thanks for following up on this... The previous kernel that I was > running was 2.6.18.1... From CentOS 5.1. > > It appears that the thread eating up that much CPU isn't a continuous > happening, only when there's a fair amount of activity going on. It's > hard to nail down exactly when it happens, but the next time it does, > I'll definitely let you all know! Since it's only happening during periods of high activity, it doesn't seem as serious a problem as I suspected it may be. Several threads operating on a lot of files and/or directories at one time could put a lot on the jfsCommit thread's queue, even though 99% cpu utilization still seems high. > I haven't been able to reboot this machine, as it's a production unit, > but if I do get the chance, I'll do so. Since the thread isn't stuck in some kind of infinite loop, I don't really think a reboot will make any difference. When I asked I wasn't sure if the thread was continuously munching cpu. > It seems to have leveled out > now, with there being no high usage at all on there right now. If you see it again, Christian's suggesting of running oprofile to see where the the cpu is spending most of its time might be helpful. Thanks, Shaggy -- David Kleikamp IBM Linux Technology Center ^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2008-05-20 14:10 UTC | newest] Thread overview: 6+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2008-05-15 19:35 Hello and a question about high cpu usage on jfsCommit (kernel 2.6.25.1) David Fix 2008-05-15 20:15 ` David Fix 2008-05-19 15:37 ` Dave Kleikamp 2008-05-20 12:19 ` [Jfs-discussion] " Christian Kujau 2008-05-20 13:06 ` David Fix 2008-05-20 14:10 ` Dave Kleikamp
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox