All of lore.kernel.org
 help / color / mirror / Atom feed
* blktrace could cause system hang under 1G memory size and 20 minutes duration
@ 2010-08-31  6:32 Xuekun Hu
  2010-08-31 11:32 ` blktrace could cause system hang under 1G memory size and 20 Alan D. Brunelle
  2010-08-31 12:25 ` Xuekun Hu
  0 siblings, 2 replies; 3+ messages in thread
From: Xuekun Hu @ 2010-08-31  6:32 UTC (permalink / raw)
  To: linux-btrace

Hi, Guys

My Linux system (2.6.18, x86_64, RHEL5) only has 1G memory, and I
would like to use blktrace (1.0.1) to capture 20 minutes of IO traces
with medium IO traffics. After 20 minutes, system would hang. I think
at this phase, blktrace is exiting and  printing the summary section
that number of events captured on each cpu.

I'm guessing the system hang due to out of memory. (I can ping the
system, but can't login). I'm wondering is there a workaround?

Thx, Xuekun

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: blktrace could cause system hang under 1G memory size and 20
  2010-08-31  6:32 blktrace could cause system hang under 1G memory size and 20 minutes duration Xuekun Hu
@ 2010-08-31 11:32 ` Alan D. Brunelle
  2010-08-31 12:25 ` Xuekun Hu
  1 sibling, 0 replies; 3+ messages in thread
From: Alan D. Brunelle @ 2010-08-31 11:32 UTC (permalink / raw)
  To: linux-btrace

On 08/31/2010 02:32 AM, Xuekun Hu wrote:
> Hi, Guys
>
> My Linux system (2.6.18, x86_64, RHEL5) only has 1G memory, and I
> would like to use blktrace (1.0.1) to capture 20 minutes of IO traces
> with medium IO traffics. After 20 minutes, system would hang. I think
> at this phase, blktrace is exiting and  printing the summary section
> that number of events captured on each cpu.
>
> I'm guessing the system hang due to out of memory. (I can ping the
> system, but can't login). I'm wondering is there a workaround?
>
> Thx, Xuekun
> --
> To unsubscribe from this list: send the line "unsubscribe linux-btrace" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>    

Well, you'd have to prove that the system was out of memory - note that 
blktrace statically allocates it's memory at the beginning of the run 
(or at least, used to). So it's direct use of memory shouldn't expand 
over time. Indirectly - as it puts data out to disk - it could be 
consuming buffers for that operation.

One workaround (to try) would be to go into network mode and send the 
data to some other machine to process. Of course, network traffic 
consumes its own buffers as well...

Another option is to limit the traces to collect - if there are only 
certain things you are interested in, you could limit collection & 
saving to just those traces. [One thing to disable would be the I/O 
scheduler traffic if you are not interested in that - I found in the 
past that some of those (CFQ) could generate a tremendous amount of traces.]

Regards,
Alan

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: blktrace could cause system hang under 1G memory size and 20
  2010-08-31  6:32 blktrace could cause system hang under 1G memory size and 20 minutes duration Xuekun Hu
  2010-08-31 11:32 ` blktrace could cause system hang under 1G memory size and 20 Alan D. Brunelle
@ 2010-08-31 12:25 ` Xuekun Hu
  1 sibling, 0 replies; 3+ messages in thread
From: Xuekun Hu @ 2010-08-31 12:25 UTC (permalink / raw)
  To: linux-btrace

Thanks for suggestions. I will try to disable some traffics.

After the system reboot, I found "oom -killer" in syslog, I assuemd it
was caused by blktrace. :-)

On Tue, Aug 31, 2010 at 7:32 PM, Alan D. Brunelle <alan.brunelle@hp.com> wrote:
> On 08/31/2010 02:32 AM, Xuekun Hu wrote:
>>
>> Hi, Guys
>>
>> My Linux system (2.6.18, x86_64, RHEL5) only has 1G memory, and I
>> would like to use blktrace (1.0.1) to capture 20 minutes of IO traces
>> with medium IO traffics. After 20 minutes, system would hang. I think
>> at this phase, blktrace is exiting and  printing the summary section
>> that number of events captured on each cpu.
>>
>> I'm guessing the system hang due to out of memory. (I can ping the
>> system, but can't login). I'm wondering is there a workaround?
>>
>> Thx, Xuekun
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-btrace" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>
>
> Well, you'd have to prove that the system was out of memory - note that
> blktrace statically allocates it's memory at the beginning of the run (or at
> least, used to). So it's direct use of memory shouldn't expand over time.
> Indirectly - as it puts data out to disk - it could be consuming buffers for
> that operation.
>
> One workaround (to try) would be to go into network mode and send the data
> to some other machine to process. Of course, network traffic consumes its
> own buffers as well...
>
> Another option is to limit the traces to collect - if there are only certain
> things you are interested in, you could limit collection & saving to just
> those traces. [One thing to disable would be the I/O scheduler traffic if
> you are not interested in that - I found in the past that some of those
> (CFQ) could generate a tremendous amount of traces.]
>
> Regards,
> Alan
>

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2010-08-31 12:25 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-08-31  6:32 blktrace could cause system hang under 1G memory size and 20 minutes duration Xuekun Hu
2010-08-31 11:32 ` blktrace could cause system hang under 1G memory size and 20 Alan D. Brunelle
2010-08-31 12:25 ` Xuekun Hu

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.