All of lore.kernel.org
 help / color / mirror / Atom feed
* getting kernel debug output
@ 2012-10-24 17:11 Sage Weil
  2012-10-24 18:14 ` Alex Elder
  0 siblings, 1 reply; 4+ messages in thread
From: Sage Weil @ 2012-10-24 17:11 UTC (permalink / raw)
  To: elder; +Cc: ceph-devel

I'm working on http://tracker.newdream.net/issues/3342 and was able to 
reproduce the msgr bug (some annoying msgr race I think) while generating 
full libceph debug output.  I used a teuthology yaml fragment like so:

tasks:
- clock: null
- ceph:
    log-whitelist:
    - wrongly marked me down
    - objects unfound and apparently lost
- thrashosds: null
- kclient: null
- exec:
    client.0:
      - echo 'module libceph +p' > /sys/kernel/debug/dynamic_debug/control
- workunit:
    clients:
      all:
      - suites/ffsb.sh

I was pleasantly surprised that even though this is putting copious 
amounts of crap in dmesg it didn't slow things down enough to avoid 
tripping the bug.  And the 'dmesg' command in kdb appears to be working 
now (a couple months back it wasn't).  Yay!

Anyway, this might be useful in tracking down other bugs as well...

sage

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: getting kernel debug output
  2012-10-24 17:11 getting kernel debug output Sage Weil
@ 2012-10-24 18:14 ` Alex Elder
  2012-10-24 18:25   ` David Zafman
  2012-10-24 18:32   ` Sage Weil
  0 siblings, 2 replies; 4+ messages in thread
From: Alex Elder @ 2012-10-24 18:14 UTC (permalink / raw)
  To: Sage Weil; +Cc: ceph-devel

On 10/24/2012 12:11 PM, Sage Weil wrote:
> I'm working on http://tracker.newdream.net/issues/3342 and was able to 
> reproduce the msgr bug (some annoying msgr race I think) while generating 
> full libceph debug output.  I used a teuthology yaml fragment like so:

I have more trouble than that, but perhaps there's something
weird about having my serial console connected from 1500 miles
away.  I'm impressed full debugging didn't mess things up.

> tasks:
> - clock: null
> - ceph:
>     log-whitelist:
>     - wrongly marked me down
>     - objects unfound and apparently lost
> - thrashosds: null
> - kclient: null
> - exec:
>     client.0:
>       - echo 'module libceph +p' > /sys/kernel/debug/dynamic_debug/control

This is cool, I didn't know you could do this.

> - workunit:
>     clients:
>       all:
>       - suites/ffsb.sh
> 
> I was pleasantly surprised that even though this is putting copious 
> amounts of crap in dmesg it didn't slow things down enough to avoid 
> tripping the bug.  And the 'dmesg' command in kdb appears to be working 
> now (a couple months back it wasn't).  Yay!

For me, dmesg has been working, but I'd like to know how to
truncate the output to just, say, the last 200 lines.  (Maybe
there is one.)

> Anyway, this might be useful in tracking down other bugs as well...

Yes, this is good news.

					-Alex

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: getting kernel debug output
  2012-10-24 18:14 ` Alex Elder
@ 2012-10-24 18:25   ` David Zafman
  2012-10-24 18:32   ` Sage Weil
  1 sibling, 0 replies; 4+ messages in thread
From: David Zafman @ 2012-10-24 18:25 UTC (permalink / raw)
  To: Alex Elder; +Cc: Sage Weil, ceph-devel


I also added a kcon_most teuthology task which does almost the same thing as ceph/src/script/kcon_most.sh to all or any set of clients.  The teuthology version does not raise the console log level.

For example:

tasks:
- ceph:
- kclient:
- kcon_most:
- interactive:


On Oct 24, 2012, at 11:14 AM, Alex Elder <elder@inktank.com> wrote:

> On 10/24/2012 12:11 PM, Sage Weil wrote:
>> I'm working on http://tracker.newdream.net/issues/3342 and was able to 
>> reproduce the msgr bug (some annoying msgr race I think) while generating 
>> full libceph debug output.  I used a teuthology yaml fragment like so:
> 
> I have more trouble than that, but perhaps there's something
> weird about having my serial console connected from 1500 miles
> away.  I'm impressed full debugging didn't mess things up.
> 
>> tasks:
>> - clock: null
>> - ceph:
>>    log-whitelist:
>>    - wrongly marked me down
>>    - objects unfound and apparently lost
>> - thrashosds: null
>> - kclient: null
>> - exec:
>>    client.0:
>>      - echo 'module libceph +p' > /sys/kernel/debug/dynamic_debug/control
> 
> This is cool, I didn't know you could do this.
> 
>> - workunit:
>>    clients:
>>      all:
>>      - suites/ffsb.sh
>> 
>> I was pleasantly surprised that even though this is putting copious 
>> amounts of crap in dmesg it didn't slow things down enough to avoid 
>> tripping the bug.  And the 'dmesg' command in kdb appears to be working 
>> now (a couple months back it wasn't).  Yay!
> 
> For me, dmesg has been working, but I'd like to know how to
> truncate the output to just, say, the last 200 lines.  (Maybe
> there is one.)
> 
>> Anyway, this might be useful in tracking down other bugs as well...
> 
> Yes, this is good news.
> 
> 					-Alex
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: getting kernel debug output
  2012-10-24 18:14 ` Alex Elder
  2012-10-24 18:25   ` David Zafman
@ 2012-10-24 18:32   ` Sage Weil
  1 sibling, 0 replies; 4+ messages in thread
From: Sage Weil @ 2012-10-24 18:32 UTC (permalink / raw)
  To: Alex Elder; +Cc: ceph-devel

On Wed, 24 Oct 2012, Alex Elder wrote:
> On 10/24/2012 12:11 PM, Sage Weil wrote:
> > I'm working on http://tracker.newdream.net/issues/3342 and was able to 
> > reproduce the msgr bug (some annoying msgr race I think) while generating 
> > full libceph debug output.  I used a teuthology yaml fragment like so:
> 
> I have more trouble than that, but perhaps there's something
> weird about having my serial console connected from 1500 miles
> away.  I'm impressed full debugging didn't mess things up.

The serial doesn't have to be attached until it crashes.  And in any case, 
the output doesn't go to the console because it's pr_debug() or whatever.  

> > tasks:
> > - clock: null
> > - ceph:
> >     log-whitelist:
> >     - wrongly marked me down
> >     - objects unfound and apparently lost
> > - thrashosds: null
> > - kclient: null
> > - exec:
> >     client.0:
> >       - echo 'module libceph +p' > /sys/kernel/debug/dynamic_debug/control
> 
> This is cool, I didn't know you could do this.

I just added it yesterday :)

> > - workunit:
> >     clients:
> >       all:
> >       - suites/ffsb.sh
> > 
> > I was pleasantly surprised that even though this is putting copious 
> > amounts of crap in dmesg it didn't slow things down enough to avoid 
> > tripping the bug.  And the 'dmesg' command in kdb appears to be working 
> > now (a couple months back it wasn't).  Yay!
> 
> For me, dmesg has been working, but I'd like to know how to
> truncate the output to just, say, the last 200 lines.  (Maybe
> there is one.)

Me too!  Not sure it does that..
 
> > Anyway, this might be useful in tracking down other bugs as well...
> 
> Yes, this is good news.
> 
> 					-Alex
> 
> 

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2012-10-24 18:32 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-10-24 17:11 getting kernel debug output Sage Weil
2012-10-24 18:14 ` Alex Elder
2012-10-24 18:25   ` David Zafman
2012-10-24 18:32   ` Sage Weil

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.