From: "Jim Schutt" <jaschut@sandia.gov>
To: Greg Farnum <greg@inktank.com>
Cc: ceph-devel@vger.kernel.org, Sage Weil <sage@inktank.com>,
Wido den Hollander <wido@42on.com>
Subject: Re: CephFS Space Accounting and Quotas
Date: Mon, 11 Mar 2013 14:40:04 -0600 [thread overview]
Message-ID: <513E4124.9040309@sandia.gov> (raw)
In-Reply-To: <90B756938EF64998B9230F4E48620CDE@inktank.com>
On 03/11/2013 10:57 AM, Greg Farnum wrote:
> On Monday, March 11, 2013 at 9:48 AM, Jim Schutt wrote:
>> On 03/11/2013 09:48 AM, Greg Farnum wrote:
>>> On Monday, March 11, 2013 at 7:47 AM, Jim Schutt wrote:
>>>>
>>>> For this run, the MDS logging slowed it down enough to cause the
>>>> client caps to occasionally go stale. I don't think it's the cause
>>>> of the issue, because I was having it before I turned MDS debugging
>>>> up. My client caps never go stale at, e.g., debug mds 5.
>>>
>>>
>>>
>>> Oh, so this might be behaviorally different than you were seeing before? Drat.
>>>
>>> You had said before that each newfstatat was taking tens of seconds,
>>> whereas in the strace log you sent along most of the individual calls
>>> were taking a bit less than 20 milliseconds. Do you have an strace of
>>> them individually taking much more than that, or were you just
>>> noticing that they took a long time in aggregate?
>>
>>
>>
>> When I did the first strace, I didn't turn on timestamps, and I was
>> watching it scroll by. I saw several stats in a row take ~30 secs,
>> at which point I got bored, and took a look at the strace man page to
>> figure out how to get timestamps ;)
>>
>> Also, another difference is for that test, I was looking at files
>> I had written the day before, whereas for the strace log I sent,
>> there was only several minutes between writing and the strace of find.
>>
>> I thought I had eliminated the page cache issue by using fdatasync
>> when writing the files. Perhaps the real issue is affected by that
>> delay?
>
> I'm not sure. I can't think of any mechanism by which waiting longer
> would increase the time lags, though, so I doubt it.
>
>>> I suppose if you were going to run it again then just the message
>>> logging could also be helpful. That way we could at least check and
>>> see the message delays and if the MDS is doing other work in the
>>> course of answering a request.
>>
>>
>>
>> I can do as many trials as needed to isolate the issue.
>>
>> What message debugging level is sufficient on the MDS; 1?
> Yep, that will capture all incoming and outgoing messages. :)
>>
>> If you want I can attempt to duplicate my memory of the first
>> test I reported, writing the files today and doing the strace
>> tomorrow (with timestamps, this time).
>>
>> Also, would it be helpful to write the files with minimal logging, in
>> hopes of inducing minimal timing changes, then upping the logging
>> for the stat phase?
>
> Well that would give us better odds of not introducing failures of
> any kind during the write phase, and then getting accurate
> information on what's happening during the stats, so it probably
> would. Basically I'd like as much logging as possible without
> changing the states they system goes through. ;)
Hmmm, this is getting more interesting...
I just did two complete trials where I built a file system,
did two sets of writes with minimal MDS logging, then
turned MDS logging up to 20 with MDS ms logging at 1 for
the stat phase.
In each trial my strace'd find finished in < 10 seconds,
and there were no slow stat calls (they were taking ~19 ms
each).
I'm going to do a third trial where I let things rest
overnight, after I write the files. That delay is the
only thing I haven't reproduced from my first trial....
-- Jim
> -Greg
>
>
>
next prev parent reply other threads:[~2013-03-11 20:41 UTC|newest]
Thread overview: 31+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <sfid-H20130305-170326-+024.05-1@marduk.tchpc.tcd.ie>
2013-03-05 17:03 ` CephFS First product release discussion Greg Farnum
2013-03-05 18:08 ` Wido den Hollander
2013-03-05 18:17 ` Greg Farnum
2013-03-05 18:28 ` Sage Weil
2013-03-05 18:36 ` Wido den Hollander
2013-03-05 18:48 ` Jim Schutt
2013-03-05 19:33 ` Sage Weil
2013-03-06 17:24 ` Wido den Hollander
2013-03-06 19:07 ` Jim Schutt
2013-03-06 19:13 ` CephFS Space Accounting and Quotas (was: CephFS First product release discussion) Greg Farnum
2013-03-06 19:58 ` CephFS Space Accounting and Quotas Jim Schutt
2013-03-06 20:21 ` Greg Farnum
2013-03-06 21:28 ` Jim Schutt
2013-03-06 21:39 ` Greg Farnum
2013-03-06 23:14 ` Jim Schutt
2013-03-07 0:18 ` Greg Farnum
2013-03-07 15:15 ` Jim Schutt
2013-03-08 22:45 ` Jim Schutt
2013-03-09 2:05 ` Greg Farnum
2013-03-11 14:47 ` Jim Schutt
2013-03-11 15:48 ` Greg Farnum
2013-03-11 16:48 ` Jim Schutt
2013-03-11 16:57 ` Greg Farnum
2013-03-11 20:40 ` Jim Schutt [this message]
2013-03-12 22:34 ` Jim Schutt
[not found] ` <513FAE0F.2010608@sandia.gov>
[not found] ` <BE627BF4B6E74BD49037D07821FC1DB9@inktank.com>
[not found] ` <5143AA84.50409@sandia.gov>
2013-03-15 23:17 ` Greg Farnum
2013-03-18 14:19 ` Jim Schutt
2013-03-06 21:42 ` Sage Weil
2013-03-06 5:01 ` [ceph-users] CephFS First product release discussion Neil Levine
[not found] ` <CANygib-U_MQi1TMmQuT_Q9MVwPfT+PzJwN=+BMcBK69WuRfu3w-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2013-03-07 13:11 ` Félix Ortega Hortigüela
[not found] ` <E0B1337A572647BA9FCC0CE8CA946F42-4GqslpFJ+cxBDgjK7y7TUQ@public.gmane.org>
2013-03-07 11:54 ` Jimmy Tang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=513E4124.9040309@sandia.gov \
--to=jaschut@sandia.gov \
--cc=ceph-devel@vger.kernel.org \
--cc=greg@inktank.com \
--cc=sage@inktank.com \
--cc=wido@42on.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.