From: Nic Henke <nic@cray.com>
To: lustre-devel@lists.lustre.org
Subject: [Lustre-devel] replacing Lustre pings with LNet Peer Health
Date: Tue, 17 May 2011 09:27:43 -0500 [thread overview]
Message-ID: <4DD285DF.2000700@cray.com> (raw)
In-Reply-To: <9BC94E70-4EB6-49D9-8AA1-B07E1455E51D@whamcloud.com>
On 05/12/2011 12:27 PM, Andreas Dilger wrote:
> On May 12, 2011, at 08:57, Nic Henke wrote:
>> Just floating an idea... I'd much appreciate any feedback
>>
> One issue is that the Lustre OBD_PING RPC is not just detecting peer
> death. It is also reporting the last_committed value to the RPC
> stack, so that clients can discard RPCs that were committed on the
> server. It is also signalling to the server that this client is
> still alive, so that it doesn't get evicted. If there are LNET
> routers in a system, the LNET peer health will only report the health
> of the routers, and not of the clients or servers behind the routers,
> so this isn't going to result in a working Lustre filesystem...
>
Good point, I had missed this. Pesky "working" filesystems...
>> Eric - I know this doesn't get us that far down the road toward
>> your new health network, but does solve a near term issue with
>> pinger rates on large systems.
>
> There would need to be at least some of the health network
> implemented in order to "pass through" the peer health on the
> routers, and also to broadcast some of the data, like last_rcvd.
Yeah, not sure how I thinko'd the LNet Router case. We'd need to add
.lnd_notify into the LNDs and have them broadcast the failures at the
router level. Not exactly ideal, and I think the use of lnd_notify has
been dropped in favor of the newer LNet Peer Health.
Cheers,
Nic
next prev parent reply other threads:[~2011-05-17 14:27 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-05-12 14:57 [Lustre-devel] replacing Lustre pings with LNet Peer Health Nic Henke
2011-05-12 17:27 ` Andreas Dilger
2011-05-17 14:27 ` Nic Henke [this message]
2011-05-12 17:37 ` Christopher J. Morrone
2011-05-15 7:44 ` Alexey Lyashkov
2011-05-17 14:30 ` Nic Henke
2011-05-17 22:53 ` Isaac Huang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4DD285DF.2000700@cray.com \
--to=nic@cray.com \
--cc=lustre-devel@lists.lustre.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.