From mboxrd@z Thu Jan 1 00:00:00 1970
From: Greg Kroah-Hartman
To: linux-kernel@vger.kernel.org
Cc: Greg Kroah-Hartman, stable@vger.kernel.org, Xiubo Li, Jeff Layton, Ilya Dryomov, Sasha Levin
Subject: [PATCH 5.5 355/399] ceph: check availability of mds cluster on mount after wait timeout
Date: Fri, 21 Feb 2020 08:41:20 +0100
Message-Id: <20200221072435.393145622@linuxfoundation.org>
X-Mailer: git-send-email 2.25.1
In-Reply-To: <20200221072402.315346745@linuxfoundation.org>
References: <20200221072402.315346745@linuxfoundation.org>
User-Agent: quilt/0.66
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Sender: stable-owner@vger.kernel.org
Precedence: bulk
X-Mailing-List: stable@vger.kernel.org

From: Xiubo Li

[ Upstream commit 97820058fb2831a4b203981fa2566ceaaa396103 ]

If all the MDS daemons are down for some reason, then the first mount
attempt will fail with EIO after the mount request times out. A mount
attempt will also fail with EIO if all of the MDSs are laggy.

This patch changes the code to return -EHOSTUNREACH in these situations
and adds a pr_info error message to help the admin determine the cause.

URL: https://tracker.ceph.com/issues/4386
Signed-off-by: Xiubo Li
Reviewed-by: Jeff Layton
Signed-off-by: Ilya Dryomov
Signed-off-by: Sasha Levin
---
 fs/ceph/mds_client.c | 3 +--
 fs/ceph/super.c      | 5 +++++
 2 files changed, 6 insertions(+), 2 deletions(-)

diff --git a/fs/ceph/mds_client.c b/fs/ceph/mds_client.c
index 145d46ba25ae2..816d49aed96bc 100644
--- a/fs/ceph/mds_client.c
+++ b/fs/ceph/mds_client.c
@@ -2558,8 +2558,7 @@ static void __do_request(struct ceph_mds_client *mdsc,
 		if (!(mdsc->fsc->mount_options->flags &
 		      CEPH_MOUNT_OPT_MOUNTWAIT) &&
 		    !ceph_mdsmap_is_cluster_available(mdsc->mdsmap)) {
-			err = -ENOENT;
-			pr_info("probably no mds server is up\n");
+			err = -EHOSTUNREACH;
 			goto finish;
 		}
 	}
diff --git a/fs/ceph/super.c b/fs/ceph/super.c
index 9b5536451528b..5a708ac9a54c3 100644
--- a/fs/ceph/super.c
+++ b/fs/ceph/super.c
@@ -1066,6 +1066,11 @@ static int ceph_get_tree(struct fs_context *fc)
 	return 0;
 
 out_splat:
+	if (!ceph_mdsmap_is_cluster_available(fsc->mdsc->mdsmap)) {
+		pr_info("No mds server is up or the cluster is laggy\n");
+		err = -EHOSTUNREACH;
+	}
+
 	ceph_mdsc_close_sessions(fsc->mdsc);
 	deactivate_locked_super(sb);
 	goto out_final;
-- 
2.20.1
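
For callers on the userspace side, the errno change in this patch could be
handled roughly as follows. This is a minimal sketch, assuming the
-EHOSTUNREACH set in ceph_get_tree() reaches the caller of mount(2) as
errno. Both helper names are hypothetical; they are not part of this patch,
the kernel, or any Ceph tool.

```c
#include <errno.h>
#include <string.h>

/*
 * Hypothetical helper (not from the patch): returns 1 when a failed CephFS
 * mount reported the errno this patch introduces for an unavailable or
 * laggy MDS cluster, 0 for any other error.
 */
static int mount_error_means_mds_unavailable(int err)
{
	return err == EHOSTUNREACH;
}

/*
 * Hypothetical helper (not from the patch): maps the errno of a failed
 * mount to a message, mirroring the wording of the new pr_info() line for
 * the EHOSTUNREACH case and falling back to strerror() otherwise.
 */
static const char *describe_ceph_mount_error(int err)
{
	if (mount_error_means_mds_unavailable(err))
		return "no MDS is up or the MDS cluster is laggy";
	return strerror(err);
}
```

A caller would pass the errno saved right after a failed mount(2), for
example describe_ceph_mount_error(errno), and could retry later when
mount_error_means_mds_unavailable() is true instead of treating the failure
as a generic I/O error.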