From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from mail-qk1-f182.google.com (mail-qk1-f182.google.com [209.85.222.182]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 344BF30FF1E for ; Wed, 7 Jan 2026 17:19:23 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.222.182 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1767806368; cv=none; b=QfTj1uIhx9v8bD5ERXMobsMUB0auhnMME8oP92Jz2ioKdMulbdH60TLm1ERYiIcVJkIhphcba7464lu4Ncti/atRx8ruQ+JBJ0Lz9XxhIxK7CLyeZZbzVo1XQq18Vh3fvQpsQwzJYSXrt8Xz0S+dMYxYOnCPf+BBBr3ZCBeTR8I= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1767806368; c=relaxed/simple; bh=dEOXcM0qLcGyYDdpYrpwaPuAKh0TeDT6z74M9N7a6x8=; h=From:To:Cc:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version; b=DLddrF78mgtycT1MonIcOZCuWe1tWn/EDc4EdN4emgyLCWknpXNjMGk2+otbQTb4QKhZJgo0w2W2YFJMLbFWNm22ELeltXSi+kZ8546Hikn8584I8YiX4qr/GtJoYNTNzt36/uCOhVgwwqeBy7c2D7vONuFWhtVfMJ3vgOVCLgE= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=Groves.net; spf=pass smtp.mailfrom=gmail.com; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b=NDjfi4A/; arc=none smtp.client-ip=209.85.222.182 Authentication-Results: smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=Groves.net Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=gmail.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b="NDjfi4A/" Received: by mail-qk1-f182.google.com with SMTP id af79cd13be357-8b2a4b6876fso310944185a.3 for ; Wed, 07 Jan 2026 09:19:23 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1767806361; x=1768411161; darn=vger.kernel.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:sender:from:to:cc:subject:date :message-id:reply-to; bh=IsUBbcgm6GmRe1pvdN9mqC0Y9Ykr2Q9MySmIE+mClu8=; b=NDjfi4A/GzQo0RWtRAgVKFwne1uzChKPhtgDJmlo8Nd+9Vq2CEHJjp7FJNhK+G2sJg KXIAcVrBRV3o2eeUDFI089vrZKMAsCeVDSRVH5UUn0MMy4wGoT+OXN3Jma9x5ISTviZV 6PioGKlEyor/QvzE0vG+lpnZrX1RODw0Gu7mkXcK0um09e3sBuJkg2ncjyeJl9smVgXE TVQHLRglXDgjv++7+mxsKi+71NxlNTZy8nhbu3D9zIQYYqrSrqAbqzpKhHJ5nUGET8rW iSR35X8sTaZ7u8QXrbNmfvilolRJXwkTlObTs/turbaW5Y+f84aJK+d6rtPb/jYHjebz y/ag== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1767806361; x=1768411161; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:sender:x-gm-gg :x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=IsUBbcgm6GmRe1pvdN9mqC0Y9Ykr2Q9MySmIE+mClu8=; b=VP4uun5pXY2ZTPwfx2hl2CI1rscG0sPprjxlt3BZQfxRrVOjEnkF6C11r4yaJF+0MY Fr2Is4Nh1aSJWBCxolg/5lfhTGQmTRKEHB4nhzxl4rWrshnKoEFXTal0XchiUdAtjWUb wIMYyM0TySbhzwi6ISicAYadlvG45INcQ90PeW2oxkSYMWcUnIiwx/1kEkC1qGNpYKRs dfP1O1HNv+uIzjKm8ETXlJ/enua7km9s5KASFCrdF1cYK+prP+swC9aR5yc+CB5x8oBg yR+cHhwGEPBhOPfy4LWWsfIjstmtv19zZZmKzhcG0Y8uAxrAeHiPoan1MpQ/jpK0rpaD afig== X-Forwarded-Encrypted: i=1; AJvYcCVZnlzR/apAMRApz9xIcNn0ImHMp7fP3wrNyE+ln1MnS2okXSQsPrQ8C3TqR012F1OeQB+MqJsF0Tw=@vger.kernel.org X-Gm-Message-State: AOJu0Yzu0Ov5WnQkKr/uGwEWaa/aXsXETQSnW944jHWbNcdpGSEIvHSx 2gUPqMUqRJxgtpFFHY65JhnyRG/gC06iMaif7FUiEXIeYiKzW6Eylkg+B52xKw== X-Gm-Gg: AY/fxX4om87SN1sQ097dsXrrxhRfhW6yU5Q8arBBc++1IZLplP8xf3hbTzzwUdPS9nV jwcpa2QQJ69GL7n6GqkOzFyBqUGwjPf4+YSAqppU3qgaul/MqAoh3irDRftxMCvqMIlJjGv1PRa LEN+Qe0e80o1aX0Eej3JEtyKtPkGGdQ0Pb45da+ddCMLupGDntCSG5gXay1Wu/H1KXzAj7Q0hVr 5pxZdLd2uEB7VECw+oKGW2ggeN0jj13r/yoYaymOC6MXwclnSwFaIzR77H3j+c6REBZPO2ILQHC lx7I2QH+Wz0XWU4vGnUpssuwkEAKCF01eLkDGuXsragJiEDduYVzgoYcI+nMf1K0nok3c6EPlNY MkksbL9G4K2xvQWKZS/fdD+iXDbQJQ0sOemvtGJdj3q0HlAgR21Rg9REXNBxBri8IrBTKsIpvcJ 8SPlGE+4waYGQ0wV/shBKZR/HGHFulmzqgxuZWI8EeTJU3WO8kZoQJ16E= X-Google-Smtp-Source: AGHT+IEAvq5jIS7yNdaICnZjGGLTm+9F3WfQSWgCD5SfovqjmCa/xKQRtzE2sJgjYbpKAc4uVl4aqw== X-Received: by 2002:a05:6808:150f:b0:45a:5894:4979 with SMTP id 5614622812f47-45a6bdbcf78mr1522078b6e.20.1767800073716; Wed, 07 Jan 2026 07:34:33 -0800 (PST) Received: from localhost.localdomain ([2603:8080:1500:3d89:a917:5124:7300:7cef]) by smtp.gmail.com with ESMTPSA id 5614622812f47-45a5e2f1de5sm2398106b6e.22.2026.01.07.07.34.31 (version=TLS1_3 cipher=TLS_CHACHA20_POLY1305_SHA256 bits=256/256); Wed, 07 Jan 2026 07:34:33 -0800 (PST) Sender: John Groves From: John Groves X-Google-Original-From: John Groves To: John Groves , Miklos Szeredi , Dan Williams , Bernd Schubert , Alison Schofield Cc: John Groves , Jonathan Corbet , Vishal Verma , Dave Jiang , Matthew Wilcox , Jan Kara , Alexander Viro , David Hildenbrand , Christian Brauner , "Darrick J . Wong" , Randy Dunlap , Jeff Layton , Amir Goldstein , Jonathan Cameron , Stefan Hajnoczi , Joanne Koong , Josef Bacik , Bagas Sanjaya , Chen Linxuan , James Morse , Fuad Tabba , Sean Christopherson , Shivank Garg , Ackerley Tng , Gregory Price , Aravind Ramesh , Ajay Joshi , venkataravis@micron.com, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, nvdimm@lists.linux.dev, linux-cxl@vger.kernel.org, linux-fsdevel@vger.kernel.org, John Groves Subject: [PATCH V3 18/21] famfs_fuse: Add holder_operations for dax notify_failure() Date: Wed, 7 Jan 2026 09:33:27 -0600 Message-ID: <20260107153332.64727-19-john@groves.net> X-Mailer: git-send-email 2.50.1 In-Reply-To: <20260107153332.64727-1-john@groves.net> References: <20260107153244.64703-1-john@groves.net> <20260107153332.64727-1-john@groves.net> Precedence: bulk X-Mailing-List: linux-cxl@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Memory errors are at least somewhat more likely on disaggregated memory than on-board memory. This commit registers to be notified by fsdev_dax in the event that a memory failure is detected. When a file access resolves to a daxdev with memory errors, it will fail with an appropriate error. If a daxdev failed fs_dax_get(), we set dd->dax_err. If a daxdev called our notify_failure(), set dd->error. When any of the above happens, set (file)->error and stop allowing access. In general, the recovery from memory errors is to unmount the file system and re-initialize the memory, but there may be usable degraded modes of operation - particularly in the future when famfs supports file systems backed by more than one daxdev. In those cases, accessing data that is on a working daxdev can still work. For now, return errors for any file that has encountered a memory or dax error. Signed-off-by: John Groves --- fs/fuse/famfs.c | 115 +++++++++++++++++++++++++++++++++++++++--- fs/fuse/famfs_kfmap.h | 3 +- 2 files changed, 109 insertions(+), 9 deletions(-) diff --git a/fs/fuse/famfs.c b/fs/fuse/famfs.c index c02b14789c6e..4eb87c5c628e 100644 --- a/fs/fuse/famfs.c +++ b/fs/fuse/famfs.c @@ -20,6 +20,26 @@ #include "famfs_kfmap.h" #include "fuse_i.h" +static void famfs_set_daxdev_err( + struct fuse_conn *fc, struct dax_device *dax_devp); + +static int +famfs_dax_notify_failure(struct dax_device *dax_devp, u64 offset, + u64 len, int mf_flags) +{ + struct fuse_conn *fc = dax_holder(dax_devp); + + famfs_set_daxdev_err(fc, dax_devp); + + return 0; +} + +static const struct dax_holder_operations famfs_fuse_dax_holder_ops = { + .notify_failure = famfs_dax_notify_failure, +}; + +/*****************************************************************************/ + /* * famfs_teardown() * @@ -48,9 +68,12 @@ famfs_teardown(struct fuse_conn *fc) if (!dd->valid) continue; - /* Release reference from dax_dev_get() */ - if (dd->devp) + /* Only call fs_put_dax if fs_dax_get succeeded */ + if (dd->devp) { + if (!dd->dax_err) + fs_put_dax(dd->devp, fc); put_dax(dd->devp); + } kfree(dd->name); } @@ -174,6 +197,17 @@ famfs_fuse_get_daxdev(struct fuse_mount *fm, const u64 index) goto out; } + err = fs_dax_get(daxdev->devp, fc, &famfs_fuse_dax_holder_ops); + if (err) { + /* If fs_dax_get() fails, we don't attempt recovery; + * We mark the daxdev valid with dax_err + */ + daxdev->dax_err = 1; + pr_err("%s: fs_dax_get(%lld) failed\n", + __func__, (u64)daxdev->devno); + err = -EBUSY; + } + daxdev->name = kstrdup(daxdev_out.name, GFP_KERNEL); wmb(); /* all daxdev fields must be visible before marking it valid */ daxdev->valid = 1; @@ -254,6 +288,38 @@ famfs_update_daxdev_table( return 0; } +static void +famfs_set_daxdev_err( + struct fuse_conn *fc, + struct dax_device *dax_devp) +{ + int i; + + /* Gotta search the list by dax_devp; + * read lock because we're not adding or removing daxdev entries + */ + down_read(&fc->famfs_devlist_sem); + for (i = 0; i < fc->dax_devlist->nslots; i++) { + if (fc->dax_devlist->devlist[i].valid) { + struct famfs_daxdev *dd = &fc->dax_devlist->devlist[i]; + + if (dd->devp != dax_devp) + continue; + + dd->error = true; + up_read(&fc->famfs_devlist_sem); + + pr_err("%s: memory error on daxdev %s (%d)\n", + __func__, dd->name, i); + goto done; + } + } + up_read(&fc->famfs_devlist_sem); + pr_err("%s: memory err on unrecognized daxdev\n", __func__); + +done: +} + /***************************************************************************/ void @@ -611,6 +677,26 @@ famfs_file_init_dax( static ssize_t famfs_file_bad(struct inode *inode); +static int famfs_dax_err(struct famfs_daxdev *dd) +{ + if (!dd->valid) { + pr_err("%s: daxdev=%s invalid\n", + __func__, dd->name); + return -EIO; + } + if (dd->dax_err) { + pr_err("%s: daxdev=%s dax_err\n", + __func__, dd->name); + return -EIO; + } + if (dd->error) { + pr_err("%s: daxdev=%s memory error\n", + __func__, dd->name); + return -EHWPOISON; + } + return 0; +} + static int famfs_interleave_fileofs_to_daxofs(struct inode *inode, struct iomap *iomap, loff_t file_offset, off_t len, unsigned int flags) @@ -648,6 +734,7 @@ famfs_interleave_fileofs_to_daxofs(struct inode *inode, struct iomap *iomap, /* Is the data is in this striped extent? */ if (local_offset < ext_size) { + struct famfs_daxdev *dd; u64 chunk_num = local_offset / chunk_size; u64 chunk_offset = local_offset % chunk_size; u64 stripe_num = chunk_num / nstrips; @@ -656,6 +743,7 @@ famfs_interleave_fileofs_to_daxofs(struct inode *inode, struct iomap *iomap, u64 strip_offset = chunk_offset + (stripe_num * chunk_size); u64 strip_dax_ofs = fei->ie_strips[strip_num].ext_offset; u64 strip_devidx = fei->ie_strips[strip_num].dev_index; + int rc; if (strip_devidx >= fc->dax_devlist->nslots) { pr_err("%s: strip_devidx %llu >= nslots %d\n", @@ -670,6 +758,15 @@ famfs_interleave_fileofs_to_daxofs(struct inode *inode, struct iomap *iomap, goto err_out; } + dd = &fc->dax_devlist->devlist[strip_devidx]; + + rc = famfs_dax_err(dd); + if (rc) { + /* Shut down access to this file */ + meta->error = true; + return rc; + } + iomap->addr = strip_dax_ofs + strip_offset; iomap->offset = file_offset; iomap->length = min_t(loff_t, len, chunk_remainder); @@ -767,6 +864,7 @@ famfs_fileofs_to_daxofs(struct inode *inode, struct iomap *iomap, if (local_offset < dax_ext_len) { loff_t ext_len_remainder = dax_ext_len - local_offset; struct famfs_daxdev *dd; + int rc; if (daxdev_idx >= fc->dax_devlist->nslots) { pr_err("%s: daxdev_idx %llu >= nslots %d\n", @@ -777,11 +875,11 @@ famfs_fileofs_to_daxofs(struct inode *inode, struct iomap *iomap, dd = &fc->dax_devlist->devlist[daxdev_idx]; - if (!dd->valid || dd->error) { - pr_err("%s: daxdev=%lld %s\n", __func__, - daxdev_idx, - dd->valid ? "error" : "invalid"); - goto err_out; + rc = famfs_dax_err(dd); + if (rc) { + /* Shut down access to this file */ + meta->error = true; + return rc; } /* @@ -966,7 +1064,8 @@ famfs_file_bad(struct inode *inode) return -EIO; } if (meta->error) { - pr_debug("%s: previously detected metadata errors\n", __func__); + pr_debug("%s: previously detected metadata errors\n", + __func__); return -EIO; } if (i_size != meta->file_size) { diff --git a/fs/fuse/famfs_kfmap.h b/fs/fuse/famfs_kfmap.h index e76b9057a1e0..6a6420bdff48 100644 --- a/fs/fuse/famfs_kfmap.h +++ b/fs/fuse/famfs_kfmap.h @@ -73,7 +73,8 @@ struct famfs_file_meta { struct famfs_daxdev { /* Include dev uuid? */ bool valid; - bool error; + bool error; /* Dax has reported a memory error (probably poison) */ + bool dax_err; /* fs_dax_get() failed */ dev_t devno; struct dax_device *devp; char *name; -- 2.49.0