Linux NFS development
 help / color / mirror / Atom feed
* [PATCH 0/1] pNFS/flexfiles: mark device unavailable on fatal connection error
@ 2025-06-09 21:43 Tigran Mkrtchyan
  2025-06-09 21:43 ` [PATCH 1/1] " Tigran Mkrtchyan
  0 siblings, 1 reply; 5+ messages in thread
From: Tigran Mkrtchyan @ 2025-06-09 21:43 UTC (permalink / raw)
  To: linux-nfs; +Cc: Trond Myklebust, Anna Schumaker, Tigran Mkrtchyan

As mentioned in the thread 

https://lore.kernel.org/linux-nfs/601285843.50695650.1748800817824.JavaMail.zimbra@desy.de/T/#u


We observe that interrupted batch processing jobs put the client into an unrecoverable state that requires
the client host reboot. Finally, I was able to build a custom kernel with all required third-party drivers to prove
my assumption. So indeed, marking pNFS device unavailable fixes the issue. Thus, please consider the proposed
change and backport it to older kernels. I did testing with (which is not part of the patch) and will try to
add a trace point as soon as I find out how to implement one.

Tigran Mkrtchyan (1):
  pNFS/flexfiles: mark device unavailable on fatal connection error

 fs/nfs/flexfilelayout/flexfilelayoutdev.c | 4 ++++
 1 file changed, 4 insertions(+)

-- 
2.49.0


^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2025-06-26  9:17 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2025-06-09 21:43 [PATCH 0/1] pNFS/flexfiles: mark device unavailable on fatal connection error Tigran Mkrtchyan
2025-06-09 21:43 ` [PATCH 1/1] " Tigran Mkrtchyan
2025-06-25 19:19   ` Mkrtchyan, Tigran
2025-06-25 19:39     ` Trond Myklebust
2025-06-26  9:17       ` Mkrtchyan, Tigran

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox