Linux NFS development
 help / color / mirror / Atom feed
* [PATCH RFC 0/9] nfs/sunrpc: stop holding netns references in client-side NFS and RPC objects
@ 2025-03-17 20:59 Jeff Layton
  2025-03-17 20:59 ` [PATCH RFC 1/9] sunrpc: transplant shutdown_client() to sunrpc module Jeff Layton
                   ` (9 more replies)
  0 siblings, 10 replies; 18+ messages in thread
From: Jeff Layton @ 2025-03-17 20:59 UTC (permalink / raw)
  To: Trond Myklebust, Anna Schumaker, Chuck Lever, Neil Brown,
	Olga Kornievskaia, Dai Ngo, Tom Talpey, David S. Miller,
	Eric Dumazet, Jakub Kicinski, Paolo Abeni, Simon Horman
  Cc: Josef Bacik, Benjamin Coddington, linux-nfs, linux-kernel, netdev,
	Jeff Layton

We have a long-standing problem with containers that have NFS mounts in
them. Best practice is to unmount gracefully, of course, but sometimes
containers just spontaneously die (e.g. SIGSEGV in the init task in the
container). When that happens the orchestrator will see that all of the
tasks are dead, and will detach the mount namespace and kill off the
network connection.

If there are RPCs in flight at the time, the rpc_clnt will try to
retransmit them indefinitely, but there is no hope of them ever
contacting the server since nothing in userland can reach the netns
at that point to fix anything.

This patchset takes the approach of changing various nfs client and
sunrpc objects to not hold a netns reference. Instead, when a nfs_net or
sunrpc_net is exiting, all nfs_server, nfs_client and rpc_clnt objects
associated with it are shut down, and the pre_exit functions block
until they are gone.

With this approach, when the last userland task in the container exits,
the NFS and RPC clients get cleaned up automatically. As a bonus, this
fixes another bug with the gssproxy RPC client that causes net namespace
leaks in any container where it runs (details in the patch
descriptions).

Signed-off-by: Jeff Layton <jlayton@kernel.org>
---
Jeff Layton (9):
      sunrpc: transplant shutdown_client() to sunrpc module
      lockd: add a helper to shut down rpc_clnt in nlm_host
      lockd: don't #include debug.h from lockd.h
      nfs: transplant nfs_server shutdown into a helper function
      nfs: don't hold a reference to struct net in struct nfs_client
      auth_gss: shut down gssproxy rpc_clnt in net pre_exit
      auth_gss: don't hold a net reference in gss_auth
      sunrpc: don't hold a struct net reference in rpc_xprt
      sunrpc: don't upgrade passive net reference in xs_create_sock

 fs/lockd/clnt4xdr.c                |  1 +
 fs/lockd/clntlock.c                |  1 +
 fs/lockd/clntproc.c                |  1 +
 fs/lockd/clntxdr.c                 |  1 +
 fs/lockd/host.c                    |  8 ++++++++
 fs/lockd/mon.c                     |  1 +
 fs/lockd/svc.c                     |  1 +
 fs/lockd/svc4proc.c                |  1 +
 fs/lockd/svclock.c                 |  1 +
 fs/lockd/svcproc.c                 |  1 +
 fs/lockd/svcsubs.c                 |  1 +
 fs/nfs/client.c                    |  6 ++++--
 fs/nfs/inode.c                     | 28 ++++++++++++++++++++++++++++
 fs/nfs/internal.h                  |  1 +
 fs/nfs/super.c                     | 18 ++++++++++++++++++
 fs/nfs/sysfs.c                     | 27 ++-------------------------
 include/linux/lockd/lockd.h        |  2 +-
 include/linux/sunrpc/sched.h       |  1 +
 include/linux/sunrpc/svcauth_gss.h |  1 +
 include/linux/sunrpc/xprt.h        |  1 -
 net/sunrpc/auth_gss/auth_gss.c     | 15 ++++++++-------
 net/sunrpc/auth_gss/svcauth_gss.c  |  7 ++++++-
 net/sunrpc/clnt.c                  | 14 ++++++++++++++
 net/sunrpc/sunrpc_syms.c           | 29 +++++++++++++++++++++++++++++
 net/sunrpc/xprt.c                  |  3 +--
 net/sunrpc/xprtsock.c              |  3 ---
 26 files changed, 132 insertions(+), 42 deletions(-)
---
base-commit: 80e54e84911a923c40d7bee33a34c1b4be148d7a
change-id: 20250317-rpc-shutdown-1519aacd1db3

Best regards,
-- 
Jeff Layton <jlayton@kernel.org>


^ permalink raw reply	[flat|nested] 18+ messages in thread

end of thread, other threads:[~2025-03-18 11:30 UTC | newest]

Thread overview: 18+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2025-03-17 20:59 [PATCH RFC 0/9] nfs/sunrpc: stop holding netns references in client-side NFS and RPC objects Jeff Layton
2025-03-17 20:59 ` [PATCH RFC 1/9] sunrpc: transplant shutdown_client() to sunrpc module Jeff Layton
2025-03-17 20:59 ` [PATCH RFC 2/9] lockd: add a helper to shut down rpc_clnt in nlm_host Jeff Layton
2025-03-17 20:59 ` [PATCH RFC 3/9] lockd: don't #include debug.h from lockd.h Jeff Layton
2025-03-17 20:59 ` [PATCH RFC 4/9] nfs: transplant nfs_server shutdown into a helper function Jeff Layton
2025-03-17 20:59 ` [PATCH RFC 5/9] nfs: don't hold a reference to struct net in struct nfs_client Jeff Layton
2025-03-17 20:59 ` [PATCH RFC 6/9] auth_gss: shut down gssproxy rpc_clnt in net pre_exit Jeff Layton
2025-03-17 20:59 ` [PATCH RFC 7/9] auth_gss: don't hold a net reference in gss_auth Jeff Layton
2025-03-17 21:00 ` [PATCH RFC 8/9] sunrpc: don't hold a struct net reference in rpc_xprt Jeff Layton
2025-03-17 21:00 ` [PATCH RFC 9/9] sunrpc: don't upgrade passive net reference in xs_create_sock Jeff Layton
2025-03-17 21:28   ` Trond Myklebust
2025-03-17 21:36     ` Jeff Layton
2025-03-17 21:37       ` Trond Myklebust
2025-03-17 21:41         ` Jeff Layton
2025-03-17 21:35 ` [PATCH RFC 0/9] nfs/sunrpc: stop holding netns references in client-side NFS and RPC objects Trond Myklebust
2025-03-17 21:57   ` Jeff Layton
2025-03-17 22:11     ` Trond Myklebust
2025-03-18 11:30       ` Jeff Layton

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox