public inbox for virtualization@lists.linux-foundation.org
 help / color / mirror / Atom feed
From: Petr Mladek via Virtualization <virtualization@lists.linux-foundation.org>
To: "Seth Forshee (DigitalOcean)" <sforshee@digitalocean.com>
Cc: Joe Lawrence <joe.lawrence@redhat.com>,
	kvm@vger.kernel.org, "Michael S. Tsirkin" <mst@redhat.com>,
	netdev@vger.kernel.org, Jiri Kosina <jikos@kernel.org>,
	linux-kernel@vger.kernel.org,
	virtualization@lists.linux-foundation.org,
	live-patching@vger.kernel.org, Miroslav Benes <mbenes@suse.cz>,
	Josh Poimboeuf <jpoimboe@kernel.org>
Subject: Re: [PATCH 0/2] vhost: improve livepatch switching for heavily loaded vhost worker kthreads
Date: Fri, 27 Jan 2023 12:19:03 +0100	[thread overview]
Message-ID: <Y9OzJzHIASUeIrzO@alley> (raw)
In-Reply-To: <Y9LswwnPAf+nOVFG@do-x1extreme>

On Thu 2023-01-26 15:12:35, Seth Forshee (DigitalOcean) wrote:
> On Thu, Jan 26, 2023 at 06:03:16PM +0100, Petr Mladek wrote:
> > On Fri 2023-01-20 16:12:20, Seth Forshee (DigitalOcean) wrote:
> > > We've fairly regularaly seen liveptches which cannot transition within kpatch's
> > > timeout period due to busy vhost worker kthreads.
> > 
> > I have missed this detail. Miroslav told me that we have solved
> > something similar some time ago, see
> > https://lore.kernel.org/all/20220507174628.2086373-1-song@kernel.org/
> 
> Interesting thread. I had thought about something along the lines of the
> original patch, but there are some ideas in there that I hadn't
> considered.

Could you please provide some more details about the test system?
Is there anything important to make it reproducible?

The following aspects come to my mind. It might require:

   + more workers running on the same system
   + have a dedicated CPU for the worker
   + livepatching the function called by work->fn()
   + running the same work again and again
   + huge and overloaded system


> > Honestly, kpatch's timeout 1 minute looks incredible low to me. Note
> > that the transition is tried only once per minute. It means that there
> > are "only" 60 attempts.
> > 
> > Just by chance, does it help you to increase the timeout, please?
> 
> To be honest my test setup reproduces the problem well enough to make
> KLP wait significant time due to vhost threads, but it seldom causes it
> to hit kpatch's timeout.
> 
> Our system management software will try to load a patch tens of times in
> a day, and we've seen real-world cases where patches couldn't load
> within kpatch's timeout for multiple days. But I don't have such an
> environment readily accessible for my own testing. I can try to refine
> my test case and see if I can get it to that point.

My understanding is that you try to load the patch repeatedly but
it always fails after the 1 minute timeout. It means that it always
starts from the beginning (no livepatched process).

Is there any chance to try it with a longer timeout, for example, one
hour? It should increase the chance if there are more problematic kthreads.

> > This low timeout might be useful for testing. But in practice, it does
> > not matter when the transition is lasting one hour or even longer.
> > It takes much longer time to prepare the livepatch.
> 
> Agreed. And to be clear, we cope with the fact that patches may take
> hours or even days to get applied in some cases. The patches I sent are
> just about improving the only case I've identified which has lead to
> kpatch failing to load a patch for a day or longer.

If it is acceptable to wait hours or even days then the 1 minute
timeout is quite contra-productive. We actually do not use any timeout
at all in livepatches provided by SUSE.

Best Regards,
Petr
_______________________________________________
Virtualization mailing list
Virtualization@lists.linux-foundation.org
https://lists.linuxfoundation.org/mailman/listinfo/virtualization

  parent reply	other threads:[~2023-01-27 11:19 UTC|newest]

Thread overview: 16+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <20230120-vhost-klp-switching-v1-0-7c2b65519c43@kernel.org>
2023-01-22  8:34 ` [PATCH 0/2] vhost: improve livepatch switching for heavily loaded vhost worker kthreads Michael S. Tsirkin
     [not found] ` <20230120-vhost-klp-switching-v1-2-7c2b65519c43@kernel.org>
2023-01-24 14:17   ` [PATCH 2/2] vhost: check for pending livepatches from " Petr Mladek via Virtualization
     [not found]     ` <Y9ATo5FukOhphwqT@do-x1extreme>
2023-01-25 11:34       ` Petr Mladek via Virtualization
     [not found]         ` <Y9FfenH/p3qzRlar@do-x1extreme>
2023-01-26 11:16           ` Petr Mladek via Virtualization
2023-01-26 11:49             ` Petr Mladek via Virtualization
2023-01-26 17:03 ` [PATCH 0/2] vhost: improve livepatch switching for heavily loaded " Petr Mladek via Virtualization
     [not found]   ` <Y9LswwnPAf+nOVFG@do-x1extreme>
     [not found]     ` <20230127044355.frggdswx424kd5dq@treble>
2023-01-27 10:37       ` Peter Zijlstra
2023-01-27 12:09         ` Petr Mladek via Virtualization
     [not found]         ` <20230127165236.rjcp6jm6csdta6z3@treble>
     [not found]           ` <20230127170946.zey6xbr4sm4kvh3x@treble>
     [not found]             ` <20230127221131.sdneyrlxxhc4h3fa@treble>
2023-01-30 12:40               ` Peter Zijlstra
2023-01-30 18:36                 ` Mark Rutland
     [not found]                   ` <20230130194823.6y3rc227bvsgele4@treble>
2023-01-31 10:22                     ` Mark Rutland
     [not found]                       ` <20230131163832.z46ihurbmjcwuvck@treble>
2023-02-01 11:10                         ` Mark Rutland
     [not found]                           ` <20230201165727.lnywx6zyefbqbrke@treble>
2023-02-01 17:11                             ` Mark Rutland
     [not found]                 ` <20230130195930.s5iu76e56j4q5bra@treble>
2023-01-31 10:02                   ` Peter Zijlstra
2023-01-27 11:19     ` Petr Mladek via Virtualization [this message]
     [not found]       ` <Y9PmZFBEwUBwV3s/@do-x1extreme>
2023-01-30  9:55         ` Petr Mladek via Virtualization

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=Y9OzJzHIASUeIrzO@alley \
    --to=virtualization@lists.linux-foundation.org \
    --cc=jikos@kernel.org \
    --cc=joe.lawrence@redhat.com \
    --cc=jpoimboe@kernel.org \
    --cc=kvm@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=live-patching@vger.kernel.org \
    --cc=mbenes@suse.cz \
    --cc=mst@redhat.com \
    --cc=netdev@vger.kernel.org \
    --cc=pmladek@suse.com \
    --cc=sforshee@digitalocean.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox