All of lore.kernel.org
 help / color / mirror / Atom feed
From: Sasha Levin <sashal@kernel.org>
To: "Manthey, Norbert" <nmanthey@amazon.de>
Cc: "stable@vger.kernel.org" <stable@vger.kernel.org>
Subject: Re: [PATCH 6.1.y 0/1] Backporting patches with git-llm-pick
Date: Tue, 2 Sep 2025 13:11:39 -0400	[thread overview]
Message-ID: <aLclSwgwIhaMjE88@laps> (raw)
In-Reply-To: <f0f5fd8da13d000355166d9eb87e24ddc1b8fa70.camel@amazon.de>

On Tue, Sep 02, 2025 at 04:22:40PM +0000, Manthey, Norbert wrote:
>On Tue, 2025-09-02 at 07:48 -0400, Sasha Levin wrote:
>> One note about the tool: in my experience, unless the tool can also act as an
>> agent and investigate the relevant git repo (and attempt builds and run tests)
>> on it's own, the results used to be very lackluster.
>
>I agree in general. On the other hand, we want to keep the amount of work done by
>the LLM or agent small. For now, we only submit a bit of context and the commit
>messages. The validation is executed by the application independently of the
>agent. There is no feedback loop yet, or similar -- that could all be done in the
>agent-stage. We have a few more filters and limits to only process commits that
>are likely to be finished successfully by an LLM.

Consider a simple backport example: let's say that upstream we see a patch that
does something like:

   mutex_lock(&m);
- old_func();
+ new_func();
   mutex_unlock(&m);

But when we look at an older tree, we see:

   spin_lock(&l);
   old_func();
   spin_unlock(&l);

If you don't pass massive amounts of context in, there's no way for an LLM to
know if it's safe to simply replace old_func() with new_func() in the old code.
Most LLMs I played with will just go ahead and do that.

A human backporter (and most likely, an AI agent) would have a lightbulb moment
where they go look at new_func() to see if it's safe to be called under a
spinlock.

I guess that my point is that at this level for this usecase, LLMs don't end up
being much better than using something like wiggle[1].


[1] https://github.com/neilbrown/wiggle

-- 
Thanks,
Sasha

  reply	other threads:[~2025-09-02 17:11 UTC|newest]

Thread overview: 15+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-01-11 16:19 FAILED: patch "[PATCH] fs: relax assertions on failure to encode file handles" failed to apply to 6.1-stable tree gregkh
2025-09-01 15:35 ` [PATCH 6.1.y 0/1] Backporting patches with git-llm-pick Norbert Manthey
2025-09-01 15:35   ` [PATCH 6.1.y 1/1] fs: relax assertions on failure to encode file handles Norbert Manthey
2025-09-01 15:51     ` Amir Goldstein
2025-09-01 19:54     ` Greg Kroah-Hartman
2025-09-01 20:00       ` Greg Kroah-Hartman
2025-09-02  7:20         ` Manthey, Norbert
2025-09-02  7:29           ` Amir Goldstein
2025-09-02  8:58             ` gregkh
2025-09-02  9:02           ` gregkh
2025-09-02 11:48   ` [PATCH 6.1.y 0/1] Backporting patches with git-llm-pick Sasha Levin
2025-09-02 16:22     ` Manthey, Norbert
2025-09-02 17:11       ` Sasha Levin [this message]
2025-09-04  9:21 ` [PATCH 6.1.y v2] fs: relax assertions on failure to encode file handles Norbert Manthey
2025-09-07  8:00   ` Greg KH

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=aLclSwgwIhaMjE88@laps \
    --to=sashal@kernel.org \
    --cc=nmanthey@amazon.de \
    --cc=stable@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.