Re: [PATCH RFC] coding-assistants: simplify attribution

Linux Documentation
 help / color / mirror / Atom feed

From: Lorenzo Stoakes <ljs@kernel.org>
To: "David Hildenbrand (Arm)" <david@kernel.org>
Cc: Christian Brauner <brauner@kernel.org>,
	 Linus Torvalds <torvalds@linux-foundation.org>,
	Jonathan Corbet <corbet@lwn.net>, Jens Axboe <axboe@kernel.dk>,
	 Jeff Layton <jlayton@kernel.org>,
	Vlastimil Babka <vbabka@kernel.org>,
	workflows@vger.kernel.org,  linux-doc@vger.kernel.org,
	linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org
Subject: Re: [PATCH RFC] coding-assistants: simplify attribution
Date: Thu, 2 Jul 2026 13:49:07 +0100	[thread overview]
Message-ID: <akZYSGoysWSb0K1J@lucifer> (raw)
In-Reply-To: <54d3a698-a275-488e-ad36-ef423db30f70@kernel.org>

On Thu, Jul 02, 2026 at 01:51:10PM +0200, David Hildenbrand (Arm) wrote:
> On 7/2/26 12:04, Lorenzo Stoakes wrote:
> > (thanks for the cc-!)
> >
> > On Thu, Jul 02, 2026 at 09:46:37AM +0200, David Hildenbrand (Arm) wrote:
> >> On 7/2/26 09:27, Christian Brauner wrote:
> >>>
> >>> I think we should just drop any attribution as a general kernel-wide
> >>> rule and let subsystems require them as needed. Then you can have all
> >>> the complexity in mm for this that you think is needed for your
> >>> workflow to function. This is precisely what the subsystem profiles are
> >>> for. So maybe just add:
> >
> > A single comment is complexity?
>
> I think Christian meant more elaborate rules. More than just "If you used LLMs,
> disclose how you used them."

What's elaborate?

"Say how much of your patch is LLM written, here are some examples".

Surely?

>
> >>
> >> I'm not really sure if having (more?) subsystem-specific tags is the way to go.
> >> (below)
> >>
> >> So either we find a very simple, kernel-wide rule for such tags, or we drop them
> >> entirely.
> >
> > Yup I couldn't disagree more with Christian here, the whole thing feels like
> > trying to 'wish away' the AI issue, and now punting off to subsystem
> > maintainers...
> >
> > Subsystems impact each other. Right now I'm writing a series that changes driver
> > code so we can enforce some sanity in mm APIs.
> >
> > I've had to interact with fs code quite a bit that uses mm logic.
> >
> > It's all interconnected, and one subsystem let's say going with 'let it all in'
> > say, impacts another.
> >
> > Yes some people lie about it, but having the guidelines only STRENGTHENS our
> > position on that, and I've seen that in practice.
> >
> > So yeah, sorry, I think it's beyond silly to push back on requesting somebody
> > disclose how much of a patch/series was AI generated.
> >
> > And [0] already essentially says people NEED to do this now. But that doc has
> > been rather downplayed unfortunately I think.
>
> [...]
>
> >> I agree on the "enforce" aspect. It's impossible, but it's still easy to catch
> >> people using AI irresponsibly today ... and that's what we care about. Not
> >> people that know what they are doing using AI responsibly.
> >
> > For me it's about empowering maintainers to push back.
>
> Right, but I suspect maintainers do have this power already, it's just not
> exercised that often on obvious AI slop yet.

Well I certainly don't feel I do :)

I tried pushing back on obvious AI slop and got a huge amount of blow back for
it because the guy wasn't honest about it.

A key reason for me pushing back on the tooling documentation was precisely
because I felt we needed a clear means of doing this.

This being the part:

"As with the output of any tooling, the result may be incorrect or
inappropriate. You are expected to understand and to be able to defend
everything you submit. If you are unable to do so, then do not submit the
resulting changes.

If you do so anyway, maintainers are entitled to reject your series without
detailed review."

But if somebody denies it, no matter how strong the evidence, you can never
really 'prove' it.

I think honestly if there's a newcomer who suddenly out of nowhere does a huge
involved series in an area they've not touched before and LLMs assess it as 90%
likely to be LLM generated,and they reply making mistakes that only an LLM would
make (misinterpreting a field's symbol and then acting as if really exists) -
it's not unreasonable to cite these things as a reason to 'not really trust'
that it's their work.

Perhaps worded nicely to say 'sorry if I'm mistaken'?

All I'm really asking is for the ability to say something like "I reasonably
believe that this is generated, so we need to build more trust here, apologies
if I'm mistaken, but can we see some smaller patches in this area first" or
something like this.

>
> >
> >>
> >>>
> >>> If the information is mostly useful during review then I still would
> >>> question why it has to end up in our git logs. It's completely
> >>> irrelevant information imho.
> >>
> >> Fully agreed. In the tree it's irrelevant.
> >
> > Not sure about that, if it turns out AI-generated patches are causing 95% more
> > bugs say that's pretty useful information no?
>
> Well
>
> a) You don't know how much AI was used. In particular, it could just slip in as

Hence 'tell us how much was used' :)

> the submitter tries to untangle some of the mess the AI created (so not AI's
> fault). Or the submitter just used it to write+translate the patch description.
> Really, the tag itself doesn't tell you much as it stands, which is the biggest
> problem I am having with it.
>
> b) You don't catch all the cases where people didn't use the tag.

Is this arguing 'we don't have complete information so let's have no
information'? Because I would say something > nothing?

>
> >
> > Or if you find that a patch somebody sent from another subsystem that has a
> > lassez faire approach to AI slop completely breaks you in some subtle way, isn't
> > it easier to push for a revert if you see it's LLM-generated?
>
> The information would have to be had from the linked mailing list posting.

That's creating a lot more work for maintainers?

You could even figure out bug rate from Fixes: tags alone using metadata.

And yes it will be imperfect but something > nothing.

>
> Given that some subsystems already started suppressing the tags when applying
> patches, that doesn't really help ... :/

Well that's unfortunate. But something > nothing, again.

>
> >
> > And is it really that egregious to include a tag? You can ignore it if you don't
> > care.
>
> I hate the current tags as they are. The question I am asking myself: assume we
> stop using the Assisted-by for LLM stuff. What to do with the other tools? Why
> are LLMs suddenly no longer a tool to mention there.

Because it turns out it's useful to have this information and more information >
less information?

>
> --
> Cheers,
>
> David

Thanks, Lorenzo

next prev parent reply	other threads:[~2026-07-02 12:49 UTC|newest]

Thread overview: 56+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-07-01 15:54 [PATCH RFC] coding-assistants: simplify attribution Christian Brauner
2026-07-01 16:08 ` Mark Brown
2026-07-02  7:10   ` Christian Brauner
2026-07-02 11:35     ` Mark Brown
2026-07-01 16:08 ` Jonathan Corbet
2026-07-01 16:12   ` David Hildenbrand (Arm)
2026-07-02  7:11   ` Christian Brauner
2026-07-02  9:51   ` David Disseldorp
2026-07-01 16:10 ` David Hildenbrand (Arm)
2026-07-02  7:27   ` Christian Brauner
2026-07-02  7:46     ` David Hildenbrand (Arm)
2026-07-02  8:10       ` Laurent Pinchart
2026-07-02  8:16         ` David Hildenbrand (Arm)
2026-07-02 10:04       ` Lorenzo Stoakes
2026-07-02 11:51         ` David Hildenbrand (Arm)
2026-07-02 12:49           ` Lorenzo Stoakes [this message]
2026-07-02 13:41             ` Jani Nikula
2026-07-02 13:48               ` Jeff Layton
2026-07-02 13:26           ` Christian Brauner
2026-07-02 13:35             ` Christoph Hellwig
2026-07-02 13:47             ` Laurent Pinchart
2026-07-02 13:57             ` Lorenzo Stoakes
2026-07-02 14:09               ` Christian Brauner
2026-07-02 14:46                 ` Lorenzo Stoakes
2026-07-02 16:18                   ` Christian Brauner
2026-07-02 16:29                     ` Lorenzo Stoakes
2026-07-02  8:08     ` Laurent Pinchart
2026-07-02  8:28       ` Christian Brauner
2026-07-02 13:48         ` Carlos Maiolino
2026-07-02  9:24   ` Lorenzo Stoakes
2026-07-01 18:35 ` Jeff Layton
2026-07-01 18:53   ` Jakub Kicinski
2026-07-02  7:29     ` Christian Brauner
2026-07-02  7:28   ` Christian Brauner
2026-07-02  8:12 ` Jori Koolstra
2026-07-02  8:44   ` Vlastimil Babka (SUSE)
2026-07-02  9:09     ` Jori Koolstra
2026-07-02  9:39       ` Lorenzo Stoakes
2026-07-02  9:37     ` Lorenzo Stoakes
2026-07-02  9:38     ` Laurent Pinchart
2026-07-02  9:44       ` Lorenzo Stoakes
2026-07-02 11:57         ` Brian Foster
2026-07-02 12:18           ` Lorenzo Stoakes
2026-07-02 13:07             ` Laurent Pinchart
2026-07-02 13:37               ` Lorenzo Stoakes
2026-07-02 13:47                 ` Jori Koolstra
2026-07-02 13:09             ` Brian Foster
2026-07-02 13:25               ` Lorenzo Stoakes
2026-07-02 10:34     ` Krzysztof Kozlowski
2026-07-02 12:23       ` Lorenzo Stoakes
2026-07-02 12:39         ` Krzysztof Kozlowski
2026-07-02 12:53           ` Lorenzo Stoakes
2026-07-02 13:23       ` Laurent Pinchart
2026-07-02 10:27 ` Krzysztof Kozlowski
2026-07-02 11:27 ` Christoph Hellwig
2026-07-02 13:40   ` Carlos Maiolino

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=akZYSGoysWSb0K1J@lucifer \
    --to=ljs@kernel.org \
    --cc=axboe@kernel.dk \
    --cc=brauner@kernel.org \
    --cc=corbet@lwn.net \
    --cc=david@kernel.org \
    --cc=jlayton@kernel.org \
    --cc=linux-doc@vger.kernel.org \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=torvalds@linux-foundation.org \
    --cc=vbabka@kernel.org \
    --cc=workflows@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox