Linux Documentation
 help / color / mirror / Atom feed
From: Lorenzo Stoakes <ljs@kernel.org>
To: Brian Foster <bfoster@redhat.com>
Cc: Laurent Pinchart <laurent.pinchart@ideasonboard.com>,
	 "Vlastimil Babka (SUSE)" <vbabka@kernel.org>,
	Jori Koolstra <jkoolstra@xs4all.nl>,
	 Christian Brauner <brauner@kernel.org>,
	Linus Torvalds <torvalds@linux-foundation.org>,
	 Jonathan Corbet <corbet@lwn.net>, Jens Axboe <axboe@kernel.dk>,
	 David Hildenbrand <david@kernel.org>,
	Jeff Layton <jlayton@kernel.org>,
	workflows@vger.kernel.org,  linux-doc@vger.kernel.org,
	linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org
Subject: Re: [PATCH RFC] coding-assistants: simplify attribution
Date: Thu, 2 Jul 2026 14:25:31 +0100	[thread overview]
Message-ID: <akZmJgbSdyM2Ww-t@lucifer> (raw)
In-Reply-To: <akZi8XmYA0XUCx2-@bfoster>

On Thu, Jul 02, 2026 at 09:09:05AM -0400, Brian Foster wrote:
> On Thu, Jul 02, 2026 at 01:18:06PM +0100, Lorenzo Stoakes wrote:
> > On Thu, Jul 02, 2026 at 07:57:45AM -0400, Brian Foster wrote:
> > > On Thu, Jul 02, 2026 at 10:44:09AM +0100, Lorenzo Stoakes wrote:
> > > > On Thu, Jul 02, 2026 at 12:38:44PM +0300, Laurent Pinchart wrote:
> > > > > On Thu, Jul 02, 2026 at 10:44:34AM +0200, Vlastimil Babka (SUSE) wrote:
> > > > > > On 7/2/26 10:12, Jori Koolstra wrote:
> > > > > > > Ah, I still reigniting this discussion again :)
> > > > > > >
> > > > > > > What about a combination of what David and Jeff say? The whole point
> > > > > > > seems to me that the salient information is not that an LLM was used (or
> > > > > > > are we going to tag Sashiko as well or any other LLM-based code review
> > > > > > > tool?), but what is was used to do. This information may be relevant for
> > > > > > > how the review is approached. The latter should perhaps only be in the
> > > > > > > cover letter and then we can drop the assisted-by tags altogether.
> > > > > > >
> > > > > > > The question about enforcement remains.
> > > > > >
> > > > > > It's not possible to enforce it. People can deny it if the tag is missing
> > > > > > and you confront them and even though the submission has many signs of being
> > > > > > obviously LLM, there is no definite proof. We've seen (likely, as there's no
> > > > > > proof!) that happen in mm.
> > > > > >
> > > > > > Such situation then penalizes those who disclose so obviously they won't.
> > > > >
> > > > > I think there's also a penality for those who don't disclose when
> > > > > they're told they should: it will lower trust. Kernel development is
> > > > > largely based on a trust model. If a contributor decides to adopt a
> > > > > deceiptful behaviour, they can expect maintainers to raise the bar for
> > > > > accepting patches, when not rejecting them outright.
> > > >
> > > > Yes, I explicitly said this in response to somebody for whom there was
> > > > overwhelming evidence they were submitting AI slop, and that they'd need to
> > > > build it back up again.
> > > >
> > > > It's precisely the issue as I see it.
> > > >
> > > > But others within the community disagreed with me, so it turned into a very
> > > > long and draining discussion that I don't particularly wish to repeat.
> > > >
> > > > So we really need clarity on it being OK to do this (I remember saying this
> > > > last year when I made an ultimately unsuccessful submission to the
> > > > maintainer's summit about all this :)
> > > >
> > > > What matters overall is being able to _quickly_ dismiss AI slop so that
> > > > asymmetry between LLM generation + maintainer time isn't exploited.
> > > >
> > > > And ultimately I think the trust model will end up being 'newcomes have 0,
> > > > now build it up'.
> > > >
> > > > Which sucks but this issue is simply existential for open source.
> > > >
> > >
> > > Has anybody tried throwing any of the obvious LLM slop submissions we
> > > have seen into one of these LLM detector things? To be clear, I've never
> > > tried those so I'm certainly no authority on if they even work reliably,
> > > but if so I wonder if something like that is a potential solution for
> > > elminating the worst cases..
> > >
> > > I.e., suppose we had some Sashiko type LLM/bot whose job was mainly to
> > > detect purely LLM generated content based on some minimum level of
> > > confidence and reply with a loud and clear message to the thread. Maybe
> > > that would be a clear enough signal to maintainers and reviewers that
> > > something is not worth prioritizing for review.. Maybe also some "slop
> > > detected" feedback would help disincentivize flinging slop onto the
> > > lists. At the very least that could be something that is more easily
> > > configured/enabled per-subsystem without having to use per-subsystem
> > > commit tags.
> >
> > Yup I thought of this, have done this on series and they do detect it
> > reliably.
> >
>
> Interesting.
>
> > But then it becomes an arms race. People will get AI to try to defeat AI
> > detection. So I'm not sure it's a safe road to go down.
> >
>
> Yeah that's fair, but my thinking is the goal would be more to detect
> slop content as opposed to just LLM usage. The trend of these
> discussions has been that we generally don't care too much about LLM use
> in general when it's used properly as a tool, but rather as you stated
> earlier we want to be able to quickly dismiss slop to avoid wasting
> maintainer and reviewer time.
>
> It seems like we're generally able to sniff out pure LLM generated
> content in most cases. That suggests we should be able document some
> common patterns that tend to describe slop such that an LLM bot could
> detect and flag it. For example, "Slop bot has detected comments have
> LLM prose and just repeat what the code does," "Slop bot has detected
> unnecessary code volume and duplication," etc. etc.

Yep it might be helpful.

But it turns out this is more contentious than you might think, and it might be
a hard sell.

>
> So the idea with that would be that defeating the slop bot is the point.
> Consider it a sort of informal/minimal goal before a submitter is
> entitled to human review. But yeah, I guess it's still just a handwavy
> theory that you'd be able to reliably disentagle detection of general
> LLM code generation from something that more qualifies as LLM slop. Just
> thinking out loud.

Yeah tbh it'd not be perfect but I wouldn't mind if we did this.

Some information > no information.

>
> Brian
>
> > >
> > > Brian
> > >
> >
> > Thanks, Lorenzo
> >
>

Thanks, Lorenzo

  reply	other threads:[~2026-07-02 13:25 UTC|newest]

Thread overview: 53+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-07-01 15:54 [PATCH RFC] coding-assistants: simplify attribution Christian Brauner
2026-07-01 16:08 ` Mark Brown
2026-07-02  7:10   ` Christian Brauner
2026-07-02 11:35     ` Mark Brown
2026-07-01 16:08 ` Jonathan Corbet
2026-07-01 16:12   ` David Hildenbrand (Arm)
2026-07-02  7:11   ` Christian Brauner
2026-07-02  9:51   ` David Disseldorp
2026-07-01 16:10 ` David Hildenbrand (Arm)
2026-07-02  7:27   ` Christian Brauner
2026-07-02  7:46     ` David Hildenbrand (Arm)
2026-07-02  8:10       ` Laurent Pinchart
2026-07-02  8:16         ` David Hildenbrand (Arm)
2026-07-02 10:04       ` Lorenzo Stoakes
2026-07-02 11:51         ` David Hildenbrand (Arm)
2026-07-02 12:49           ` Lorenzo Stoakes
2026-07-02 13:41             ` Jani Nikula
2026-07-02 13:48               ` Jeff Layton
2026-07-02 13:26           ` Christian Brauner
2026-07-02 13:35             ` Christoph Hellwig
2026-07-02 13:47             ` Laurent Pinchart
2026-07-02 13:57             ` Lorenzo Stoakes
2026-07-02 14:09               ` Christian Brauner
2026-07-02  8:08     ` Laurent Pinchart
2026-07-02  8:28       ` Christian Brauner
2026-07-02 13:48         ` Carlos Maiolino
2026-07-02  9:24   ` Lorenzo Stoakes
2026-07-01 18:35 ` Jeff Layton
2026-07-01 18:53   ` Jakub Kicinski
2026-07-02  7:29     ` Christian Brauner
2026-07-02  7:28   ` Christian Brauner
2026-07-02  8:12 ` Jori Koolstra
2026-07-02  8:44   ` Vlastimil Babka (SUSE)
2026-07-02  9:09     ` Jori Koolstra
2026-07-02  9:39       ` Lorenzo Stoakes
2026-07-02  9:37     ` Lorenzo Stoakes
2026-07-02  9:38     ` Laurent Pinchart
2026-07-02  9:44       ` Lorenzo Stoakes
2026-07-02 11:57         ` Brian Foster
2026-07-02 12:18           ` Lorenzo Stoakes
2026-07-02 13:07             ` Laurent Pinchart
2026-07-02 13:37               ` Lorenzo Stoakes
2026-07-02 13:47                 ` Jori Koolstra
2026-07-02 13:09             ` Brian Foster
2026-07-02 13:25               ` Lorenzo Stoakes [this message]
2026-07-02 10:34     ` Krzysztof Kozlowski
2026-07-02 12:23       ` Lorenzo Stoakes
2026-07-02 12:39         ` Krzysztof Kozlowski
2026-07-02 12:53           ` Lorenzo Stoakes
2026-07-02 13:23       ` Laurent Pinchart
2026-07-02 10:27 ` Krzysztof Kozlowski
2026-07-02 11:27 ` Christoph Hellwig
2026-07-02 13:40   ` Carlos Maiolino

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=akZmJgbSdyM2Ww-t@lucifer \
    --to=ljs@kernel.org \
    --cc=axboe@kernel.dk \
    --cc=bfoster@redhat.com \
    --cc=brauner@kernel.org \
    --cc=corbet@lwn.net \
    --cc=david@kernel.org \
    --cc=jkoolstra@xs4all.nl \
    --cc=jlayton@kernel.org \
    --cc=laurent.pinchart@ideasonboard.com \
    --cc=linux-doc@vger.kernel.org \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=torvalds@linux-foundation.org \
    --cc=vbabka@kernel.org \
    --cc=workflows@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox