From: Lorenzo Stoakes <ljs@kernel.org>
To: Brian Foster <bfoster@redhat.com>
Cc: Laurent Pinchart <laurent.pinchart@ideasonboard.com>,
"Vlastimil Babka (SUSE)" <vbabka@kernel.org>,
Jori Koolstra <jkoolstra@xs4all.nl>,
Christian Brauner <brauner@kernel.org>,
Linus Torvalds <torvalds@linux-foundation.org>,
Jonathan Corbet <corbet@lwn.net>, Jens Axboe <axboe@kernel.dk>,
David Hildenbrand <david@kernel.org>,
Jeff Layton <jlayton@kernel.org>,
workflows@vger.kernel.org, linux-doc@vger.kernel.org,
linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org
Subject: Re: [PATCH RFC] coding-assistants: simplify attribution
Date: Thu, 2 Jul 2026 14:25:31 +0100 [thread overview]
Message-ID: <akZmJgbSdyM2Ww-t@lucifer> (raw)
In-Reply-To: <akZi8XmYA0XUCx2-@bfoster>
On Thu, Jul 02, 2026 at 09:09:05AM -0400, Brian Foster wrote:
> On Thu, Jul 02, 2026 at 01:18:06PM +0100, Lorenzo Stoakes wrote:
> > On Thu, Jul 02, 2026 at 07:57:45AM -0400, Brian Foster wrote:
> > > On Thu, Jul 02, 2026 at 10:44:09AM +0100, Lorenzo Stoakes wrote:
> > > > On Thu, Jul 02, 2026 at 12:38:44PM +0300, Laurent Pinchart wrote:
> > > > > On Thu, Jul 02, 2026 at 10:44:34AM +0200, Vlastimil Babka (SUSE) wrote:
> > > > > > On 7/2/26 10:12, Jori Koolstra wrote:
> > > > > > > Ah, I still reigniting this discussion again :)
> > > > > > >
> > > > > > > What about a combination of what David and Jeff say? The whole point
> > > > > > > seems to me that the salient information is not that an LLM was used (or
> > > > > > > are we going to tag Sashiko as well or any other LLM-based code review
> > > > > > > tool?), but what is was used to do. This information may be relevant for
> > > > > > > how the review is approached. The latter should perhaps only be in the
> > > > > > > cover letter and then we can drop the assisted-by tags altogether.
> > > > > > >
> > > > > > > The question about enforcement remains.
> > > > > >
> > > > > > It's not possible to enforce it. People can deny it if the tag is missing
> > > > > > and you confront them and even though the submission has many signs of being
> > > > > > obviously LLM, there is no definite proof. We've seen (likely, as there's no
> > > > > > proof!) that happen in mm.
> > > > > >
> > > > > > Such situation then penalizes those who disclose so obviously they won't.
> > > > >
> > > > > I think there's also a penality for those who don't disclose when
> > > > > they're told they should: it will lower trust. Kernel development is
> > > > > largely based on a trust model. If a contributor decides to adopt a
> > > > > deceiptful behaviour, they can expect maintainers to raise the bar for
> > > > > accepting patches, when not rejecting them outright.
> > > >
> > > > Yes, I explicitly said this in response to somebody for whom there was
> > > > overwhelming evidence they were submitting AI slop, and that they'd need to
> > > > build it back up again.
> > > >
> > > > It's precisely the issue as I see it.
> > > >
> > > > But others within the community disagreed with me, so it turned into a very
> > > > long and draining discussion that I don't particularly wish to repeat.
> > > >
> > > > So we really need clarity on it being OK to do this (I remember saying this
> > > > last year when I made an ultimately unsuccessful submission to the
> > > > maintainer's summit about all this :)
> > > >
> > > > What matters overall is being able to _quickly_ dismiss AI slop so that
> > > > asymmetry between LLM generation + maintainer time isn't exploited.
> > > >
> > > > And ultimately I think the trust model will end up being 'newcomes have 0,
> > > > now build it up'.
> > > >
> > > > Which sucks but this issue is simply existential for open source.
> > > >
> > >
> > > Has anybody tried throwing any of the obvious LLM slop submissions we
> > > have seen into one of these LLM detector things? To be clear, I've never
> > > tried those so I'm certainly no authority on if they even work reliably,
> > > but if so I wonder if something like that is a potential solution for
> > > elminating the worst cases..
> > >
> > > I.e., suppose we had some Sashiko type LLM/bot whose job was mainly to
> > > detect purely LLM generated content based on some minimum level of
> > > confidence and reply with a loud and clear message to the thread. Maybe
> > > that would be a clear enough signal to maintainers and reviewers that
> > > something is not worth prioritizing for review.. Maybe also some "slop
> > > detected" feedback would help disincentivize flinging slop onto the
> > > lists. At the very least that could be something that is more easily
> > > configured/enabled per-subsystem without having to use per-subsystem
> > > commit tags.
> >
> > Yup I thought of this, have done this on series and they do detect it
> > reliably.
> >
>
> Interesting.
>
> > But then it becomes an arms race. People will get AI to try to defeat AI
> > detection. So I'm not sure it's a safe road to go down.
> >
>
> Yeah that's fair, but my thinking is the goal would be more to detect
> slop content as opposed to just LLM usage. The trend of these
> discussions has been that we generally don't care too much about LLM use
> in general when it's used properly as a tool, but rather as you stated
> earlier we want to be able to quickly dismiss slop to avoid wasting
> maintainer and reviewer time.
>
> It seems like we're generally able to sniff out pure LLM generated
> content in most cases. That suggests we should be able document some
> common patterns that tend to describe slop such that an LLM bot could
> detect and flag it. For example, "Slop bot has detected comments have
> LLM prose and just repeat what the code does," "Slop bot has detected
> unnecessary code volume and duplication," etc. etc.
Yep it might be helpful.
But it turns out this is more contentious than you might think, and it might be
a hard sell.
>
> So the idea with that would be that defeating the slop bot is the point.
> Consider it a sort of informal/minimal goal before a submitter is
> entitled to human review. But yeah, I guess it's still just a handwavy
> theory that you'd be able to reliably disentagle detection of general
> LLM code generation from something that more qualifies as LLM slop. Just
> thinking out loud.
Yeah tbh it'd not be perfect but I wouldn't mind if we did this.
Some information > no information.
>
> Brian
>
> > >
> > > Brian
> > >
> >
> > Thanks, Lorenzo
> >
>
Thanks, Lorenzo
next prev parent reply other threads:[~2026-07-02 13:25 UTC|newest]
Thread overview: 53+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-07-01 15:54 [PATCH RFC] coding-assistants: simplify attribution Christian Brauner
2026-07-01 16:08 ` Mark Brown
2026-07-02 7:10 ` Christian Brauner
2026-07-02 11:35 ` Mark Brown
2026-07-01 16:08 ` Jonathan Corbet
2026-07-01 16:12 ` David Hildenbrand (Arm)
2026-07-02 7:11 ` Christian Brauner
2026-07-02 9:51 ` David Disseldorp
2026-07-01 16:10 ` David Hildenbrand (Arm)
2026-07-02 7:27 ` Christian Brauner
2026-07-02 7:46 ` David Hildenbrand (Arm)
2026-07-02 8:10 ` Laurent Pinchart
2026-07-02 8:16 ` David Hildenbrand (Arm)
2026-07-02 10:04 ` Lorenzo Stoakes
2026-07-02 11:51 ` David Hildenbrand (Arm)
2026-07-02 12:49 ` Lorenzo Stoakes
2026-07-02 13:41 ` Jani Nikula
2026-07-02 13:48 ` Jeff Layton
2026-07-02 13:26 ` Christian Brauner
2026-07-02 13:35 ` Christoph Hellwig
2026-07-02 13:47 ` Laurent Pinchart
2026-07-02 13:57 ` Lorenzo Stoakes
2026-07-02 14:09 ` Christian Brauner
2026-07-02 8:08 ` Laurent Pinchart
2026-07-02 8:28 ` Christian Brauner
2026-07-02 13:48 ` Carlos Maiolino
2026-07-02 9:24 ` Lorenzo Stoakes
2026-07-01 18:35 ` Jeff Layton
2026-07-01 18:53 ` Jakub Kicinski
2026-07-02 7:29 ` Christian Brauner
2026-07-02 7:28 ` Christian Brauner
2026-07-02 8:12 ` Jori Koolstra
2026-07-02 8:44 ` Vlastimil Babka (SUSE)
2026-07-02 9:09 ` Jori Koolstra
2026-07-02 9:39 ` Lorenzo Stoakes
2026-07-02 9:37 ` Lorenzo Stoakes
2026-07-02 9:38 ` Laurent Pinchart
2026-07-02 9:44 ` Lorenzo Stoakes
2026-07-02 11:57 ` Brian Foster
2026-07-02 12:18 ` Lorenzo Stoakes
2026-07-02 13:07 ` Laurent Pinchart
2026-07-02 13:37 ` Lorenzo Stoakes
2026-07-02 13:47 ` Jori Koolstra
2026-07-02 13:09 ` Brian Foster
2026-07-02 13:25 ` Lorenzo Stoakes [this message]
2026-07-02 10:34 ` Krzysztof Kozlowski
2026-07-02 12:23 ` Lorenzo Stoakes
2026-07-02 12:39 ` Krzysztof Kozlowski
2026-07-02 12:53 ` Lorenzo Stoakes
2026-07-02 13:23 ` Laurent Pinchart
2026-07-02 10:27 ` Krzysztof Kozlowski
2026-07-02 11:27 ` Christoph Hellwig
2026-07-02 13:40 ` Carlos Maiolino
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=akZmJgbSdyM2Ww-t@lucifer \
--to=ljs@kernel.org \
--cc=axboe@kernel.dk \
--cc=bfoster@redhat.com \
--cc=brauner@kernel.org \
--cc=corbet@lwn.net \
--cc=david@kernel.org \
--cc=jkoolstra@xs4all.nl \
--cc=jlayton@kernel.org \
--cc=laurent.pinchart@ideasonboard.com \
--cc=linux-doc@vger.kernel.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=torvalds@linux-foundation.org \
--cc=vbabka@kernel.org \
--cc=workflows@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox