Linux Serial subsystem development
 help / color / mirror / Atom feed
From: "Weiming Shi" <bestswngs@gmail.com>
To: "Greg Kroah-Hartman" <gregkh@linuxfoundation.org>,
	"Weiming Shi" <bestswngs@gmail.com>
Cc: "Jiri Slaby" <jirislaby@kernel.org>,
	"Daniel Starke" <daniel.starke@siemens.com>,
	<linux-kernel@vger.kernel.org>, <linux-serial@vger.kernel.org>,
	"Xiang Mei" <xmei5@asu.edu>
Subject: Re: [PATCH] tty: n_gsm: fix NULL deref of gsm->dlci[0] in control message handlers
Date: Sat, 13 Jun 2026 13:43:24 +0800	[thread overview]
Message-ID: <DJ7OKN8EMAK8.22CE0B8NZXD73@gmail.com> (raw)
In-Reply-To: <2026061228-scrubber-cosmetics-c77a@gregkh>

On Fri Jun 12, 2026 at 10:28 PM CST, Greg Kroah-Hartman wrote:
> On Fri, Jun 12, 2026 at 10:19:18PM +0800, Weiming Shi wrote:
>> On Fri Jun 12, 2026 at 2:50 AM CST, Greg Kroah-Hartman wrote:
>> > On Thu, Jun 11, 2026 at 11:32:18AM -0700, Weiming Shi wrote:
>> >> gsm_control_command() and gsm_control_reply() load gsm->dlci[0] and
>> >> immediately dereference dlci->ftype without checking it for NULL.
>> >> 
>> >> On the receive path, gsm_queue() validates that gsm->dlci[0] is non-NULL
>> >> and DLCI_OPEN before invoking the control handler, but the value is not
>> >> held across that check: the receive worker runs from flush_to_ldisc()
>> >> without taking gsm->mutex, while a concurrent GSMIOC_SETCONF ioctl can
>> >> enter gsm_cleanup_mux(), which takes gsm->mutex, releases gsm->dlci[0]
>> >> and sets it to NULL. If the mux is torn down between gsm_queue()'s check
>> >> and the re-load inside gsm_control_command()/gsm_control_reply(), the
>> >> handler dereferences a NULL dlci.
>> >> 
>> >> A peer that drives DLCI 0 control frames (e.g. CMD_TEST) while the mux
>> >> owner reconfigures the line discipline can therefore crash the kernel
>> >> (line numbers from decode_stacktrace.sh against the crashing build):
>> >> 
>> >>  Oops: general protection fault, probably for non-canonical address
>> >>  KASAN: null-ptr-deref in range [0x0000000000000208-0x000000000000020f]
>> >>  RIP: 0010:gsm_control_reply (drivers/tty/n_gsm.c:1497)
>> >>  Call Trace:
>> >>   gsm_dlci_command (drivers/tty/n_gsm.c:2482)
>> >>   gsm_queue.part.0 (drivers/tty/n_gsm.c:2852)
>> >>   gsm0_receive (drivers/tty/n_gsm.c:2972)
>> >>   gsmld_receive_buf (drivers/tty/n_gsm.c:3629)
>> >>   tty_ldisc_receive_buf (drivers/tty/tty_buffer.c:391)
>> >>   tty_port_default_receive_buf (drivers/tty/tty_port.c:39)
>> >>   flush_to_ldisc (drivers/tty/tty_buffer.c:495)
>> >>   process_one_work
>> >>   worker_thread
>> >>   kthread
>> >> 
>> >> The other callers of these helpers (the keep-alive and negotiation timer
>> >> paths) already guard the gsm->dlci[0] access; only the receive path is
>> >> unguarded. The CMD_CLD handler in the same switch already checks the
>> >> loaded dlci for NULL for the very same reason. Bail out early when
>> >> gsm->dlci[0] has been cleared instead of dereferencing it.
>> >> 
>> >> Triggering this requires CAP_NET_ADMIN to attach the n_gsm line
>> >> discipline (gsmld_open() uses capable(), not ns_capable()), so it is a
>> >> local denial of service for a privileged mux owner racing its own
>> >> control channel; harden the handlers regardless.
>> >> 
>> >> Fixes: 5767712668b8 ("tty: n_gsm: cleanup gsm_control_command and gsm_control_reply")
>> >> Reported-by: Xiang Mei <xmei5@asu.edu>
>> >> Assisted-by: Claude:claude-opus-4-8
>> >> Signed-off-by: Weiming Shi <bestswngs@gmail.com>
>> >> ---
>> >>  drivers/tty/n_gsm.c | 6 ++++++
>> >>  1 file changed, 6 insertions(+)
>> >> 
>> >> diff --git a/drivers/tty/n_gsm.c b/drivers/tty/n_gsm.c
>> >> index 214abeb89aaa..860cfb91d510 100644
>> >> --- a/drivers/tty/n_gsm.c
>> >> +++ b/drivers/tty/n_gsm.c
>> >> @@ -1457,6 +1457,9 @@ static int gsm_control_command(struct gsm_mux *gsm, int cmd, const u8 *data,
>> >>  	struct gsm_msg *msg;
>> >>  	struct gsm_dlci *dlci = gsm->dlci[0];
>> >>  
>> >> +	if (!dlci)
>> >> +		return -EINVAL;
>> >
>> > What precents dlci from being NULL right after you check this?
>> >
>> > thanks,
>> >
>> > greg k-h
>> 
>> Hi greg,
>> 
>> I'm sorry for taking so long to respond.
>> 
>> After a closer look I think your review is correct.
>> 
>> The real problem is that the receive path touches gsm->dlci[] with no
>> lock. The teardown side holds gsm->mutex while it releases and frees the
>> dlci, but the receive worker does not: gsm_queue() loads
>> dlci = gsm->dlci[address] while it is still valid and passes it down
>> through dlci->data() to gsm_control_command()/gsm_control_reply(), which
>> also re-read gsm->dlci[0] and dereference dlci->ftype.
>> 
>> Meanwhile GSMIOC_SETCONF -> gsm_cleanup_mux() takes gsm->mutex, closes
>> DLCI0 and drops its reference via gsm_dlci_release(); the final
>> tty_port_put() runs the gsm_dlci_free() destructor, which clears the slot
>> and frees the object:
>> 
>> ```
>>   dlci->gsm->dlci[dlci->addr] = NULL;
>>   kfree(dlci);
>> ```
>> If that happens while the worker is still in the dispatch above, it ends
>> up dereferencing the freed dlci. I can reproduce this as a use-after-free:
>> 
>> ```
>> [  997.227486][   T46] BUG: KASAN: slab-use-after-free in gsm_control_reply.isra.0 (drivers/tty/n_gsm.c:1162 drivers/tty/n_gsm.c:1494)
>> [  997.229052][   T46] Read of size 8 at addr ffff888029ae9000 by task kworker/u16:2/46
>> [  997.230517][   T46]
>> [  997.230952][   T46] CPU: 1 UID: 0 PID: 46 Comm: kworker/u16:2 Not tainted 7.1.0-rc7 #1 PREEMPT(full)
>> [  997.230958][   T46] Hardware name: QEMU Ubuntu 24.04 PC v2 (i440FX + PIIX, arch_caps fix, 1996), BIOS 1.16.3-debian-4
>> [  997.230961][   T46] Workqueue: events_unbound flush_to_ldisc
>> [  997.230969][   T46] Call Trace:
>> [  997.230972][   T46]  <TASK>
>> [  997.230974][   T46]  dump_stack_lvl (lib/dump_stack.c:94 lib/dump_stack.c:120)
>> [  997.230990][   T46]  print_report (mm/kasan/report.c:378 mm/kasan/report.c:482)
>> [  997.231008][   T46]  kasan_report (mm/kasan/report.c:595)
>> [  997.231016][   T46]  gsm_control_reply.isra.0 (drivers/tty/n_gsm.c:1162 drivers/tty/n_gsm.c:1494)
>> [  997.231020][   T46]  gsm_dlci_command (drivers/tty/n_gsm.c:1873 drivers/tty/n_gsm.c:2477)
>> [  997.231036][   T46]  gsmld_receive_buf (drivers/tty/n_gsm.c:3616)
>> [  997.231044][   T46]  tty_ldisc_receive_buf (drivers/tty/tty_buffer.c:398)
>> [  997.231052][   T46]  tty_port_default_receive_buf (drivers/tty/tty_port.c:37)
>> [  997.231056][   T46]  flush_to_ldisc (drivers/tty/tty_buffer.c:452 drivers/tty/tty_buffer.c:502)
>> [  997.231066][   T46]  process_one_work (kernel/workqueue.c:3314)
>> [  997.231082][   T46]  worker_thread (kernel/workqueue.c:3397 kernel/workqueue.c:3478)
>> [  997.231091][   T46]  kthread (kernel/kthread.c:436)
>> [  997.231103][   T46]  ret_from_fork (arch/x86/kernel/process.c:158)
>> [  997.231120][   T46]  ret_from_fork_asm (arch/x86/entry/entry_64.S:245)
>> [  997.231128][   T46]  </TASK>
>> [  997.231130][   T46]
>> [  997.267905][   T46] Allocated by task 5110:
>> [  997.268716][   T46]  kasan_save_stack (mm/kasan/common.c:57)
>> [  997.269595][   T46]  kasan_save_track (mm/kasan/common.c:78)
>> [  997.270483][   T46]  __kasan_kmalloc (mm/kasan/common.c:398 mm/kasan/common.c:415)
>> [  997.271353][   T46]  gsm_dlci_alloc (./include/linux/slab.h:950 ./include/linux/slab.h:1188 drivers/tty/n_gsm.c:2648)
>> [  997.272203][   T46]  gsm_activate_mux (drivers/tty/n_gsm.c:3189)
>> [  997.273109][   T46]  gsmld_ioctl (drivers/tty/n_gsm.c:3443 drivers/tty/n_gsm.c:3846)
>> [  997.273981][   T46]  tty_ioctl (drivers/tty/tty_io.c:2801)
>> [  997.274789][   T46]  __x64_sys_ioctl (fs/ioctl.c:51 fs/ioctl.c:597 fs/ioctl.c:583 fs/ioctl.c:583)
>> [  997.275682][   T46]  do_syscall_64 (arch/x86/entry/syscall_64.c:63 arch/x86/entry/syscall_64.c:94)
>> [  997.276544][   T46]  entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:121)
>> [  997.277658][   T46]
>> [  997.278108][   T46] Freed by task 5110:
>> [  997.278865][   T46]  kasan_save_stack (mm/kasan/common.c:57)
>> [  997.279740][   T46]  kasan_save_track (mm/kasan/common.c:78)
>> [  997.280615][   T46]  kasan_save_free_info (mm/kasan/generic.c:584)
>> [  997.281554][   T46]  __kasan_slab_free (mm/kasan/common.c:253 mm/kasan/common.c:285)
>> [  997.282435][   T46]  kfree (./include/linux/kasan.h:235 mm/slub.c:2689 mm/slub.c:6251 mm/slub.c:6566)
>> [  997.283159][   T46]  gsm_cleanup_mux (drivers/tty/n_gsm.c:2711 drivers/tty/n_gsm.c:2744 drivers/tty/n_gsm.c:3161)
>> [  997.284050][   T46]  gsmld_ioctl (drivers/tty/n_gsm.c:3415 drivers/tty/n_gsm.c:3846)
>> [  997.284928][   T46]  tty_ioctl (drivers/tty/tty_io.c:2801)
>> [  997.285746][   T46]  __x64_sys_ioctl (fs/ioctl.c:51 fs/ioctl.c:597 fs/ioctl.c:583 fs/ioctl.c:583)
>> [  997.286653][   T46]  do_syscall_64 (arch/x86/entry/syscall_64.c:63 arch/x86/entry/syscall_64.c:94)
>> [  997.287526][   T46]  entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:121)
>> [  997.288639][   T46]
>> 
>> ```
>> 
>> The NULL deref I reported  is the same unguarded access on the same
>> path, just hitting the window after the slot is already cleared. Either
>> way a NULL check in the handlers can't fix it, since in the UAF case dlci
>> isn't NULL.
>> 
>> I think the fix should serialize the receive side against
>> gsm_cleanup_mux() instead of checking in the handlers. Two ways I can see:
>> 
>>   1. take gsm->mutex around the dlci lookup and dispatch in gsm_queue(), or
>>   2. pin the dlci across the dispatch using its existing tty_port ref
>>     (dlci_get/dlci_put), so gsm_dlci_free() can't run while it's in use.
>> 
>> Do you have a preference, or is there a pattern in n_gsm you'd rather I
>> use? I'll respin v2 once I know which way to go.
>
> The bigger issue here is that almost no one has this hardware.  And
> those that do, don't care about these types of issues as they do not
> have untrusted data or untrusted users, so be careful when changing
> things that you aren't able to test.
>
> I think that option 2 would probably be best, as that should not affect
> any fast code paths, right?
>
>> And I'll send the reproducer and the config to trigger it in a separate mail.
>
> Can you turn that into a real test to be added to the tree in the
> correct location?  I think we need to start adding these so that we get
> a base regression test for people to be able to run as all the LLM tools
> seem to love the broken code in this file :)
>
> thanks,
>
> greg k-h

Hi greg,

I'll try to write a base regression test and sent it together with the v2.

Best,
Weiming Shi

  reply	other threads:[~2026-06-13  5:43 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-06-11 18:32 [PATCH] tty: n_gsm: fix NULL deref of gsm->dlci[0] in control message handlers Weiming Shi
2026-06-11 18:50 ` Greg Kroah-Hartman
2026-06-12 14:19   ` Weiming Shi
2026-06-12 14:28     ` Greg Kroah-Hartman
2026-06-13  5:43       ` Weiming Shi [this message]
2026-06-12 14:22 ` Weiming Shi

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=DJ7OKN8EMAK8.22CE0B8NZXD73@gmail.com \
    --to=bestswngs@gmail.com \
    --cc=daniel.starke@siemens.com \
    --cc=gregkh@linuxfoundation.org \
    --cc=jirislaby@kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-serial@vger.kernel.org \
    --cc=xmei5@asu.edu \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox