Re: [PATCH 3/3] tee: add MbedTEE driver

Devicetree
 help / color / mirror / Atom feed

From: sashiko-bot@kernel.org
To: "Xing Loong" <xing.xl.loong@gmail.com>
Cc: conor+dt@kernel.org, robh@kernel.org, devicetree@vger.kernel.org
Subject: Re: [PATCH 3/3] tee: add MbedTEE driver
Date: Wed, 01 Jul 2026 13:42:39 +0000	[thread overview]
Message-ID: <20260701134240.040051F00A3A@smtp.kernel.org> (raw)
In-Reply-To: <20260701132514.186953-4-xing.xl.loong@gmail.com>

Thank you for your contribution! Sashiko AI review found 16 potential issue(s) to consider:
- [High] Calling xa_insert_irq() with GFP_KERNEL without XA_FLAGS_LOCK_IRQ causes a sleep-in-atomic kernel panic.
- [High] mbedtee_release() leaks sessions if a TEE context acts as a supplicant.
- [High] Transient GFP_ATOMIC allocation failures cause a permanent hang of the TEE-to-REE channel.
- [High] rpc_routine() silently drops the COMPLETE_TEE notification if mbedtee_rpc_fastcall() fails with -ENOMEM.
- [High] Closing any single supplicant context incorrectly aborts all unassigned global requests.
- [High] Dereferencing and freeing `call` after transferring ownership to the background completion handler on interrupted wait.
- [High] Double free of `tee_shm` object on error path in `mbedtee_supp_recv` due to missing `NULL` assignment.
- [High] Missing reference acquisition for `req->shm` leads to UAF and double free in `mbedtee_supp_send`.
- [High] Uninitialized or stale error codes are sent to the TEE when supplicant requests complete or abort.
- [High] The TEE-to-REE RPC hardirq handler sleeps while holding a spinlock if the TEE sends an async supplicant request, leading to an immediate kernel panic.
- [High] The RPC ring buffer parser skips only the command header when encountering invalid or oversized async RPCs, abandoning payload bytes in the ring.
- [High] Physical pages backing dynamic shared memory are unconditionally freed to the Linux allocator even if the driver fails to revoke the TEE's access.
- [High] The driver initializes its local ring buffer indices directly from unvalidated shared memory, enabling arbitrary out-of-bounds kernel memory accesses.
- [Medium] mbedtee_release() leaks mbedtee_session objects if mbedtee_close_session() fails.
- [Medium] Bypassing wait_for_completion_interruptible() creates an unbounded phantom wakeup accumulation causing CPU spikes.
- [Medium] Permanent memory leak of `mbedtee_rpc_call` structures on module removal for interrupted calls.
--

commit 37902c22c67b16bd6c582807221293c21dc8b899
Author: Xing Loong <xing.xl.loong@gmail.com>
Subject: tee: add MbedTEE driver

This commit adds a new Linux TEE driver for communicating with the MbedTEE
OS. It supports ARM TrustZone and RISC-V IMSIC platforms, utilizing a
shared-memory RPC protocol to interact with the Trusted Execution Environment.

> diff --git a/drivers/tee/mbedtee/core.c b/drivers/tee/mbedtee/core.c
> --- a/drivers/tee/mbedtee/core.c
> +++ b/drivers/tee/mbedtee/core.c
> @@ -83,10 +83,11 @@ static void mbedtee_release(struct tee_context *ctx)
>  
>  	if (d->is_supp_ctx) {
>  		mbedtee_supp_release(&mbedtee->supp, ctx);
>  	} else {
>  		list_for_each_entry_safe(s, n, &d->sess_list, list_node)
>  			mbedtee_close_session(ctx, s->session_id);
>  	}
>  
>  	kfree(d);

[Severity: High]
Does this mutually exclusive block skip session cleanup entirely if the context
was acting as a supplicant? It appears a userspace application could open a
session, act as a supplicant, and close the file descriptor, resulting in
leaked mbedtee_session objects since the else branch is completely bypassed.

[Severity: Medium]
If mbedtee_close_session fails (for example, due to an RPC allocation failure)
and leaves a session in the list, does freeing the context data via kfree
permanently lose the pointers to those session objects?

> [ ... ]
> @@ -195,12 +196,10 @@ static void mbedtee_remove(struct platform_device *pdev)
>  	mbedtee_rpc_uninit(mbedtee);
>  	mbedtee_r2t_uninit(mbedtee);
>  	mbedtee_supp_uninit(&mbedtee->supp);
>  	tee_shm_pool_free(mbedtee->pool);
>  	xa_destroy(&mbedtee->rpc_calls);
>  }

[Severity: Medium]
Does xa_destroy only free the internal XArray nodes? If there are any
mbedtee_rpc_call structures left behind by interrupted calls when the module
is removed, it seems this might cause a kernel memory leak.

> diff --git a/drivers/tee/mbedtee/rpc_callee.c b/drivers/tee/mbedtee/rpc_callee.c
> --- a/drivers/tee/mbedtee/rpc_callee.c
> +++ b/drivers/tee/mbedtee/rpc_callee.c
> @@ -224,10 +224,9 @@ static struct rpc_work *rpc_pick_next(struct mbedtee_device *mbedtee,
>  skip:
>  	if (cmd.waiter_id != 0 && !rpc_queue_complete_only(ctx, cmd.waiter_id))
>  		return NULL;
>  
>  	/* Bad or unhandled header: consume it to keep the ring moving */
>  	t2r_ring_advance(ctx, sizeof(cmd));
>  
>  	return NULL;
>  }

[Severity: High]
If an async RPC with an inline payload reaches this skip path, does advancing
only the header size abandon the payload bytes in the ring? It looks like the
next iteration might mistakenly parse those remaining payload bytes as a new
command header, corrupting the ring buffer permanently.

> [ ... ]
> @@ -245,10 +244,9 @@ static struct rpc_work *rpc_pick_next(struct mbedtee_device *mbedtee,
>  		if (off > ctx->t2r_shm_sz - cmd.size)
>  			goto skip;
>  
>  		new_work = kzalloc_obj(*new_work, GFP_ATOMIC);
>  		if (!new_work)
>  			return NULL;
>  
>  		t2r_ring_advance(ctx, sizeof(cmd));

[Severity: High]
If this transient GFP_ATOMIC memory allocation fails, returning NULL without
advancing the ring or queuing a retry mechanism seems to leave the ring
permanently stalled. Is there a way for the TEE-to-REE channel to recover
from this condition?

> [ ... ]
> @@ -288,14 +286,13 @@ static void rpc_drain_ring(struct mbedtee_device *mbedtee)
>  	while (READ_ONCE(ctx->t2r_ring_rd) !=
>  	    /* Pair with producer store-release after ring write. */
>  	    smp_load_acquire(&ctx->t2r_ring->wr)) {
>  		rw.data = ctx->rpc_data;
>  		c = rpc_pick_next(mbedtee, ctx, &rw);
>  		if (!c)
>  			break;
>  
>  		if (c->waiter_id == 0) {
>  			c->func(mbedtee, c->data, c->size);
>  		} else {
>  			INIT_WORK(&c->work, rpc_routine);

[Severity: High]
Breaking the loop here on a NULL return from rpc_pick_next due to an
allocation failure appears to abandon ring processing entirely. Will this
leave the TEE waiting for completion indefinitely?

[Severity: High]
For async requests (waiter_id == 0), the handler is called inline while
holding the ctx->ring_lock spinlock with interrupts disabled. If the TEE
sends an async supplicant request, does mbedtee_supp_handler end up sleeping
on wait_for_completion in atomic context, leading to a kernel panic?

> [ ... ]
> @@ -328,9 +325,8 @@ static void rpc_routine(struct work_struct *work)
>  	 * Keep retrying rather than silently dropping the completion, which
>  	 * would leave the TEE thread blocked in rpc_call_sync forever.
>  	 * On ARM the fastcall is a direct SMC so -ENOSPC never occurs.
>  	 */
>  	while (mbedtee_rpc_fastcall(c->mbedtee, MBEDTEE_RPC_COMPLETE_TEE,
>  				    (unsigned long)c->waiter_id, 0, 0) == -ENOSPC)
>  		cond_resched();

[Severity: High]
Could mbedtee_rpc_fastcall return -ENOMEM instead of -ENOSPC on RISC-V?
If so, this loop exits immediately and drops the COMPLETE_TEE notification,
which might leave the TEE thread permanently blocked.

> [ ... ]
> @@ -420,10 +416,9 @@ int mbedtee_rpc_init(struct mbedtee_device *mbedtee)
>  		memunmap(ctx->t2r_ring);
>  		ctx->t2r_ring = NULL;
>  		return -EINVAL;
>  	}
>  	ctx->t2r_ring_sz = resource_size(&res) - sizeof(struct rpc_ringbuf);
>  	/* Observe latest producer index only after ring metadata is visible. */
>  	WRITE_ONCE(ctx->t2r_ring_rd, smp_load_acquire(&ctx->t2r_ring->rd));

[Severity: High]
Does reading the initial consumer index directly from unvalidated shared memory
allow a compromised TEE to provide out-of-bounds indices? This might lead to
out-of-bounds reads or writes in kernel memory during ring buffer copies.

> diff --git a/drivers/tee/mbedtee/rpc_caller.c b/drivers/tee/mbedtee/rpc_caller.c
> --- a/drivers/tee/mbedtee/rpc_caller.c
> +++ b/drivers/tee/mbedtee/rpc_caller.c
> @@ -37,10 +37,9 @@ int mbedtee_rpc_call_alloc(struct mbedtee_device *mbedtee,
>  
>  	do {
>  		rpc_index = atomic_long_inc_return(&mbedtee->rpc_call_seq);
>  	} while (rpc_index == 0);
>  
>  	ret = xa_insert_irq(&mbedtee->rpc_calls, rpc_index, rcall, GFP_KERNEL);
>  	if (ret != 0) {
>  		kfree(rcall);

[Severity: High]
The XArray for rpc_calls is initialized with XA_FLAGS_ALLOC1 but lacks
XA_FLAGS_LOCK_IRQ. When xa_insert_irq is called here with GFP_KERNEL, it
disables local interrupts and might attempt to sleep during memory allocation
if the array needs to expand. Could this trigger a sleep-in-atomic panic?

> [ ... ]
> @@ -58,10 +57,9 @@ void mbedtee_rpc_call_free(struct mbedtee_device *mbedtee,
>  
>  	/*
>  	 * If interrupted/completed, complete_call owns this allocation
>  	 * and will xa_erase + kfree when the TEE eventually finishes.
>  	 */
>  	if (READ_ONCE(call->state) == MBEDTEE_RPC_CALL_INTERRUPTED)
>  		return;
>  
>  	xa_erase_irq(&mbedtee->rpc_calls, rpc_index);

[Severity: High]
If ownership is transferred to the background completion handler upon an
interrupted wait, could the background handler free the call object
concurrently while the caller is still accessing it?

> [ ... ]
> @@ -382,10 +380,9 @@ int mbedtee_invoke_func(struct tee_context *ctx,
>  
>  	ret = mbedtee_rpc_yieldcall(mbedtee, MBEDTEE_RPC_INVOKE_SESSION, call, true);
>  
>  	dev_dbg(mbedtee->dev, "invoke session ret %d gp_ret %d\n", ret,
>  		call->rpc.ret);
>  
>  	if (ret != 0) {
>  		arg->ret = TEEC_ERROR_COMMUNICATION;

[Severity: High]
If mbedtee_rpc_yieldcall was interrupted and transferred ownership of the call
object to the background handler, is reading call->rpc.ret here susceptible to
a use-after-free?

> [ ... ]
> @@ -416,10 +413,9 @@ int mbedtee_close_session(struct tee_context *ctx, u32 session)
>  	ret = mbedtee_rpc_call_alloc(mbedtee, sizeof(*rp), &call);
>  	if (ret != 0) {
>  		/*
>  		 * RPC allocation failed but the session is still alive on
>  		 * the TEE side. Leave it in the session list so a future
>  		 * close attempt can retry; warn so the condition is visible.
>  		 */
>  		mutex_unlock(&ctxdata->mutex);
>  		return ret;

[Severity: Medium]
If this returns early leaving the session in the list, the caller
(mbedtee_release) ignores the error and proceeds to kfree the context data.
Does this permanently leak the mbedtee_session objects?

> diff --git a/drivers/tee/mbedtee/rpc_caller_riscv.c b/drivers/tee/mbedtee/rpc_caller_riscv.c
> --- a/drivers/tee/mbedtee/rpc_caller_riscv.c
> +++ b/drivers/tee/mbedtee/rpc_caller_riscv.c
> @@ -170,9 +170,8 @@ int mbedtee_r2t_init(struct mbedtee_device *mbedtee)
>  	}
>  	ctx->ring_sz = resource_size(&res) - sizeof(struct rpc_ringbuf);
>  	/* Read initial writer index with acquire for coherent ring bootstrap. */
>  	WRITE_ONCE(ctx->ring_wr, smp_load_acquire(&ctx->ring->wr));
>  
>  	dev_dbg(mbedtee->dev, "rpc-r2t-ring %pa\n", &res.start);

[Severity: High]
Similar to the T2R ring, does accepting this unvalidated write index from the
TEE-controlled shared memory pose an out-of-bounds memory access risk?

> diff --git a/drivers/tee/mbedtee/shm_pool.c b/drivers/tee/mbedtee/shm_pool.c
> --- a/drivers/tee/mbedtee/shm_pool.c
> +++ b/drivers/tee/mbedtee/shm_pool.c
> @@ -62,10 +62,9 @@ static void pool_op_free(struct tee_shm_pool *pool,
>  	if (!(shm->flags & TEE_SHM_PRIV)) {
>  		struct mbedtee_device *mbedtee = tee_get_drvdata(shm->ctx->teedev);
>  
>  		if (mbedtee->yield && shm->sec_world_id)
>  			mbedtee_shm_unregister(shm->ctx, shm);
>  	}
>  
>  	free_pages((unsigned long)shm->kaddr, get_order(shm->size));
>  	shm->kaddr = NULL;

[Severity: High]
If mbedtee_shm_unregister fails to revoke the TEE's access (for instance, if
the RPC ring is full), the memory pages are still unconditionally freed to the
Linux allocator. Could this allow the TEE to maintain access to memory that
gets reused for other kernel processes?

> diff --git a/drivers/tee/mbedtee/supp.c b/drivers/tee/mbedtee/supp.c
> --- a/drivers/tee/mbedtee/supp.c
> +++ b/drivers/tee/mbedtee/supp.c
> @@ -73,10 +73,9 @@ void mbedtee_supp_release(struct mbedtee_supp *supp, struct tee_context *ctx)
>  	 * interfere with each other.
>  	 */
>  	list_for_each_entry_safe(r, n, &supp->reqs, node) {
>  		if (r->ctx && r->ctx != ctx)
>  			continue;
>  		mbedtee_supp_complete_req(r);
>  	}

[Severity: High]
Unassigned global requests waiting in supp->reqs will have r->ctx set to NULL.
This makes the condition evaluate to false. Does this mean that whenever any
single supplicant context is closed, it incorrectly aborts all unassigned
global requests across the system?

> [ ... ]
> @@ -156,10 +155,9 @@ void mbedtee_supp_handler(struct mbedtee_device *mbedtee,
>  
>  	/* Wake up the supplicant daemon to handle this request. */
>  	complete(&supp->reqs_c);
>  
>  	wait_for_completion(&req->c);
>  
>  	kfree(req);
>  }

[Severity: High]
When the request finishes, req->ret contains the completion status. Is this
return code ever copied back into the wire-format cmd->ret in shared memory
before freeing the request, or is the TEE receiving uninitialized data?

> [ ... ]
> @@ -206,10 +204,9 @@ int mbedtee_supp_recv(struct tee_context *ctx,
>  	if (ret != 0)
>  		return ret;
>  
>  	shm = tee_shm_get_from_id(ctx, shm->id);
>  	tee_shm_put(param->u.memref.shm);
>  	if (IS_ERR(shm))
>  		return PTR_ERR(shm);
>  	param->u.memref.shm = shm;

[Severity: High]
If tee_shm_get_from_id fails, the function returns the error without clearing
param->u.memref.shm. Will the TEE core caller's cleanup path call tee_shm_put
again on this same pointer, causing a double free?

> [ ... ]
> @@ -217,10 +214,9 @@ int mbedtee_supp_recv(struct tee_context *ctx,
>  	d->is_supp_ctx = true;
>  
>  	while (true) {
>  		ret = supp_pop_req(supp, &req);
>  		if (ret != 0)
>  			goto err;
>  		if (req)
>  			break;

[Severity: Medium]
If a request is already queued, this bypasses
wait_for_completion_interruptible entirely. Could this leave the internal
completion counter un-decremented, causing phantom wakeups to accumulate and
creating a high CPU spinning loop later when the queue is empty?

> [ ... ]
> @@ -238,10 +234,9 @@ int mbedtee_supp_recv(struct tee_context *ctx,
>  		goto err;
>  	}
>  
>  	*func = req->func;
>  	req->shm = shm;
>  
>  	memcpy(shm->kaddr, req->data, req->size);
>  	param->u.memref.size = req->size;

[Severity: High]
This assigns the shared memory pointer to req->shm without bumping its
reference count. If the TEE core drops its reference when returning to
userspace, might this memory be freed prematurely?

> [ ... ]
> @@ -295,11 +290,10 @@ int mbedtee_supp_send(struct tee_context *ctx, u32 ret, u32 num_params,
>  
>  	if (param->u.memref.size > req->size) {
>  		req->ret = TEEC_ERROR_COMMUNICATION;
>  	} else {
>  		req->ret = ret;
>  		memcpy(req->data, req->shm->kaddr, param->u.memref.size);
>  	}
>  	if (req->shm)
>  		tee_shm_put(req->shm);

[Severity: High]
Following up on the previous question, if req->shm was freed because a
reference was not acquired in mbedtee_supp_recv, does this memcpy and
subsequent tee_shm_put result in a use-after-free and a double free?

-- 
Sashiko AI review · https://sashiko.dev/#/patchset/20260701132514.186953-1-xing.xl.loong@gmail.com?part=3

next prev parent reply	other threads:[~2026-07-01 13:42 UTC|newest]

Thread overview: 15+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-07-01 13:25 [PATCH 0/3] tee: add MbedTEE driver Xing Loong
2026-07-01 13:25 ` [PATCH 1/3] dt-bindings: vendor-prefixes: add mbedtee Xing Loong
2026-07-01 13:25 ` [PATCH 2/3] dt-bindings: firmware: add mbedtee,rpc binding Xing Loong
2026-07-01 13:32   ` sashiko-bot
2026-07-01 14:05   ` Krzysztof Kozlowski
2026-07-01 16:39   ` Rob Herring (Arm)
2026-07-01 13:25 ` [PATCH 3/3] tee: add MbedTEE driver Xing Loong
2026-07-01 13:42   ` sashiko-bot [this message]
2026-07-02 15:11 ` [PATCH v2 0/3] " Xing Loong
2026-07-02 15:11   ` [PATCH v2 1/3] dt-bindings: vendor-prefixes: add mbedtee Xing Loong
2026-07-02 15:11   ` [PATCH v2 2/3] dt-bindings: firmware: add mbedtee,tee binding Xing Loong
2026-07-02 15:21     ` sashiko-bot
2026-07-02 15:27     ` Krzysztof Kozlowski
2026-07-02 15:11   ` [PATCH v2 3/3] tee: add MbedTEE driver Xing Loong
2026-07-02 15:32     ` sashiko-bot

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20260701134240.040051F00A3A@smtp.kernel.org \
    --to=sashiko-bot@kernel.org \
    --cc=conor+dt@kernel.org \
    --cc=devicetree@vger.kernel.org \
    --cc=robh@kernel.org \
    --cc=sashiko-reviews@lists.linux.dev \
    --cc=xing.xl.loong@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox