* [PATCH] binder: Address corner cases in deferred copy and fixup
@ 2022-04-13 8:54 Alessandro Astone
2022-04-13 10:00 ` Greg KH
0 siblings, 1 reply; 4+ messages in thread
From: Alessandro Astone @ 2022-04-13 8:54 UTC (permalink / raw)
To: tkjos, gregkh, brauner, arve, linux-kernel, maco
Cc: Alessandro Astone, Joel Fernandes, Hridya Valsaraju,
Suren Baghdasaryan
When handling BINDER_TYPE_FDA object we are pushing a parent fixup
with a certain skip_size but no scatter-gather copy object, since
the copy is handled standalone.
If BINDER_TYPE_FDA is the last children the scatter-gather copy
loop will never stop to skip it, thus we are left with an item in
the parent fixup list. This will trigger the BUG_ON().
Furthermore, it is possible to receive BINDER_TYPE_FDA object
with num_fds=0 which will confuse the scatter-gather code.
In the android userspace I could only find these usecases in the
libstagefright OMX implementation, so it might be that they're
doing something very weird, but nonetheless the kernel should not
panic about it.
Fixes: 09184ae9b575 ("binder: defer copies of pre-patched txn data")
Signed-off-by: Alessandro Astone <ales.astone@gmail.com>
---
drivers/android/binder.c | 11 +++++++++--
1 file changed, 9 insertions(+), 2 deletions(-)
diff --git a/drivers/android/binder.c b/drivers/android/binder.c
index 8351c5638880..18ad6825ba30 100644
--- a/drivers/android/binder.c
+++ b/drivers/android/binder.c
@@ -2295,7 +2295,7 @@ static int binder_do_deferred_txn_copies(struct binder_alloc *alloc,
{
int ret = 0;
struct binder_sg_copy *sgc, *tmpsgc;
- struct binder_ptr_fixup *pf =
+ struct binder_ptr_fixup *tmppf, *pf =
list_first_entry_or_null(pf_head, struct binder_ptr_fixup,
node);
@@ -2349,7 +2349,11 @@ static int binder_do_deferred_txn_copies(struct binder_alloc *alloc,
list_del(&sgc->node);
kfree(sgc);
}
- BUG_ON(!list_empty(pf_head));
+ list_for_each_entry_safe(pf, tmppf, pf_head, node) {
+ BUG_ON(pf->skip_size == 0);
+ list_del(&pf->node);
+ kfree(pf);
+ }
BUG_ON(!list_empty(sgc_head));
return ret > 0 ? -EINVAL : ret;
@@ -2486,6 +2490,9 @@ static int binder_translate_fd_array(struct list_head *pf_head,
struct binder_proc *proc = thread->proc;
int ret;
+ if (fda->num_fds == 0)
+ return 0;
+
fd_buf_size = sizeof(u32) * fda->num_fds;
if (fda->num_fds >= SIZE_MAX / sizeof(u32)) {
binder_user_error("%d:%d got transaction with invalid number of fds (%lld)\n",
--
2.35.1
^ permalink raw reply related [flat|nested] 4+ messages in thread* Re: [PATCH] binder: Address corner cases in deferred copy and fixup 2022-04-13 8:54 [PATCH] binder: Address corner cases in deferred copy and fixup Alessandro Astone @ 2022-04-13 10:00 ` Greg KH 2022-04-13 11:20 ` Alessandro Astone 0 siblings, 1 reply; 4+ messages in thread From: Greg KH @ 2022-04-13 10:00 UTC (permalink / raw) To: Alessandro Astone Cc: tkjos, brauner, arve, linux-kernel, maco, Joel Fernandes, Hridya Valsaraju, Suren Baghdasaryan On Wed, Apr 13, 2022 at 10:54:27AM +0200, Alessandro Astone wrote: > When handling BINDER_TYPE_FDA object we are pushing a parent fixup > with a certain skip_size but no scatter-gather copy object, since > the copy is handled standalone. > If BINDER_TYPE_FDA is the last children the scatter-gather copy > loop will never stop to skip it, thus we are left with an item in > the parent fixup list. This will trigger the BUG_ON(). > > Furthermore, it is possible to receive BINDER_TYPE_FDA object > with num_fds=0 which will confuse the scatter-gather code. > > In the android userspace I could only find these usecases in the > libstagefright OMX implementation, so it might be that they're > doing something very weird, but nonetheless the kernel should not > panic about it. > > Fixes: 09184ae9b575 ("binder: defer copies of pre-patched txn data") > Signed-off-by: Alessandro Astone <ales.astone@gmail.com> > --- > drivers/android/binder.c | 11 +++++++++-- > 1 file changed, 9 insertions(+), 2 deletions(-) > > diff --git a/drivers/android/binder.c b/drivers/android/binder.c > index 8351c5638880..18ad6825ba30 100644 > --- a/drivers/android/binder.c > +++ b/drivers/android/binder.c > @@ -2295,7 +2295,7 @@ static int binder_do_deferred_txn_copies(struct binder_alloc *alloc, > { > int ret = 0; > struct binder_sg_copy *sgc, *tmpsgc; > - struct binder_ptr_fixup *pf = > + struct binder_ptr_fixup *tmppf, *pf = Just make this a new line: struct binder_ptr_fixup *tmppf; above the existing line. > list_first_entry_or_null(pf_head, struct binder_ptr_fixup, > node); > > @@ -2349,7 +2349,11 @@ static int binder_do_deferred_txn_copies(struct binder_alloc *alloc, > list_del(&sgc->node); > kfree(sgc); > } > - BUG_ON(!list_empty(pf_head)); So you are hitting this BUG_ON() today? > + list_for_each_entry_safe(pf, tmppf, pf_head, node) { > + BUG_ON(pf->skip_size == 0); > + list_del(&pf->node); > + kfree(pf); > + } > BUG_ON(!list_empty(sgc_head)); > > return ret > 0 ? -EINVAL : ret; > @@ -2486,6 +2490,9 @@ static int binder_translate_fd_array(struct list_head *pf_head, > struct binder_proc *proc = thread->proc; > int ret; > > + if (fda->num_fds == 0) > + return 0; Why return 0? This feels like a separate issue from above, should this be 2 different commits? thanks, greg k-h ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [PATCH] binder: Address corner cases in deferred copy and fixup 2022-04-13 10:00 ` Greg KH @ 2022-04-13 11:20 ` Alessandro Astone 2022-04-19 18:42 ` Todd Kjos 0 siblings, 1 reply; 4+ messages in thread From: Alessandro Astone @ 2022-04-13 11:20 UTC (permalink / raw) To: Greg KH Cc: tkjos, brauner, arve, linux-kernel, maco, Joel Fernandes, Hridya Valsaraju, Suren Baghdasaryan On 13/04/2022 12:00, Greg KH wrote: > On Wed, Apr 13, 2022 at 10:54:27AM +0200, Alessandro Astone wrote: >> When handling BINDER_TYPE_FDA object we are pushing a parent fixup >> with a certain skip_size but no scatter-gather copy object, since >> the copy is handled standalone. >> If BINDER_TYPE_FDA is the last children the scatter-gather copy >> loop will never stop to skip it, thus we are left with an item in >> the parent fixup list. This will trigger the BUG_ON(). >> >> Furthermore, it is possible to receive BINDER_TYPE_FDA object >> with num_fds=0 which will confuse the scatter-gather code. >> >> In the android userspace I could only find these usecases in the >> libstagefright OMX implementation, so it might be that they're >> doing something very weird, but nonetheless the kernel should not >> panic about it. >> >> Fixes: 09184ae9b575 ("binder: defer copies of pre-patched txn data") >> Signed-off-by: Alessandro Astone <ales.astone@gmail.com> >> --- >> drivers/android/binder.c | 11 +++++++++-- >> 1 file changed, 9 insertions(+), 2 deletions(-) >> >> diff --git a/drivers/android/binder.c b/drivers/android/binder.c >> index 8351c5638880..18ad6825ba30 100644 >> --- a/drivers/android/binder.c >> +++ b/drivers/android/binder.c >> @@ -2295,7 +2295,7 @@ static int binder_do_deferred_txn_copies(struct binder_alloc *alloc, >> { >> int ret = 0; >> struct binder_sg_copy *sgc, *tmpsgc; >> - struct binder_ptr_fixup *pf = >> + struct binder_ptr_fixup *tmppf, *pf = > Just make this a new line: > struct binder_ptr_fixup *tmppf; > above the existing line. > Ack. >> list_first_entry_or_null(pf_head, struct binder_ptr_fixup, >> node); >> >> @@ -2349,7 +2349,11 @@ static int binder_do_deferred_txn_copies(struct binder_alloc *alloc, >> list_del(&sgc->node); >> kfree(sgc); >> } >> - BUG_ON(!list_empty(pf_head)); > So you are hitting this BUG_ON() today? > Correct, both on 5.17, stable 5.17.2 and current master >> + list_for_each_entry_safe(pf, tmppf, pf_head, node) { >> + BUG_ON(pf->skip_size == 0); >> + list_del(&pf->node); >> + kfree(pf); >> + } >> BUG_ON(!list_empty(sgc_head)); >> >> return ret > 0 ? -EINVAL : ret; >> @@ -2486,6 +2490,9 @@ static int binder_translate_fd_array(struct list_head *pf_head, >> struct binder_proc *proc = thread->proc; >> int ret; >> >> + if (fda->num_fds == 0) >> + return 0; > Why return 0? > > This feels like a separate issue from above, should this be 2 different > commits? > return 0 because I want it to be handled as it was handled before 09184ae9b575 ("binder: defer copies of pre-patched txn data") Function `binder_do_deferred_txn_copies` distinguishes between a copy-fixup and a skip with `if (pf->skip_size)` so if the skip_size is 0, which happens if fda->num_fds is 0, it would accidentally enter the wrong branch. By returning 0 early i make sure a skip of size 0 is not added. It's not an error because it was never an error before commit 09184ae9b575 and some userspace in android is hitting this path. I would agree it's a separate issue. I originally merged it in this same patch because 1) Both are fixups to 09184ae9b575 2) Both are triggered by the same real-world android transaction that looks something like this: obj[0] BINDER_TYPE_PTR, parent obj[1] BINDER_TYPE_PTR, child obj[2] BINDER_TYPE_PTR, child obj[3] BINDER_TYPE_FDA with num_fds=0 3) In the other hunk of this patch i replace the BUG_ON with: BUG_ON(pf->skip_size == 0) to only BUG if any item remaining in the pf_head list is not a skip, but as observed we may receive skips of size 0. 4) With this hunk only, you would no longer reproduce the BUG_ON because the only transaction we receive in android with BINDER_TYPE_FDA as last child coincidentally always has num_fds=0. Certainly some weird behaviour... So if I split them, patch A would depend on patch B (see point 3), but the BUG of patch B would only be reproducible without patch A (see point 4). But let me know if you still prefer them split. > thanks, > > greg k-h ^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [PATCH] binder: Address corner cases in deferred copy and fixup 2022-04-13 11:20 ` Alessandro Astone @ 2022-04-19 18:42 ` Todd Kjos 0 siblings, 0 replies; 4+ messages in thread From: Todd Kjos @ 2022-04-19 18:42 UTC (permalink / raw) To: Alessandro Astone Cc: Greg KH, tkjos, brauner, arve, linux-kernel, maco, Joel Fernandes, Hridya Valsaraju, Suren Baghdasaryan On Wed, Apr 13, 2022 at 4:20 AM Alessandro Astone <ales.astone@gmail.com> wrote: > > On 13/04/2022 12:00, Greg KH wrote: > > > On Wed, Apr 13, 2022 at 10:54:27AM +0200, Alessandro Astone wrote: > >> When handling BINDER_TYPE_FDA object we are pushing a parent fixup > >> with a certain skip_size but no scatter-gather copy object, since > >> the copy is handled standalone. > >> If BINDER_TYPE_FDA is the last children the scatter-gather copy > >> loop will never stop to skip it, thus we are left with an item in > >> the parent fixup list. This will trigger the BUG_ON(). > >> > >> Furthermore, it is possible to receive BINDER_TYPE_FDA object > >> with num_fds=0 which will confuse the scatter-gather code. > >> > >> In the android userspace I could only find these usecases in the > >> libstagefright OMX implementation, so it might be that they're > >> doing something very weird, but nonetheless the kernel should not > >> panic about it. > >> > >> Fixes: 09184ae9b575 ("binder: defer copies of pre-patched txn data") > >> Signed-off-by: Alessandro Astone <ales.astone@gmail.com> > >> --- > >> drivers/android/binder.c | 11 +++++++++-- > >> 1 file changed, 9 insertions(+), 2 deletions(-) > >> > >> diff --git a/drivers/android/binder.c b/drivers/android/binder.c > >> index 8351c5638880..18ad6825ba30 100644 > >> --- a/drivers/android/binder.c > >> +++ b/drivers/android/binder.c > >> @@ -2295,7 +2295,7 @@ static int binder_do_deferred_txn_copies(struct binder_alloc *alloc, > >> { > >> int ret = 0; > >> struct binder_sg_copy *sgc, *tmpsgc; > >> - struct binder_ptr_fixup *pf = > >> + struct binder_ptr_fixup *tmppf, *pf = > > Just make this a new line: > > struct binder_ptr_fixup *tmppf; > > above the existing line. > > > Ack. > > >> list_first_entry_or_null(pf_head, struct binder_ptr_fixup, > >> node); > >> > >> @@ -2349,7 +2349,11 @@ static int binder_do_deferred_txn_copies(struct binder_alloc *alloc, > >> list_del(&sgc->node); > >> kfree(sgc); > >> } > >> - BUG_ON(!list_empty(pf_head)); > > So you are hitting this BUG_ON() today? > > > Correct, both on 5.17, stable 5.17.2 and current master > > >> + list_for_each_entry_safe(pf, tmppf, pf_head, node) { > >> + BUG_ON(pf->skip_size == 0); > >> + list_del(&pf->node); > >> + kfree(pf); > >> + } > >> BUG_ON(!list_empty(sgc_head)); > >> > >> return ret > 0 ? -EINVAL : ret; > >> @@ -2486,6 +2490,9 @@ static int binder_translate_fd_array(struct list_head *pf_head, > >> struct binder_proc *proc = thread->proc; > >> int ret; > >> > >> + if (fda->num_fds == 0) > >> + return 0; > > Why return 0? > > > > This feels like a separate issue from above, should this be 2 different > > commits? > > > return 0 because I want it to be handled as it was handled before > 09184ae9b575 ("binder: defer copies of pre-patched txn data") > > Function `binder_do_deferred_txn_copies` distinguishes between a copy-fixup > and a skip with `if (pf->skip_size)` so if the skip_size is 0, which happens > if fda->num_fds is 0, it would accidentally enter the wrong branch. > By returning 0 early i make sure a skip of size 0 is not added. It's not an > error because it was never an error before commit 09184ae9b575 and some > userspace in android is hitting this path. > > I would agree it's a separate issue. > I originally merged it in this same patch because > 1) Both are fixups to 09184ae9b575 > 2) Both are triggered by the same real-world android transaction that looks > something like this: > obj[0] BINDER_TYPE_PTR, parent > obj[1] BINDER_TYPE_PTR, child > obj[2] BINDER_TYPE_PTR, child > obj[3] BINDER_TYPE_FDA with num_fds=0 > 3) In the other hunk of this patch i replace the BUG_ON with: > BUG_ON(pf->skip_size == 0) > to only BUG if any item remaining in the pf_head list is not a skip, > but as observed we may receive skips of size 0. > 4) With this hunk only, you would no longer reproduce the BUG_ON because the > only transaction we receive in android with BINDER_TYPE_FDA as last child > coincidentally always has num_fds=0. Certainly some weird behaviour... > > So if I split them, patch A would depend on patch B (see point 3), but the > BUG of patch B would only be reproducible without patch A (see point 4). > > But let me know if you still prefer them split. Thanks for the fix! Please go ahead and split it into two patches. > > > thanks, > > > > greg k-h ^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2022-04-19 18:42 UTC | newest] Thread overview: 4+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2022-04-13 8:54 [PATCH] binder: Address corner cases in deferred copy and fixup Alessandro Astone 2022-04-13 10:00 ` Greg KH 2022-04-13 11:20 ` Alessandro Astone 2022-04-19 18:42 ` Todd Kjos
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox