* [PATCH v2 1/2] lib/ts_bm: fix integer overflow in pattern length calculation
@ 2026-03-08 18:10 Josh Law
2026-03-08 18:10 ` [PATCH v2 2/2] lib/ts_kmp: " Josh Law
2026-03-08 19:55 ` [PATCH v2 1/2] lib/ts_bm: " Andrew Morton
0 siblings, 2 replies; 8+ messages in thread
From: Josh Law @ 2026-03-08 18:10 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-kernel, Josh Law
From: Josh Law <objecting@objecting.org>
The ts_bm algorithm computes the required allocation size by
multiplying the pattern length by the size of an integer. If the
pattern length is sufficiently large, this can overflow the 32-bit
unsigned int before it is widened to size_t. This could result in an
undersized allocation and a subsequent heap buffer overflow when
copying the pattern.
Fix this by explicitly checking that the length does not exceed
the maximum safe threshold before calculating the buffer sizes.
Signed-off-by: Josh Law <objecting@objecting.org>
---
lib/ts_bm.c | 3 +++
1 file changed, 3 insertions(+)
diff --git a/lib/ts_bm.c b/lib/ts_bm.c
index eed5967238c5..01ea0a93cf6e 100644
--- a/lib/ts_bm.c
+++ b/lib/ts_bm.c
@@ -166,6 +166,9 @@ static struct ts_config *bm_init(const void *pattern, unsigned int len,
unsigned int prefix_tbl_len = len * sizeof(unsigned int);
size_t priv_size = sizeof(*bm) + len + prefix_tbl_len;
+ if (unlikely(len == 0 || len > (UINT_MAX - sizeof(*bm)) / (sizeof(unsigned int) + 1)))
+ return ERR_PTR(-EINVAL);
+
conf = alloc_ts_config(priv_size, gfp_mask);
if (IS_ERR(conf))
return conf;
--
2.43.0
^ permalink raw reply related [flat|nested] 8+ messages in thread
* [PATCH v2 2/2] lib/ts_kmp: fix integer overflow in pattern length calculation
2026-03-08 18:10 [PATCH v2 1/2] lib/ts_bm: fix integer overflow in pattern length calculation Josh Law
@ 2026-03-08 18:10 ` Josh Law
2026-03-08 19:55 ` [PATCH v2 1/2] lib/ts_bm: " Andrew Morton
1 sibling, 0 replies; 8+ messages in thread
From: Josh Law @ 2026-03-08 18:10 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-kernel, Josh Law
From: Josh Law <objecting@objecting.org>
The ts_kmp algorithm computes the required allocation size by
multiplying the pattern length by the size of an integer. If the
pattern length is sufficiently large, this can overflow the 32-bit
unsigned int before it is widened to size_t. This could result in an
undersized allocation and a subsequent heap buffer overflow when
copying the pattern.
Fix this by explicitly checking that the length does not exceed
the maximum safe threshold before calculating the buffer sizes.
Signed-off-by: Josh Law <objecting@objecting.org>
---
lib/ts_kmp.c | 3 +++
1 file changed, 3 insertions(+)
diff --git a/lib/ts_kmp.c b/lib/ts_kmp.c
index 5520dc28255a..e07f5e80d076 100644
--- a/lib/ts_kmp.c
+++ b/lib/ts_kmp.c
@@ -97,6 +97,9 @@ static struct ts_config *kmp_init(const void *pattern, unsigned int len,
unsigned int prefix_tbl_len = len * sizeof(unsigned int);
size_t priv_size = sizeof(*kmp) + len + prefix_tbl_len;
+ if (unlikely(len == 0 || len > (UINT_MAX - sizeof(*kmp)) / (sizeof(unsigned int) + 1)))
+ return ERR_PTR(-EINVAL);
+
conf = alloc_ts_config(priv_size, gfp_mask);
if (IS_ERR(conf))
return conf;
--
2.43.0
^ permalink raw reply related [flat|nested] 8+ messages in thread
* Re: [PATCH v2 1/2] lib/ts_bm: fix integer overflow in pattern length calculation
2026-03-08 18:10 [PATCH v2 1/2] lib/ts_bm: fix integer overflow in pattern length calculation Josh Law
2026-03-08 18:10 ` [PATCH v2 2/2] lib/ts_kmp: " Josh Law
@ 2026-03-08 19:55 ` Andrew Morton
2026-03-08 20:06 ` Josh Law
2026-03-08 20:17 ` [PATCH v3 " Josh Law
1 sibling, 2 replies; 8+ messages in thread
From: Andrew Morton @ 2026-03-08 19:55 UTC (permalink / raw)
To: Josh Law; +Cc: linux-kernel, Josh Law
On Sun, 8 Mar 2026 18:10:53 +0000 Josh Law <hlcj1234567@gmail.com> wrote:
> From: Josh Law <objecting@objecting.org>
>
> The ts_bm algorithm computes the required allocation size by
> multiplying the pattern length by the size of an integer. If the
> pattern length is sufficiently large, this can overflow the 32-bit
> unsigned int before it is widened to size_t. This could result in an
> undersized allocation and a subsequent heap buffer overflow when
> copying the pattern.
>
> Fix this by explicitly checking that the length does not exceed
> the maximum safe threshold before calculating the buffer sizes.
>
> ...
>
> --- a/lib/ts_bm.c
> +++ b/lib/ts_bm.c
> @@ -166,6 +166,9 @@ static struct ts_config *bm_init(const void *pattern, unsigned int len,
> unsigned int prefix_tbl_len = len * sizeof(unsigned int);
> size_t priv_size = sizeof(*bm) + len + prefix_tbl_len;
The above description is referring to this expression?
I think the uints will be promoted to size_t before the addition occurs?
If you're referring to the prefix_tbl_len initialization then yes,
overflow could happen.
> + if (unlikely(len == 0 || len > (UINT_MAX - sizeof(*bm)) / (sizeof(unsigned int) + 1)))
> + return ERR_PTR(-EINVAL);
Seems odd to perform these (uncommented!) checks after having performed
the problematic operations. Something like:
unsigned int prefix_tbl_len;
size_t priv_size;
/* Explanatory comment goes here */
/* Can this actually happen? */
if (unlikely(len == 0))
return ERR_PTR(-EINVAL);
/* Explanatory comment goes here */
if (unlikely(len > (UINT_MAX - sizeof(*bm)) / (sizeof(unsigned int) + 1))
return ERR_PTR(-EINVAL);
prefix_tbl_len = len * sizeof(unsigned int);
priv_size = sizeof(*bm) + len + prefix_tbl_len;
Also, please check the various helpers in overflow.h.
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [PATCH v2 1/2] lib/ts_bm: fix integer overflow in pattern length calculation
2026-03-08 19:55 ` [PATCH v2 1/2] lib/ts_bm: " Andrew Morton
@ 2026-03-08 20:06 ` Josh Law
2026-03-08 20:15 ` Josh Law
2026-03-08 20:17 ` [PATCH v3 " Josh Law
1 sibling, 1 reply; 8+ messages in thread
From: Josh Law @ 2026-03-08 20:06 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-kernel, Josh Law
8 Mar 2026 19:55:20 Andrew Morton <akpm@linux-foundation.org>:
> On Sun, 8 Mar 2026 18:10:53 +0000 Josh Law <hlcj1234567@gmail.com> wrote:
>
>> From: Josh Law <objecting@objecting.org>
>>
>> The ts_bm algorithm computes the required allocation size by
>> multiplying the pattern length by the size of an integer. If the
>> pattern length is sufficiently large, this can overflow the 32-bit
>> unsigned int before it is widened to size_t. This could result in an
>> undersized allocation and a subsequent heap buffer overflow when
>> copying the pattern.
>>
>> Fix this by explicitly checking that the length does not exceed
>> the maximum safe threshold before calculating the buffer sizes.
>>
>> ...
>>
>> --- a/lib/ts_bm.c
>> +++ b/lib/ts_bm.c
>> @@ -166,6 +166,9 @@ static struct ts_config *bm_init(const void *pattern, unsigned int len,
>> unsigned int prefix_tbl_len = len * sizeof(unsigned int);
>> size_t priv_size = sizeof(*bm) + len + prefix_tbl_len;
>
> The above description is referring to this expression?
>
> I think the uints will be promoted to size_t before the addition occurs?
>
> If you're referring to the prefix_tbl_len initialization then yes,
> overflow could happen.
>
>> + if (unlikely(len == 0 || len > (UINT_MAX - sizeof(*bm)) / (sizeof(unsigned int) + 1)))
>> + return ERR_PTR(-EINVAL);
>
> Seems odd to perform these (uncommented!) checks after having performed
> the problematic operations. Something like:
>
> unsigned int prefix_tbl_len;
> size_t priv_size;
>
> /* Explanatory comment goes here */
> /* Can this actually happen? */
> if (unlikely(len == 0))
> return ERR_PTR(-EINVAL);
>
> /* Explanatory comment goes here */
> if (unlikely(len > (UINT_MAX - sizeof(*bm)) / (sizeof(unsigned int) + 1))
> return ERR_PTR(-EINVAL);
>
> prefix_tbl_len = len * sizeof(unsigned int);
> priv_size = sizeof(*bm) + len + prefix_tbl_len;
>
>
> Also, please check the various helpers in overflow.h.
Yeah, I'm fixing that now, I also submitted a couple other patches you may want to have a look at
V/R
Josh law
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [PATCH v2 1/2] lib/ts_bm: fix integer overflow in pattern length calculation
2026-03-08 20:06 ` Josh Law
@ 2026-03-08 20:15 ` Josh Law
0 siblings, 0 replies; 8+ messages in thread
From: Josh Law @ 2026-03-08 20:15 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-kernel, Josh Law
8 Mar 2026 20:06:15 Josh Law <hlcj1234567@gmail.com>:
> 8 Mar 2026 19:55:20 Andrew Morton <akpm@linux-foundation.org>:
>
>> On Sun, 8 Mar 2026 18:10:53 +0000 Josh Law <hlcj1234567@gmail.com> wrote:
>>
>>> From: Josh Law <objecting@objecting.org>
>>>
>>> The ts_bm algorithm computes the required allocation size by
>>> multiplying the pattern length by the size of an integer. If the
>>> pattern length is sufficiently large, this can overflow the 32-bit
>>> unsigned int before it is widened to size_t. This could result in an
>>> undersized allocation and a subsequent heap buffer overflow when
>>> copying the pattern.
>>>
>>> Fix this by explicitly checking that the length does not exceed
>>> the maximum safe threshold before calculating the buffer sizes.
>>>
>>> ...
>>>
>>> --- a/lib/ts_bm.c
>>> +++ b/lib/ts_bm.c
>>> @@ -166,6 +166,9 @@ static struct ts_config *bm_init(const void *pattern, unsigned int len,
>>> unsigned int prefix_tbl_len = len * sizeof(unsigned int);
>>> size_t priv_size = sizeof(*bm) + len + prefix_tbl_len;
>>
>> The above description is referring to this expression?
>>
>> I think the uints will be promoted to size_t before the addition occurs?
>>
>> If you're referring to the prefix_tbl_len initialization then yes,
>> overflow could happen.
>>
>>> + if (unlikely(len == 0 || len > (UINT_MAX - sizeof(*bm)) / (sizeof(unsigned int) + 1)))
>>> + return ERR_PTR(-EINVAL);
>>
>> Seems odd to perform these (uncommented!) checks after having performed
>> the problematic operations. Something like:
>>
>> unsigned int prefix_tbl_len;
>> size_t priv_size;
>>
>> /* Explanatory comment goes here */
>> /* Can this actually happen? */
>> if (unlikely(len == 0))
>> return ERR_PTR(-EINVAL);
>>
>> /* Explanatory comment goes here */
>> if (unlikely(len > (UINT_MAX - sizeof(*bm)) / (sizeof(unsigned int) + 1))
>> return ERR_PTR(-EINVAL);
>>
>> prefix_tbl_len = len * sizeof(unsigned int);
>> priv_size = sizeof(*bm) + len + prefix_tbl_len;
>>
>>
>> Also, please check the various helpers in overflow.h.
>
> Yeah, I'm fixing that now, I also submitted a couple other patches you may want to have a look at
>
>
> V/R
>
>
> Josh law
Oh and also, sorry for bugging, but I have a maintainers patch I sent, and another one you need to look at,
V/R
Josh law
^ permalink raw reply [flat|nested] 8+ messages in thread
* [PATCH v3 1/2] lib/ts_bm: fix integer overflow in pattern length calculation
2026-03-08 19:55 ` [PATCH v2 1/2] lib/ts_bm: " Andrew Morton
2026-03-08 20:06 ` Josh Law
@ 2026-03-08 20:17 ` Josh Law
2026-03-08 20:17 ` [PATCH v3 2/2] lib/ts_kmp: " Josh Law
1 sibling, 1 reply; 8+ messages in thread
From: Josh Law @ 2026-03-08 20:17 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-kernel, Josh Law
From: Josh Law <objecting@objecting.org>
Changes in v3:
- move overflow checks before the arithmetic they guard
- add comments explaining the zero-length and size-overflow cases
- use check_mul_overflow() and check_add_overflow() from overflow.h
The ts_bm algorithm stores its good_shift[] table and pattern in a
single allocation sized from the pattern length. If the good_shift[]
size calculation wraps, the resulting allocation can be too small and
subsequent pattern copies can overflow it.
Fix this by rejecting zero-length patterns and by using overflow
helpers before calculating the combined allocation size.
Signed-off-by: Josh Law <objecting@objecting.org>
---
lib/ts_bm.c | 18 ++++++++++++++++--
1 file changed, 16 insertions(+), 2 deletions(-)
diff --git a/lib/ts_bm.c b/lib/ts_bm.c
index eed5967238c5..676105e84005 100644
--- a/lib/ts_bm.c
+++ b/lib/ts_bm.c
@@ -163,8 +163,22 @@ static struct ts_config *bm_init(const void *pattern, unsigned int len,
struct ts_config *conf;
struct ts_bm *bm;
int i;
- unsigned int prefix_tbl_len = len * sizeof(unsigned int);
- size_t priv_size = sizeof(*bm) + len + prefix_tbl_len;
+ unsigned int prefix_tbl_len;
+ size_t priv_size;
+
+ /* Zero-length patterns would underflow bm_find()'s initial shift. */
+ if (unlikely(!len))
+ return ERR_PTR(-EINVAL);
+
+ /*
+ * bm->pattern is stored immediately after the good_shift[] table.
+ * Reject lengths that would wrap while sizing either region.
+ */
+ if (unlikely(check_mul_overflow(len, sizeof(*bm->good_shift),
+ &prefix_tbl_len) ||
+ check_add_overflow(sizeof(*bm), (size_t)len, &priv_size) ||
+ check_add_overflow(priv_size, prefix_tbl_len, &priv_size)))
+ return ERR_PTR(-EINVAL);
conf = alloc_ts_config(priv_size, gfp_mask);
if (IS_ERR(conf))
--
2.43.0
^ permalink raw reply related [flat|nested] 8+ messages in thread
* [PATCH v3 2/2] lib/ts_kmp: fix integer overflow in pattern length calculation
2026-03-08 20:17 ` [PATCH v3 " Josh Law
@ 2026-03-08 20:17 ` Josh Law
0 siblings, 0 replies; 8+ messages in thread
From: Josh Law @ 2026-03-08 20:17 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-kernel, Josh Law
From: Josh Law <objecting@objecting.org>
Changes in v3:
- mirror the ts_bm review fixes for consistency across the series
- move overflow checks before the arithmetic they guard
- add comments and use overflow.h helpers for the allocation math
The ts_kmp algorithm stores its prefix_tbl[] table and pattern in a
single allocation sized from the pattern length. If the prefix_tbl[]
size calculation wraps, the resulting allocation can be too small and
subsequent pattern copies can overflow it.
Fix this by rejecting zero-length patterns and by using overflow
helpers before calculating the combined allocation size.
Signed-off-by: Josh Law <objecting@objecting.org>
---
lib/ts_kmp.c | 18 ++++++++++++++++--
1 file changed, 16 insertions(+), 2 deletions(-)
diff --git a/lib/ts_kmp.c b/lib/ts_kmp.c
index 5520dc28255a..29466c1803c9 100644
--- a/lib/ts_kmp.c
+++ b/lib/ts_kmp.c
@@ -94,8 +94,22 @@ static struct ts_config *kmp_init(const void *pattern, unsigned int len,
struct ts_config *conf;
struct ts_kmp *kmp;
int i;
- unsigned int prefix_tbl_len = len * sizeof(unsigned int);
- size_t priv_size = sizeof(*kmp) + len + prefix_tbl_len;
+ unsigned int prefix_tbl_len;
+ size_t priv_size;
+
+ /* Zero-length patterns would make kmp_find() read beyond kmp->pattern. */
+ if (unlikely(!len))
+ return ERR_PTR(-EINVAL);
+
+ /*
+ * kmp->pattern is stored immediately after the prefix_tbl[] table.
+ * Reject lengths that would wrap while sizing either region.
+ */
+ if (unlikely(check_mul_overflow(len, sizeof(*kmp->prefix_tbl),
+ &prefix_tbl_len) ||
+ check_add_overflow(sizeof(*kmp), (size_t)len, &priv_size) ||
+ check_add_overflow(priv_size, prefix_tbl_len, &priv_size)))
+ return ERR_PTR(-EINVAL);
conf = alloc_ts_config(priv_size, gfp_mask);
if (IS_ERR(conf))
--
2.43.0
^ permalink raw reply related [flat|nested] 8+ messages in thread
* [PATCH v3 2/2] lib/ts_kmp: fix integer overflow in pattern length calculation
2026-03-08 20:20 [PATCH v3 1/2] lib/ts_bm: " Josh Law
@ 2026-03-08 20:20 ` Josh Law
0 siblings, 0 replies; 8+ messages in thread
From: Josh Law @ 2026-03-08 20:20 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-kernel, Josh Law
From: Josh Law <objecting@objecting.org>
Changes in v3:
- mirror the ts_bm review fixes for consistency across the series
- move overflow checks before the arithmetic they guard
- add comments and use overflow.h helpers for the allocation math
The ts_kmp algorithm stores its prefix_tbl[] table and pattern in a
single allocation sized from the pattern length. If the prefix_tbl[]
size calculation wraps, the resulting allocation can be too small and
subsequent pattern copies can overflow it.
Fix this by rejecting zero-length patterns and by using overflow
helpers before calculating the combined allocation size.
Signed-off-by: Josh Law <objecting@objecting.org>
---
lib/ts_kmp.c | 18 ++++++++++++++++--
1 file changed, 16 insertions(+), 2 deletions(-)
diff --git a/lib/ts_kmp.c b/lib/ts_kmp.c
index 5520dc28255a..29466c1803c9 100644
--- a/lib/ts_kmp.c
+++ b/lib/ts_kmp.c
@@ -94,8 +94,22 @@ static struct ts_config *kmp_init(const void *pattern, unsigned int len,
struct ts_config *conf;
struct ts_kmp *kmp;
int i;
- unsigned int prefix_tbl_len = len * sizeof(unsigned int);
- size_t priv_size = sizeof(*kmp) + len + prefix_tbl_len;
+ unsigned int prefix_tbl_len;
+ size_t priv_size;
+
+ /* Zero-length patterns would make kmp_find() read beyond kmp->pattern. */
+ if (unlikely(!len))
+ return ERR_PTR(-EINVAL);
+
+ /*
+ * kmp->pattern is stored immediately after the prefix_tbl[] table.
+ * Reject lengths that would wrap while sizing either region.
+ */
+ if (unlikely(check_mul_overflow(len, sizeof(*kmp->prefix_tbl),
+ &prefix_tbl_len) ||
+ check_add_overflow(sizeof(*kmp), (size_t)len, &priv_size) ||
+ check_add_overflow(priv_size, prefix_tbl_len, &priv_size)))
+ return ERR_PTR(-EINVAL);
conf = alloc_ts_config(priv_size, gfp_mask);
if (IS_ERR(conf))
--
2.43.0
^ permalink raw reply related [flat|nested] 8+ messages in thread
end of thread, other threads:[~2026-03-08 20:20 UTC | newest]
Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2026-03-08 18:10 [PATCH v2 1/2] lib/ts_bm: fix integer overflow in pattern length calculation Josh Law
2026-03-08 18:10 ` [PATCH v2 2/2] lib/ts_kmp: " Josh Law
2026-03-08 19:55 ` [PATCH v2 1/2] lib/ts_bm: " Andrew Morton
2026-03-08 20:06 ` Josh Law
2026-03-08 20:15 ` Josh Law
2026-03-08 20:17 ` [PATCH v3 " Josh Law
2026-03-08 20:17 ` [PATCH v3 2/2] lib/ts_kmp: " Josh Law
-- strict thread matches above, loose matches on Subject: below --
2026-03-08 20:20 [PATCH v3 1/2] lib/ts_bm: " Josh Law
2026-03-08 20:20 ` [PATCH v3 2/2] lib/ts_kmp: " Josh Law
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox