From: sashiko-bot@kernel.org
To: "Arnaldo Carvalho de Melo" <acme@kernel.org>
Cc: linux-perf-users@vger.kernel.org
Subject: Re: [PATCH 04/29] perf zstd: Fix compression error path in zstd_compress_stream_to_records()
Date: Sun, 24 May 2026 04:06:53 +0000 [thread overview]
Message-ID: <20260524040653.6A4871F000E9@smtp.kernel.org> (raw)
In-Reply-To: <20260524032709.1080771-5-acme@kernel.org>
Thank you for your contribution! Sashiko AI review found 3 potential issue(s) to consider:
New issues:
- [High] The patch skips the final record__mmap_read_all() flush on error in the main thread to avoid retrying a failed read, but misses the identical retry path in parallel worker threads.
Pre-existing issues:
- [High] ZSTD compression silently corrupts the perf.data stream if the output requires multiple records, because record__pushfn overwrites the multi-record headers and incorrectly wraps them into a single record.
- [High] The `zstd_compress_stream_to_records` loop advances the `dst` buffer by unaligned sizes, which can trigger unaligned memory access exceptions on strict architectures.
--
commit 62a2343957107fded20fb827b93d6a1cadb0934a
Author: Arnaldo Carvalho de Melo <acme@redhat.com>
perf zstd: Fix compression error path in zstd_compress_stream_to_records()
This commit fixes bugs in the error fallback path of
zstd_compress_stream_to_records(). It returns -1 on error instead of doing
an unsafe memcpy, skips the final flush on mmap-read failure, and adds
boundary checks to ensure the compression context is properly reset.
> diff --git a/tools/perf/builtin-record.c b/tools/perf/builtin-record.c
> --- a/tools/perf/builtin-record.c
> +++ b/tools/perf/builtin-record.c
[ ... ]
> @@ -2890,6 +2890,10 @@ static int __cmd_record(struct record *rec, int argc, const char **argv)
> out_child:
> record__stop_threads(rec);
> record__mmap_read_all(rec, true);
> + goto out_free_threads;
> +out_child_no_flush:
> + /* mmap read already failed — retrying would just fail again */
> + record__stop_threads(rec);
[Severity: High]
The patch avoids retrying the failed read in the main thread by skipping
record__mmap_read_all() here, but what happens in parallel worker threads?
If a parallel worker thread running record__thread() encounters a read
error, it breaks its loop and unconditionally calls
record__mmap_read_all(thread->rec, true).
Does this circumvent the error handling logic and cause the worker thread
to immediately retry the failed read?
> out_free_threads:
> record__free_thread_data(rec);
> evlist__finalize_ctlfd(rec->evlist);
> diff --git a/tools/perf/util/zstd.c b/tools/perf/util/zstd.c
> --- a/tools/perf/util/zstd.c
> +++ b/tools/perf/util/zstd.c
[ ... ]
> @@ -54,7 +55,13 @@ ssize_t zstd_compress_stream_to_records(struct zstd_data *data, void *dst, size_
>
> while (input.pos < input.size) {
> record = dst;
> + /* process_header writes the event header into record */
> + if (dst_size < sizeof(struct perf_event_header))
> + goto reset;
> size = process_header(record, 0);
[Severity: High]
This is a pre-existing issue, but does this function corrupt the perf.data
stream if the output requires multiple records?
If the compressed output exceeds max_record_size, this loop splits it into
multiple chunks with a new perf_event_header for each. However,
record__pushfn() assumes a single event is returned and blindly overwrites
the first event's header.size and data_size with the total size of all
chunks.
Since header.size is a u16, will it overflow and truncate if the total size
exceeds 65535 bytes, while also swallowing subsequent headers into the
payload?
> + /* Output buffer full — cannot fit even the record header */
> + if (size > dst_size)
> + goto reset;
> compressed += size;
> dst += size;
> dst_size -= size;
[ ... ]
> @@ -65,10 +72,18 @@ ssize_t zstd_compress_stream_to_records(struct zstd_data *data, void *dst, size_
> if (ZSTD_isError(ret)) {
> pr_err("failed to compress %ld bytes: %s\n",
> (long)src_size, ZSTD_getErrorName(ret));
> - memcpy(dst, src, src_size);
> - return src_size;
> + goto reset;
> }
> size = output.pos;
> + /*
> + * No progress: ZSTD couldn't emit any bytes into the
> + * remaining output buffer. Calling process_header
> + * with size=0 would re-trigger header initialization,
> + * double-subtracting the header size from dst_size and
> + * underflowing the unsigned counter.
> + */
> + if (size == 0)
> + goto reset;
> size = process_header(record, size);
> compressed += size;
> dst += size;
[Severity: High]
This is a pre-existing issue, but does advancing dst by an unaligned
size cause a crash on strict architectures?
Since output.pos is not guaranteed to be 8-byte or 4-byte aligned, dst
can become unaligned here.
On the next iteration, process_header() casts this unaligned address to
struct perf_record_compressed2 * and writes a 32-bit value to
event->header.type. Will this raise a SIGBUS exception on architectures
like SPARC?
> }
>
> return compressed;
[ ... ]
--
Sashiko AI review · https://sashiko.dev/#/patchset/20260524032709.1080771-1-acme@kernel.org?part=4
next prev parent reply other threads:[~2026-05-24 4:06 UTC|newest]
Thread overview: 47+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-05-24 3:26 [PATCHES v2 00/29] perf: Harden perf.data parsing against crafted/corrupted files Arnaldo Carvalho de Melo
2026-05-24 3:26 ` [PATCH 01/29] perf session: Add minimum event size and alignment validation Arnaldo Carvalho de Melo
2026-05-24 4:13 ` sashiko-bot
2026-05-24 3:26 ` [PATCH 02/29] perf session: Bounds-check one_mmap event pointer in peek_event Arnaldo Carvalho de Melo
2026-05-24 4:03 ` sashiko-bot
2026-05-24 3:26 ` [PATCH 03/29] perf tools: Fix event_contains() macro to verify full field extent Arnaldo Carvalho de Melo
2026-05-24 3:26 ` [PATCH 04/29] perf zstd: Fix compression error path in zstd_compress_stream_to_records() Arnaldo Carvalho de Melo
2026-05-24 4:06 ` sashiko-bot [this message]
2026-05-24 3:26 ` [PATCH 05/29] perf zstd: Fix multi-iteration decompression and error handling Arnaldo Carvalho de Melo
2026-05-24 3:26 ` [PATCH 06/29] perf session: Fix PERF_RECORD_READ swap and dump for variable-length events Arnaldo Carvalho de Melo
2026-05-24 3:26 ` [PATCH 07/29] perf session: Fix swap_sample_id_all() crash on crafted events Arnaldo Carvalho de Melo
2026-05-24 3:26 ` [PATCH 08/29] perf session: Add validated swap infrastructure with null-termination checks Arnaldo Carvalho de Melo
2026-05-24 4:05 ` sashiko-bot
2026-05-24 3:26 ` [PATCH 09/29] perf session: Use bounded copy for PERF_RECORD_TIME_CONV Arnaldo Carvalho de Melo
2026-05-24 3:26 ` [PATCH 10/29] perf session: Validate HEADER_ATTR attr.size before swapping Arnaldo Carvalho de Melo
2026-05-24 4:08 ` sashiko-bot
2026-05-24 3:26 ` [PATCH 11/29] perf session: Validate nr fields against event size on both swap and common paths Arnaldo Carvalho de Melo
2026-05-24 3:26 ` [PATCH 12/29] perf header: Byte-swap build ID event pid and bounds check section entries Arnaldo Carvalho de Melo
2026-05-24 4:08 ` sashiko-bot
2026-05-24 3:26 ` [PATCH 13/29] perf cpumap: Reject RANGE_CPUS with start_cpu > end_cpu Arnaldo Carvalho de Melo
2026-05-24 4:04 ` sashiko-bot
2026-05-24 3:26 ` [PATCH 14/29] perf auxtrace: Harden auxtrace_error event handling Arnaldo Carvalho de Melo
2026-05-24 3:26 ` [PATCH 15/29] perf session: Add byte-swap and bounds check for PERF_RECORD_BPF_METADATA events Arnaldo Carvalho de Melo
2026-05-24 4:13 ` sashiko-bot
2026-05-24 3:26 ` [PATCH 16/29] perf header: Validate null-termination in PERF_RECORD_EVENT_UPDATE string fields Arnaldo Carvalho de Melo
2026-05-24 3:26 ` [PATCH 17/29] perf tools: Bounds check perf_event_attr fields against attr.size before printing Arnaldo Carvalho de Melo
2026-05-24 4:01 ` sashiko-bot
2026-05-24 3:26 ` [PATCH 18/29] perf header: Propagate feature section processing errors Arnaldo Carvalho de Melo
2026-05-24 3:26 ` [PATCH 19/29] perf header: Validate f_attr.ids section before use in perf_session__read_header() Arnaldo Carvalho de Melo
2026-05-24 3:26 ` [PATCH 20/29] perf header: Validate feature section size and add read path bounds checking Arnaldo Carvalho de Melo
2026-05-24 4:37 ` sashiko-bot
2026-05-24 3:26 ` [PATCH 21/29] perf header: Sanity check HEADER_EVENT_DESC attr.size before swap Arnaldo Carvalho de Melo
2026-05-24 3:26 ` [PATCH 22/29] perf header: Validate bitmap size before allocating in do_read_bitmap() Arnaldo Carvalho de Melo
2026-05-24 3:26 ` [PATCH 23/29] perf session: Add byte-swap handler for PERF_RECORD_COMPRESSED2 Arnaldo Carvalho de Melo
2026-05-24 3:26 ` [PATCH 24/29] perf tools: Harden compressed event processing Arnaldo Carvalho de Melo
2026-05-24 4:35 ` sashiko-bot
2026-05-24 3:26 ` [PATCH 25/29] perf session: Check for decompression buffer size overflow Arnaldo Carvalho de Melo
2026-05-24 3:27 ` [PATCH 26/29] perf session: Bound nr_cpus_avail and validate sample CPU Arnaldo Carvalho de Melo
2026-05-24 6:23 ` sashiko-bot
2026-05-24 3:27 ` [PATCH 27/29] perf kwork: Bounds check work->cpu before indexing cpus_runtime[] Arnaldo Carvalho de Melo
2026-05-24 3:27 ` [PATCH 28/29] perf session: Snapshot event->header.size in process_user_event() Arnaldo Carvalho de Melo
2026-05-24 4:31 ` sashiko-bot
2026-05-24 3:27 ` [PATCH 29/29] perf test: Add truncated perf.data robustness test Arnaldo Carvalho de Melo
-- strict thread matches above, loose matches on Subject: below --
2026-05-25 1:05 [PATCHES v3 00/29] perf: Harden perf.data parsing against crafted/corrupted files Arnaldo Carvalho de Melo
2026-05-25 1:05 ` [PATCH 04/29] perf zstd: Fix compression error path in zstd_compress_stream_to_records() Arnaldo Carvalho de Melo
2026-05-25 1:52 ` sashiko-bot
2026-05-26 21:17 [PATCHES v4 00/29] perf: Harden perf.data parsing against crafted/corrupted files Arnaldo Carvalho de Melo
2026-05-26 21:17 ` [PATCH 04/29] perf zstd: Fix compression error path in zstd_compress_stream_to_records() Arnaldo Carvalho de Melo
2026-05-26 22:00 ` sashiko-bot
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20260524040653.6A4871F000E9@smtp.kernel.org \
--to=sashiko-bot@kernel.org \
--cc=acme@kernel.org \
--cc=linux-perf-users@vger.kernel.org \
--cc=sashiko-reviews@lists.linux.dev \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox