From: Zhen Lei <thunder.leizhen@huawei.com>
To: Josh Poimboeuf <jpoimboe@kernel.org>,
Jiri Kosina <jikos@kernel.org>, Miroslav Benes <mbenes@suse.cz>,
Petr Mladek <pmladek@suse.com>,
Joe Lawrence <joe.lawrence@redhat.com>,
<live-patching@vger.kernel.org>, <linux-kernel@vger.kernel.org>,
Masahiro Yamada <masahiroy@kernel.org>,
Alexei Starovoitov <ast@kernel.org>, Jiri Olsa <jolsa@kernel.org>,
Kees Cook <keescook@chromium.org>,
Andrew Morton <akpm@linux-foundation.org>,
"Luis Chamberlain" <mcgrof@kernel.org>,
<linux-modules@vger.kernel.org>
Cc: Zhen Lei <thunder.leizhen@huawei.com>
Subject: [PATCH v4 4/8] kallsyms: Improve the performance of kallsyms_lookup_name()
Date: Tue, 20 Sep 2022 15:13:13 +0800 [thread overview]
Message-ID: <20220920071317.1787-5-thunder.leizhen@huawei.com> (raw)
In-Reply-To: <20220920071317.1787-1-thunder.leizhen@huawei.com>
Currently, to search for a symbol, we need to expand the symbols in
'kallsyms_names' one by one, and then use the expanded string for
comparison. This process can be optimized.
And now scripts/kallsyms no longer compresses the symbol types, each
symbol type always occupies one byte. So we can first compress the
searched symbol and then make a quick comparison based on the compressed
length and content. In this way, for entries with mismatched lengths,
there is no need to expand and compare strings. And for those matching
lengths, there's no need to expand the symbol. This saves a lot of time.
According to my test results, the average performance of
kallsyms_lookup_name() can be improved by 20 to 30 times.
The pseudo code of the test case is as follows:
static int stat_find_name(...)
{
start = sched_clock();
(void)kallsyms_lookup_name(name);
end = sched_clock();
//Update min, max, cnt, sum
}
/*
* Traverse all symbols in sequence and collect statistics on the time
* taken by kallsyms_lookup_name() to lookup each symbol.
*/
kallsyms_on_each_symbol(stat_find_name, NULL);
The test results are as follows (twice):
After : min=5250, max= 726560, avg= 302132
After : min=5320, max= 726850, avg= 301978
Before: min=170, max=15949190, avg=7553906
Before: min=160, max=15877280, avg=7517784
The average time consumed is only 4.01% and the maximum time consumed is
only 4.57% of the time consumed before optimization.
Signed-off-by: Zhen Lei <thunder.leizhen@huawei.com>
---
kernel/kallsyms.c | 79 +++++++++++++++++++++++++++++++++++++++++++++--
1 file changed, 76 insertions(+), 3 deletions(-)
diff --git a/kernel/kallsyms.c b/kernel/kallsyms.c
index 3e7e2c2ad2f75ef..2d76196cfe89f34 100644
--- a/kernel/kallsyms.c
+++ b/kernel/kallsyms.c
@@ -87,6 +87,71 @@ static unsigned int kallsyms_expand_symbol(unsigned int off,
return off;
}
+static int kallsyms_name_to_tokens(const char *name, char *buf)
+{
+ int i, j, k, n;
+ int len, token_len;
+ const char *token;
+ unsigned char token_idx[KSYM_NAME_LEN];
+ unsigned char token_bak[KSYM_NAME_LEN];
+
+ /*
+ * n, number of tokens in the string name.
+ * token_idx[i], the start index of the ith token.
+ * token_idx[n] is used to calculate the length of the last token.
+ */
+ n = strlen(name);
+ if (n >= KSYM_NAME_LEN) {
+ buf[0] = 0;
+ return 0;
+ }
+ for (i = 0; i <= n; i++)
+ token_idx[i] = (unsigned char)i;
+
+ /*
+ * For tokens whose token_len >= 2, a larger index value indicates
+ * a higher occurrence frequency. See scripts/kallsyms.c
+ */
+ for (i = 255; i >= 0; i--) {
+ token = &kallsyms_token_table[kallsyms_token_index[i]];
+ token_len = strlen(token);
+ if (token_len <= 1)
+ continue;
+
+ /*
+ * Find and merge two tokens into one.
+ *
+ * |<-- new_token -->|
+ * | token1 | token2 |
+ * token_idx[]: j j+1 j+2
+ *
+ */
+ for (j = 0; j < n - 1; j++) {
+ len = token_idx[j + 2] - token_idx[j];
+ if (len == token_len &&
+ !strncmp(name + token_idx[j], token, len)) {
+ token_bak[token_idx[j]] = (unsigned char)i;
+ for (k = j + 1; k < n; k++)
+ token_idx[k] = token_idx[k + 1];
+ n--;
+ }
+ }
+ }
+
+ for (j = 0; j < n; j++) {
+ len = token_idx[j + 1] - token_idx[j];
+ if (len <= 1) {
+ buf[j] = name[token_idx[j]];
+ continue;
+ }
+
+ buf[j] = token_bak[token_idx[j]];
+ }
+ buf[n] = 0;
+
+ return n;
+}
+
/*
* Get symbol type information. This is encoded as a single char at the
* beginning of the symbol name.
@@ -192,20 +257,28 @@ unsigned long kallsyms_lookup_name(const char *name)
char namebuf[KSYM_NAME_LEN];
unsigned long i;
unsigned int off;
+ int len;
/* Skip the search for empty string. */
if (!*name)
return 0;
+ len = kallsyms_name_to_tokens(name, namebuf);
+ for (i = 0, off = 0; len && i < kallsyms_num_syms; i++) {
+ if (kallsyms_names[off] == len + 1 &&
+ !memcmp(&kallsyms_names[off + 2], namebuf, len))
+ return kallsyms_sym_address(i);
+
+ off += kallsyms_names[off] + 1;
+ }
+
for (i = 0, off = 0; i < kallsyms_num_syms; i++) {
off = kallsyms_expand_symbol(off, namebuf, ARRAY_SIZE(namebuf));
- if (strcmp(namebuf, name) == 0)
- return kallsyms_sym_address(i);
-
if (cleanup_symbol_name(namebuf) && strcmp(namebuf, name) == 0)
return kallsyms_sym_address(i);
}
+
return module_kallsyms_lookup_name(name);
}
--
2.25.1
next prev parent reply other threads:[~2022-09-20 7:14 UTC|newest]
Thread overview: 25+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-09-20 7:13 [PATCH v4 0/8] kallsyms: Optimizes the performance of lookup symbols Zhen Lei
2022-09-20 7:13 ` [PATCH v4 1/8] scripts/kallsyms: rename build_initial_tok_table() Zhen Lei
2022-09-21 7:47 ` Petr Mladek
2022-09-20 7:13 ` [PATCH v4 2/8] scripts/kallsyms: ensure that all possible combinations are compressed Zhen Lei
2022-09-21 8:00 ` Petr Mladek
2022-09-21 8:31 ` Leizhen (ThunderTown)
2022-09-21 12:46 ` Leizhen (ThunderTown)
2022-09-20 7:13 ` [PATCH v4 3/8] scripts/kallsyms: don't compress symbol types Zhen Lei
2022-09-21 9:00 ` Petr Mladek
2022-09-21 13:13 ` Leizhen (ThunderTown)
2022-09-20 7:13 ` Zhen Lei [this message]
2022-09-21 15:25 ` [PATCH v4 4/8] kallsyms: Improve the performance of kallsyms_lookup_name() Petr Mladek
2022-09-22 2:15 ` Leizhen (ThunderTown)
2022-09-22 7:02 ` Petr Mladek
2022-09-22 7:21 ` Leizhen (ThunderTown)
2022-09-22 13:17 ` Petr Mladek
2022-09-28 1:35 ` Leizhen (ThunderTown)
2022-09-30 11:37 ` Leizhen (ThunderTown)
2022-09-22 7:14 ` Leizhen (ThunderTown)
2022-09-20 7:13 ` [PATCH v4 5/8] kallsyms: Add helper kallsyms_on_each_match_symbol() Zhen Lei
2022-09-21 15:30 ` Petr Mladek
2022-09-22 2:16 ` Leizhen (ThunderTown)
2022-09-20 7:13 ` [PATCH v4 6/8] livepatch: Use kallsyms_on_each_match_symbol() to improve performance Zhen Lei
2022-09-20 7:13 ` [PATCH v4 7/8] livepatch: Improve the search performance of module_kallsyms_on_each_symbol() Zhen Lei
2022-09-20 7:13 ` [PATCH v4 8/8] kallsyms: Add self-test facility Zhen Lei
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20220920071317.1787-5-thunder.leizhen@huawei.com \
--to=thunder.leizhen@huawei.com \
--cc=akpm@linux-foundation.org \
--cc=ast@kernel.org \
--cc=jikos@kernel.org \
--cc=joe.lawrence@redhat.com \
--cc=jolsa@kernel.org \
--cc=jpoimboe@kernel.org \
--cc=keescook@chromium.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-modules@vger.kernel.org \
--cc=live-patching@vger.kernel.org \
--cc=masahiroy@kernel.org \
--cc=mbenes@suse.cz \
--cc=mcgrof@kernel.org \
--cc=pmladek@suse.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox