Move probe expansion into codegen #3155

viktormalik · 2024-05-07T10:52:30Z

The previous probe expansion approach tried to minimize the amount of LLVM functions generated by emitting a single function for all probe matches in most cases. While this was efficient, it came with a couple of drawbacks:

It is necessary to generate a separate LLVM function for each match (e.g. when the probe builtin is used). This leads to having two very similar loops for iterating matches in BPFtrace::add_probe and in CodegenLLVM::visit(Probe) which is quite confusing and hard to maintain.
libbpf needs one BPF program (i.e. one LLVM function) per probe so if we want to delegate program loading (and possibly attachment) to libbpf (which we do), we cannot use this approach. See [RFC] Probe expansion in codegen #3005 for more details.

This refactors probe expansion by moving most of it into codegen. Overall, we now distinguish two types of probe expansion:

Full expansion - A separate LLVM function is generated for each match. This is used for most expansions now.
Multi expansion - Used for k(u)probes when k(u)probe_multi is available. Generates one LLVM function and one BPF program for all matches and attaches the expanded functions via bpf_link_create_opts.

This allows to drop a lot of duplicated code. The expansion for "full" is done in CodegenLLVM::visit(Probe), the expansion for "multi" is done in BPFtrace::add_probe.

A drawback of this approach is that we generate substantially larger ELF objects for expansions of probe types which do not support multi-probes (e.g. kfuncs and tracepoints) as we generate duplicate LLVM functions. This is something we can live with for now since multi-attachment is not the main use-case for these probe types (e.g. attaching to many kfuncs is very slow) and there's usually an alternative to use multi-kprobes. In addition, we will probably add a third expansion type using LLVM symbol aliases but that requires libbpf changes.

One particular area where this refactoring caused problems is unit tests in tests/bpftrace.cpp. Previously, it was sufficient to generate a simple ast::Probe and pass it to BPFtrace::add_probe since that was where most of the expansion was done. Now that the expansion was moved to codegen, we need to do full parser -> field analyser -> clang parser -> semantic analyser -> codegen sequence.

Also, the problem comes with USDT probes as it is not possible to easily mock USDTHelper which is a fully static class. Since we need to override AttachPoint::usdt::num_locations from tests, we allow to do that via a new internal env variable BPFTRACE_TEST_USDT_NUM_LOCATIONS.

Checklist

Language changes are updated in man/adoc/bpftrace.adoc
User-visible and non-trivial changes updated in CHANGELOG.md
The new behaviour is covered by tests

src/ast/passes/codegen_llvm.cpp

viktormalik · 2024-05-07T13:25:18Z

Update: I had to drop unit tests for single-wildcard targets for uprobe/USDT added in #2757. The reason is that tests in tests/bpftrace.cpp now run semantic analyser which in turn runs discovery of all running processes on the system for such probes, which is not something that should be run in unit tests (and it cannot be mocked in this case). FWIW, I don't think that those tests would add much value b/c all the code added by #2757 would be replaced by mocks anyways.

ajor

Are "full expansion" and "multi expansion" meant to be mutually exclusive? An enum instead of two bools might be clearer/safer if that's the case.

I haven't dug into the cause, but I can trigger a crash on this branch by setting both full and multi expansion to true on a single probe with this script:

kprobe:*:__x64_sys_rea* { print("a") }

CHANGELOG.md

ajor · 2024-05-15T16:36:21Z

src/ast/attachpoint_parser.cpp

+ // As the C++ language supports function overload, a given function name
+ // (without parameters) could have multiple matches even when no
+ // wildcards are used.
+ if (has_wildcard(ap_->func) || has_wildcard(ap_->target) ||


Don't we need full expansion for wildcarded executable paths? i.e.:

if (has_wildcard(ap_->target) ap_->need_full_expansion = true;

In such a case, we do multi-expansion for each expanded path. See BPFtrace::add_probe for details.

viktormalik · 2024-05-16T11:19:14Z

Are "full expansion" and "multi expansion" meant to be mutually exclusive? An enum instead of two bools might be clearer/safer if that's the case.

Indeed they are, a great suggestion!

I haven't dug into the cause, but I can trigger a crash on this branch by setting both full and multi expansion to true on a single probe with this script:
kprobe:*:__x64_sys_rea* { print("a") }

Nice catch! It should be fixed now.

viktormalik · 2024-05-16T11:22:09Z

While rebasing onto master, I noticed that uprobe_multi expansion doesn't work with placing uprobes after the funtion prologue. I opened #3173 to track that issue and this PR respects the current behavior (i.e. when uprobe expansion is needed, use kprobe_multi and attach to function entries, otherwise attach after function prologue).

ajor

Just a partial review so far - I'll try and get through bpftrace.cpp tomorrow.

I'm not keen on using environment variables to mock behaviour - it runs the risk of something dodgy happening if a user sets the env var at run time. I've recently started thinking about how we can make bpftrace a "trusted compiler" to allow it to be used by a non-root user (for use in containers with BPF token), and for that to work the compiler's behaviour has to be pretty well defined.

How about this as a way of mocking static classes: https://godbolt.org/z/37czorj84

src/bpftrace.cpp

viktormalik · 2024-05-23T14:35:42Z

I'm not keen on using environment variables to mock behaviour - it runs the risk of something dodgy happening if a user sets the env var at run time. I've recently started thinking about how we can make bpftrace a "trusted compiler" to allow it to be used by a non-root user (for use in containers with BPF token), and for that to work the compiler's behaviour has to be pretty well defined.

It turned out that the only method that we need to mock doesn't need to be static so I made USDTHelper non-static and created MockUSDTHelper which mocks the find method.

ajor

Sorry for taking so long to review! Just got a few minor points, but overall looks good to me.

It's annoying that the wildcard-target unit tests had to go, but I can see that it'd be a fair bit of work to keep them.

I think ideally we should aim to get to the point where nothing in the compilation pipeline depends on any external state, so it can all be unit tested without mocks. All external state could be collected up-front and passed in as arguments - it'd help to simplify compilation (escpecially AOT) a fair bit too. Definitely too much work to do in this PR though

src/bpftrace.cpp

tests/bpftrace.cpp

viktormalik · 2024-06-05T10:21:51Z

Sorry for taking so long to review! Just got a few minor points, but overall looks good to me.

Thanks for the review! While updating the PR, I discovered that there was an error in handling expansion for wildcarded kprobe:module:function cases so I also added a couple of unit tests for that.

ttreyer · 2024-06-05T16:54:24Z

src/bpftrace.cpp

+ if (has_wildcard(ap.target)) {
+ // If we have a wildcard in the target path, we need to generate one
+ // probe per expanded target.
+ assert(type == ProbeType::uprobe || type == ProbeType::uretprobe);
+ std::unordered_map<std::string, Probe> target_map;
+ for (const auto &func : matches) {
+ ast::AttachPoint match_ap = ap.create_expansion_copy(func);
+ // Use the original (possibly wildcarded) function name
+ match_ap.func = ap.func;
+ auto found = target_map.find(match_ap.target);
+ if (found != target_map.end()) {
+ found->second.funcs.push_back(func);
 } else {
- // If we have a wildcard in the target path, we need to generate one
- // probe per expanded target.
- std::unordered_map<std::string, Probe> target_map;
- for (const auto &func : attach_funcs) {
- ast::AttachPoint ap = attach_point->create_expansion_copy(func);
- // Use the original (possibly wildcarded) function name
- ap.func = attach_point->func;
- auto found = target_map.find(ap.target);
- if (found != target_map.end()) {
- found->second.funcs.push_back(func);
- } else {
- auto probe = generate_probe(ap, p);
- probe.funcs.push_back(func);
- target_map.insert({ { ap.target, probe } });
- }
- }
- for (auto &pair : target_map) {
- resources.probes.push_back(std::move(pair.second));
- }
- continue;
+ auto probe = generate_probe(match_ap, p);
+ probe.funcs.push_back(func);
+ target_map.insert({ { match_ap.target, probe } });
 }
 }
- } else if ((probetype(attach_point->provider) == ProbeType::uprobe ||
- probetype(attach_point->provider) == ProbeType::uretprobe ||
- probetype(attach_point->provider) == ProbeType::watchpoint ||
- probetype(attach_point->provider) ==
- ProbeType::asyncwatchpoint) &&
- !attach_point->func.empty()) {
- std::set<std::string> matches;
-
- struct symbol sym = {};
- int err = resolve_uname(attach_point->func, &sym, attach_point->target);
-
- if (attach_point->lang == "cpp") {
- // As the C++ language supports function overload, a given function name
- // (without parameters) could have multiple matches even when no
- // wildcards are used.
- matches = probe_matcher_->get_matches_for_ap(*attach_point);
+ for (auto &pair : target_map) {
+ resources.probes.push_back(std::move(pair.second));
 }


Is this ever executed? It looks like target expansion takes place in SemanticAnalyser, so has_wildcard(ap.target) is always false.

Semantic analyser only resolves the binary path. The comment there says:

bpftrace/src/ast/passes/semantic_analyser.cpp

Lines 2605 to 2607 in ce8f81a

// If we are doing a PATH lookup (ie not glob), we follow shell

// behavior and take the first match.

// Otherwise we keep the target with glob, it will be expanded later

That is, when wildcards are used (and multiple matches exist), the wildcard is not expanded.

I double checked and this is indeed executed when attaching to uprobe:tests/testprogs/uprobe*:*.

src/bpftrace.cpp

ttreyer · 2024-06-05T17:03:10Z

src/ast/ast.h

+enum class ExpansionType {
+ NONE,
+ FULL,
+ MULTI,
+};


Could we detach the k(u)probe.multi feature from the ExpansionType?
What I'd love to be able to do is tell bpftrace which addresses to trap and it decides by itself if whether to use k(u)probe.multi, if available, or set-up the probe one-by-one for each address, if there are no multi probe support.

This is an excellent point. If we do AOT, we may compile the script on a machine with kprobe_multi support but then run it on a machine which doesn't have it. In such a case, the execution would probably not work.

This is quite tricky to implement, though. Since we're about to use libbpf to load the programs, for full expansion, we need to have one copy of the function for each match. I'll work on a libbpf extension to support copies via aliases (i.e. just entries in the symbol table pointing to the same code) but for now, we need full copies.

The only reasonable option that comes to mind is to replace the MULTI expansion by ALIAS (or better name if we can find one). For that expansion type, we would always emit just one copy of the program. Then, the runtime would detect if k(u)probe_multi is available and, if not, create the copies directly in the ELF. In future, we could replace that by just creating aliasing symbol table entries. I'll just need to check how complicated it is to do symbol copying in ELF.

In the meantime, if someone has a better idea, let me know.

ajor

Looks good to me!

danobi

nice cleanup. tbh, didn't look too closely since it seems like most code was just moved

tests/bpftrace.cpp

The previous probe expansion approach tried to minimize the amount of LLVM functions generated by emitting a single function for all probe matches in most cases. While this was efficient, it came with a couple of drawbacks: - It is necessary to generate a separate LLVM function for each match (e.g. when the 'probe' builtin is used). This leads to having two very similar loops for iterating matches in BPFtrace::add_probe and in CodegenLLVM::visit(Probe) which is quite confusing and hard to maintain. - libbpf needs one BPF program (i.e. one LLVM function) per probe so if we want to delegate program loading (and possibly attachment) to libbpf (which we do), we cannot use this approach. See [1] for more details. This refactors probe expansion by moving most of it into codegen. Overall, we now distinguish two types of probe expansion: Full expansion - A separate LLVM function is generated for each match. This is used for most expansions now. Multi expansion - Used for k(u)probes when k(u)probe_multi is available. Generates one LLVM function and one BPF program for all matches and attaches the expanded functions via bpf_link_create_opts. This allows to drop a lot of duplicated code. The expansion for "full" is done in CodegenLLVM::visit(Probe), the expansion for "multi" is done in BPFtrace::add_probe. A drawback of this approach is that we generate substantially larger ELF objects for expansions of probe types which do not support multi-probes (e.g. kfuncs and tracepoints) as we generate duplicate LLVM functions. This is something we can live with for now since multi-attachment is not the main use-case for these probe types (e.g. attaching to many kfuncs is very slow) and there's usually an alternative to use multi-kprobes. One particular area where this refactoring caused problems is unit tests in tests/bpftrace.cpp. Previously, it was sufficient to generate a simple ast::Probe and pass it to BPFtrace::add_probe since that was where most of the expansion was done. Now that the expansion was moved to codegen, we need to do full parser -> field analyser -> clang parser -> semantic analyser -> codegen sequence. With this change, some tests had to be dropped, especially the tests with a single wildcard for uprobe/USDT target. The reason is that semantic analyser expands these wildcards by searching all paths on the system which is something that cannot be mocked and therefore should not be run in unit tests (e.g. it prevents running the unit tests as non-root). Also, the problem comes with USDT probes as USDTHelper was originally fully static which prevented its mocking. Fortunately, we only need to mock the find() method which doesn't have to be static, so this refactors USDTHelper and some of its users and introduces MockUSDTHelper which mocks the find method for unit tests. [1] bpftrace#3005

When the kprobe:mod:func syntax is used with wildcards in either 'mod' or 'func', kprobe_multi expansion should not be used even if it is available on the system. The reason is that kprobe_multi doesn't support specifying the kernel module. Add unit tests checking that full expansion is always done, even if kprobe_multi is enabled.

viktormalik requested review from ajor, danobi and fbs as code owners May 7, 2024 10:52

viktormalik force-pushed the codegen-expansion branch from 222e56d to 3be0722 Compare May 7, 2024 10:53

github-advanced-security bot found potential problems May 7, 2024

View reviewed changes

src/ast/passes/codegen_llvm.cpp Dismissed Show resolved Hide resolved

viktormalik force-pushed the codegen-expansion branch from 3be0722 to 63e4c72 Compare May 7, 2024 13:20

viktormalik force-pushed the codegen-expansion branch 2 times, most recently from 10f6f74 to dde56bd Compare May 10, 2024 09:30

ajor self-assigned this May 15, 2024

ajor reviewed May 15, 2024

View reviewed changes

viktormalik force-pushed the codegen-expansion branch from dde56bd to 305b515 Compare May 16, 2024 11:19

viktormalik force-pushed the codegen-expansion branch from 305b515 to d3da3ae Compare May 17, 2024 09:04

ajor reviewed May 20, 2024

View reviewed changes

src/bpftrace.cpp Outdated Show resolved Hide resolved

viktormalik force-pushed the codegen-expansion branch from d3da3ae to 5dfc728 Compare May 23, 2024 14:32

viktormalik force-pushed the codegen-expansion branch from 5dfc728 to e18dac6 Compare May 23, 2024 14:57

ajor reviewed Jun 4, 2024

View reviewed changes

src/bpftrace.cpp Outdated Show resolved Hide resolved

src/bpftrace.cpp Outdated Show resolved Hide resolved

src/bpftrace.cpp Outdated Show resolved Hide resolved

tests/bpftrace.cpp Show resolved Hide resolved

tests/bpftrace.cpp Show resolved Hide resolved

viktormalik force-pushed the codegen-expansion branch from e18dac6 to f5d1b7b Compare June 5, 2024 10:20

viktormalik requested a review from ajor June 5, 2024 11:37

ttreyer reviewed Jun 5, 2024

View reviewed changes

ajor reviewed Jun 5, 2024

View reviewed changes

src/bpftrace.cpp Outdated Show resolved Hide resolved

ttreyer reviewed Jun 5, 2024

View reviewed changes

viktormalik force-pushed the codegen-expansion branch from f5d1b7b to 5663c33 Compare June 6, 2024 05:53

ajor approved these changes Jun 6, 2024

View reviewed changes

danobi self-assigned this Jun 6, 2024

danobi approved these changes Jun 6, 2024

View reviewed changes

tests/bpftrace.cpp Outdated Show resolved Hide resolved

viktormalik force-pushed the codegen-expansion branch from 5663c33 to 3fef506 Compare June 7, 2024 11:19

viktormalik added 2 commits June 7, 2024 13:54

viktormalik force-pushed the codegen-expansion branch from 3fef506 to 2b30fa9 Compare June 7, 2024 11:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move probe expansion into codegen #3155

Move probe expansion into codegen #3155

viktormalik commented May 7, 2024 •

edited

viktormalik commented May 7, 2024

ajor left a comment

ajor May 15, 2024

viktormalik May 16, 2024

viktormalik commented May 16, 2024

viktormalik commented May 16, 2024

ajor left a comment

viktormalik commented May 23, 2024

ajor left a comment

viktormalik commented Jun 5, 2024

ttreyer Jun 5, 2024

viktormalik Jun 6, 2024

ttreyer Jun 5, 2024

viktormalik Jun 6, 2024

ajor left a comment

danobi left a comment

	// If we are doing a PATH lookup (ie not glob), we follow shell
	// behavior and take the first match.
	// Otherwise we keep the target with glob, it will be expanded later

Move probe expansion into codegen #3155

Are you sure you want to change the base?

Move probe expansion into codegen #3155

Conversation

viktormalik commented May 7, 2024 • edited

Checklist

viktormalik commented May 7, 2024

ajor left a comment

Choose a reason for hiding this comment

ajor May 15, 2024

Choose a reason for hiding this comment

viktormalik May 16, 2024

Choose a reason for hiding this comment

viktormalik commented May 16, 2024

viktormalik commented May 16, 2024

ajor left a comment

Choose a reason for hiding this comment

viktormalik commented May 23, 2024

ajor left a comment

Choose a reason for hiding this comment

viktormalik commented Jun 5, 2024

ttreyer Jun 5, 2024

Choose a reason for hiding this comment

viktormalik Jun 6, 2024

Choose a reason for hiding this comment

ttreyer Jun 5, 2024

Choose a reason for hiding this comment

viktormalik Jun 6, 2024

Choose a reason for hiding this comment

ajor left a comment

Choose a reason for hiding this comment

danobi left a comment

Choose a reason for hiding this comment

viktormalik commented May 7, 2024 •

edited