diff options
| author | Andrii Nakryiko <andrii@kernel.org> | 2021-12-14 11:59:03 -0800 | 
|---|---|---|
| committer | Daniel Borkmann <daniel@iogearbox.net> | 2021-12-14 22:16:45 +0100 | 
| commit | e542f2c4cd16d49392abf3349341d58153d3c603 (patch) | |
| tree | 92d3513488c9c10079f5bff3db8541d2b194e02a /tools/lib/bpf/libbpf_legacy.h | |
| parent | 9fc205b413b3f3e9502fa92151fba63b91230454 (diff) | |
libbpf: Auto-bump RLIMIT_MEMLOCK if kernel needs it for BPF
The need to increase RLIMIT_MEMLOCK to do anything useful with BPF is
one of the first extremely frustrating gotchas that all new BPF users go
through and in some cases have to learn it a very hard way.
Luckily, starting with upstream Linux kernel version 5.11, BPF subsystem
dropped the dependency on memlock and uses memcg-based memory accounting
instead. Unfortunately, detecting memcg-based BPF memory accounting is
far from trivial (as can be evidenced by this patch), so in practice
most BPF applications still do unconditional RLIMIT_MEMLOCK increase.
As we move towards libbpf 1.0, it would be good to allow users to forget
about RLIMIT_MEMLOCK vs memcg and let libbpf do the sensible adjustment
automatically. This patch paves the way forward in this matter. Libbpf
will do feature detection of memcg-based accounting, and if detected,
will do nothing. But if the kernel is too old, just like BCC, libbpf
will automatically increase RLIMIT_MEMLOCK on behalf of user
application ([0]).
As this is technically a breaking change, during the transition period
applications have to opt into libbpf 1.0 mode by setting
LIBBPF_STRICT_AUTO_RLIMIT_MEMLOCK bit when calling
libbpf_set_strict_mode().
Libbpf allows to control the exact amount of set RLIMIT_MEMLOCK limit
with libbpf_set_memlock_rlim_max() API. Passing 0 will make libbpf do
nothing with RLIMIT_MEMLOCK. libbpf_set_memlock_rlim_max() has to be
called before the first bpf_prog_load(), bpf_btf_load(), or
bpf_object__load() call, otherwise it has no effect and will return
-EBUSY.
  [0] Closes: https://github.com/libbpf/libbpf/issues/369
Signed-off-by: Andrii Nakryiko <andrii@kernel.org>
Signed-off-by: Daniel Borkmann <daniel@iogearbox.net>
Link: https://lore.kernel.org/bpf/20211214195904.1785155-2-andrii@kernel.org
Diffstat (limited to 'tools/lib/bpf/libbpf_legacy.h')
| -rw-r--r-- | tools/lib/bpf/libbpf_legacy.h | 12 | 
1 files changed, 11 insertions, 1 deletions
| diff --git a/tools/lib/bpf/libbpf_legacy.h b/tools/lib/bpf/libbpf_legacy.h index bb03c568af7b..79131f761a27 100644 --- a/tools/lib/bpf/libbpf_legacy.h +++ b/tools/lib/bpf/libbpf_legacy.h @@ -45,7 +45,6 @@ enum libbpf_strict_mode {  	 * (positive) error code.  	 */  	LIBBPF_STRICT_DIRECT_ERRS = 0x02, -  	/*  	 * Enforce strict BPF program section (SEC()) names.  	 * E.g., while prefiously SEC("xdp_whatever") or SEC("perf_event_blah") were @@ -63,6 +62,17 @@ enum libbpf_strict_mode {  	 * Clients can maintain it on their own if it is valuable for them.  	 */  	LIBBPF_STRICT_NO_OBJECT_LIST = 0x08, +	/* +	 * Automatically bump RLIMIT_MEMLOCK using setrlimit() before the +	 * first BPF program or map creation operation. This is done only if +	 * kernel is too old to support memcg-based memory accounting for BPF +	 * subsystem. By default, RLIMIT_MEMLOCK limit is set to RLIM_INFINITY, +	 * but it can be overriden with libbpf_set_memlock_rlim_max() API. +	 * Note that libbpf_set_memlock_rlim_max() needs to be called before +	 * the very first bpf_prog_load(), bpf_map_create() or bpf_object__load() +	 * operation. +	 */ +	LIBBPF_STRICT_AUTO_RLIMIT_MEMLOCK = 0x10,  	__LIBBPF_STRICT_LAST,  }; | 
