[PATCH v13 0/5] perf tools: Add inject --aslr feature, early maps loading, and decoupling fixes
From: Ian Rogers
Date: Fri Jun 05 2026 - 15:55:38 EST
This patch series introduces the new 'perf inject --aslr' feature to
remap virtual memory addresses or drop physical memory event leaks
when profile record data is shared between machines. Bundled with this
feature is a bug fix inside the core map tracking tool that hardens
perf session analysis against concurrent lookup data races.
Detailed Mechanism of MMAP Mapping and ASLR virtual Address Allocation:
The ASLR tool virtualizes the address space of the recorded processes by
intercepting MMAP and MMAP2 events to build a consistent translation
database, which is subsequently used to rewrite sample addresses.
It maintains two primary lookup databases using hash maps:
1. 'remap_addresses': Maps an original mapping key to its new remapped
base address. The key uses topological invariant coordinates:
(machine, dso, invariant). The invariant is computed as (start - pgoff)
for DSO-backed mappings. This invariant remains constant even when
perf's internal overlap-resolution splits a VMA into fragmented
pieces, ensuring split maps resolve consistently back to the same
remapped base.
2. 'top_addresses': Tracks the allocation state per process (machine, pid).
It maintains 'remapped_max' (the highest allocated address in the
virtualized space) and 'orig_last_end' (the end address of the last
processed original mapping).
For each MMAP/MMAP2 event:
- We look up the DSO and invariant key in 'remap_addresses'. If found, we
reuse the translation, preserving the offset within the mapping.
- If not found, we allocate a new remapped address space:
- If the new mapping is contiguous to the previous one in the original
address space (start == orig_last_end), we place it contiguously in
the remapped space. This is critical to preserve the contiguity of
mappings for downstream merging (e.g. symbols split by HugeTLB, or
anonymous .bss segments adjacent to initialized data).
- If not contiguous, we insert a 1-page gap (using page_size) from the
previous maximum allocated address to prevent accidental merging of
unrelated VMAs.
- The event's start address (and pgoff for kernel maps) is rewritten,
and the event is delegated to the output writer.
To remain strictly conservative and guarantee security, the tool
scrubs breakpoint addresses (bp_addr) from all synthesized stream
headers, completely drops PERF_RECORD_TEXT_POKE events to prevent
absolute immediate pointer operands leaks, and drops unsupported
complex payloads (such as user register stacks, raw tracepoints, and
hardware AUX tracing frames).
Verification is reinforced with shell test ('inject_aslr.sh').
Prerequisite Bug Fix (Patch 1). During development, a core map
indexing issue was identified and resolved to prevent concurrent
lookup data races during session analysis.
Changes since v12:
- Patch 2: Fixed potential NULL pointer dereference in
remap_addresses__hash() when handling unmapped memory events (key->dso
is NULL) under REFCNT_CHECKING.
- Patch 2: Dynamically detect machine architecture bitness via
perf_env__kernel_is_64_bit() to select appropriate kernel_space_start
boundaries, avoiding 64-bit address injection on 32-bit platforms.
Changes since v11:
- Patch 1: Fixed struct dso name accessor in maps.c by using
dso__name() instead of ->name.
- Patch 2: Fixed hash function in aslr.c to hash the underlying
dso pointer using RC_CHK_ACCESS to support reference count checking.
Changes since v10:
- Patch 1: Added explicit tracking array logic in maps__load_maps()
to correctly accumulate valid maps (skipping NULL entries after
failures) and safely return the exact populated count, resolving
out-of-bounds pointer iteration panics.
- Patch 3: Fixed endianness bug during cross-endian sample parsing
by passing evsel->needs_swap instead of false to __evsel__parse_sample
in aslr.c, ensuring correct 32-bit field byte unswapping for packed
fields. Refactored evsel__parse_sample to take a needs_swap argument
via __evsel__parse_sample.
- Patch 4: Fixed inject_aslr.sh exit code handling in trap functions
to capture and propagate the correct pipeline failure status code
instead of unconditionally returning success or failing the test.
Changes since v9:
- Patch 1: Added `-ENOMEM` error check inside
`maps__find_symbol_by_name()` and return `NULL` early. Added map
sorting state invalidation on early return in `maps__load_maps()`.
- Patch 2: Fixed encapsulation by using `thread__maps()` and
`thread__pid()` accessors in `aslr_tool__findnew_mapping()`. Added
`pr_warning_once` warning when raw auxtrace data is dropped.
- Patch 3: Fixed encapsulation by using `thread__maps()` and
`thread__pid()` accessors in `aslr_tool__remap_address()`. Wrapped
`evsel__parse_sample()` to temporarily disable `needs_swap` to avoid
branch stack endianness corruption on cross-endian files. Fixed ISO
C90 warning for declaration-after-statement for `orig_needs_swap`.
- Patch 4: Fixed duplicate cleanup by explicitly removing trap
handlers (`trap - EXIT TERM INT`) inside the `cleanup()` function.
- Patch 5: Fixed heap corruption by adding size bounds checking before
writing to `sample_regs_user` and `sample_regs_intr` fields. Added
missing register mask clearing logic for the `itrace` synthesis path
of `perf_event__repipe_attr()`.
Ian Rogers (5):
perf maps: Add maps__mutate_mapping
perf inject/aslr: Add ASLR tool infrastructure and MMAP tracking
perf inject/aslr: Implement sample address remapping
perf test: Add inject ASLR test
perf aslr: Strip sample registers
tools/perf/builtin-inject.c | 79 +-
tools/perf/tests/shell/inject_aslr.sh | 519 ++++++++++
tools/perf/util/Build | 1 +
tools/perf/util/aslr.c | 1268 +++++++++++++++++++++++++
tools/perf/util/aslr.h | 41 +
tools/perf/util/evsel.c | 6 +-
tools/perf/util/evsel.h | 10 +-
tools/perf/util/machine.c | 32 +-
tools/perf/util/maps.c | 149 ++-
tools/perf/util/maps.h | 3 +
tools/perf/util/symbol-elf.c | 41 +-
tools/perf/util/symbol.c | 17 +-
12 files changed, 2101 insertions(+), 65 deletions(-)
create mode 100755 tools/perf/tests/shell/inject_aslr.sh
create mode 100644 tools/perf/util/aslr.c
create mode 100644 tools/perf/util/aslr.h
--
2.54.0.1032.g2f8565e1d1-goog