Re: [PATCH v6 bpf-next 1/3] perf: enable branch record for software events

From: Peter Zijlstra
Date: Fri Sep 10 2021 - 14:40:40 EST


On Fri, Sep 10, 2021 at 06:27:36PM +0000, Song Liu wrote:

> This works great and saves 3 entries! We have the following now:

Yay!

> ID: 0 from bpf_get_branch_snapshot+18 to intel_pmu_snapshot_branch_stack+0

is unavoidable, we need to end up in intel_pmu_snapshot_branch_stack()
eventually.

> ID: 1 from __brk_limit+477143934 to bpf_get_branch_snapshot+0

could be elided by having the JIT emit the call to
intel_pmu_snapshot_branch_stack directly, instead of laundering it
through that helper I suppose.

> ID: 2 from __brk_limit+477192263 to __brk_limit+477143880 # trampoline
> ID: 3 from __bpf_prog_enter+34 to __brk_limit+477192251

-ENOCLUE

> ID: 4 from migrate_disable+60 to __bpf_prog_enter+9
> ID: 5 from __bpf_prog_enter+4 to migrate_disable+0

I suppose we can reduce that to a single branch if we inline
migrate_disable() here; that thing unfortunately needs one branch
itself.

> ID: 6 from bpf_testmod_loop_test+20 to __bpf_prog_enter+0

And this is the first branch out of the test program, giving 7 entries
now, of which we can remove at least 2 more with a bit of elbow grease,
right?
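
For anyone wanting to track that count across revisions, here is a rough
sketch of the arithmetic, not anything from the patch itself. It parses the
snapshot lines quoted in this mail and counts the leading entries that are
spent before the first branch that stays inside the function under test.
The helper names (parse_entries, overhead_entries) are made up for
illustration, and the line format is assumed to be exactly what the
selftest printed above.

```python
import re

# Snapshot lines as quoted in this thread (ID, from symbol+off, to symbol+off).
SNAPSHOT = """\
ID: 0 from bpf_get_branch_snapshot+18 to intel_pmu_snapshot_branch_stack+0
ID: 1 from __brk_limit+477143934 to bpf_get_branch_snapshot+0
ID: 2 from __brk_limit+477192263 to __brk_limit+477143880 # trampoline
ID: 3 from __bpf_prog_enter+34 to __brk_limit+477192251
ID: 4 from migrate_disable+60 to __bpf_prog_enter+9
ID: 5 from __bpf_prog_enter+4 to migrate_disable+0
ID: 6 from bpf_testmod_loop_test+20 to __bpf_prog_enter+0
ID: 7 from bpf_testmod_loop_test+20 to bpf_testmod_loop_test+13
ID: 8 from bpf_testmod_loop_test+20 to bpf_testmod_loop_test+13
"""

# Assumed line format: "ID: <n> from <sym>+<off> to <sym>+<off>".
ENTRY_RE = re.compile(r"ID:\s*(\d+)\s+from\s+(\S+)\+(\d+)\s+to\s+(\S+)\+(\d+)")


def parse_entries(text):
    """Return a list of (id, from_symbol, to_symbol) tuples (hypothetical helper)."""
    entries = []
    for line in text.splitlines():
        m = ENTRY_RE.search(line)
        if m:
            entries.append((int(m.group(1)), m.group(2), m.group(4)))
    return entries


def overhead_entries(entries, func):
    """Count leading entries before the first branch that stays inside func."""
    count = 0
    for _, src, dst in entries:
        if src == func and dst == func:
            break
        count += 1
    return count


entries = parse_entries(SNAPSHOT)
print(overhead_entries(entries, "bpf_testmod_loop_test"))  # prints 7
```

Feeding it the output of a later revision would show whether the number of
entries burned on the way to the snapshot actually went down.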

> ID: 7 from bpf_testmod_loop_test+20 to bpf_testmod_loop_test+13
> ID: 8 from bpf_testmod_loop_test+20 to bpf_testmod_loop_test+13
>
> I will fold this in and send v7.

Excellent.