Lines Matching +full:ip +full:- +full:core
1 perf-amd-ibs(1)
5 ----
6 perf-amd-ibs - Support for AMD Instruction-Based Sampling (IBS) with perf tool
9 --------
11 'perf record' -e ibs_op//
12 'perf record' -e ibs_fetch//
15 -----------
17 Instruction-Based Sampling (IBS) provides precise Instruction Pointer (IP)
20 execution (micro-op execution to be precise) with details like d-cache
21 hit/miss, d-TLB hit/miss, cache miss latency, load/store data source, branch
23 with details like i-cache hit/miss, i-TLB hit/miss, fetch latency etc. IBS is
24 per-smt-thread i.e. each SMT hardware thread contains standalone IBS units.
39 IBS VS. REGULAR CORE PMU
40 ------------------------
42 IBS gives samples with precise IP, i.e. the IP recorded with IBS sample has
43 no skid. Whereas the IP recorded by regular core PMU will have some skid
44 (sample was generated at IP X but perf would record it at IP X+n). Hence,
45 regular core PMU might not help for profiling with instruction level
47 question. On the other hand, regular core PMU has it's own advantages like
51 Three regular core PMU events are internally forwarded to IBS Op PMU when
54 -e cpu-cycles:p becomes -e ibs_op//
55 -e r076:p becomes -e ibs_op//
56 -e r0C1:p becomes -e ibs_op/cnt_ctl=1/
59 --------
64 System-wide profile, cycles event, sampling period: 100000
66 # perf record -e ibs_op// -c 100000 -a
68 Per-cpu profile (cpu10), cycles event, sampling period: 100000
70 # perf record -e ibs_op// -c 100000 -C 10
72 Per-cpu profile (cpu10), cycles event, sampling freq: 1000
74 # perf record -e ibs_op// -F 1000 -C 10
76 System-wide profile, uOps event, sampling period: 100000
78 # perf record -e ibs_op/cnt_ctl=1/ -c 100000 -a
82 # perf record -e ibs_op/cnt_ctl=1/ -c 100000 -a --raw-samples
84 System-wide profile, uOps event, sampling period: 100000, L3MissOnly (Zen4 onward)
86 # perf record -e ibs_op/cnt_ctl=1,l3missonly=1/ -c 100000 -a
90 # perf record -e ibs_op/cnt_ctl=1/ -c 100000 -p 1234
94 # perf record -e ibs_op/cnt_ctl=1/ -c 100000 -- ls
105 Raw dump of IBS registers when profiled with --raw-samples
107 # perf report -D
133 https://lore.kernel.org/r/20220921063638.2489-1-[email protected]
140 System-wide profile, fetch ops event, sampling period: 100000
142 # perf record -e ibs_fetch// -c 100000 -a
144 System-wide profile, fetch ops event, sampling period: 100000, Random enable
146 # perf record -e ibs_fetch/rand_en=1/ -c 100000 -a
155 ---------------------
161 # perf mem record -c 100000 -- make
167 # perf mem report -F mem,sample,snoop
177 Remote core, same node Any cache hit 1572 HitM
178 Remote core, same node Any cache hit 514 N/A
186 --------
188 linkperf:perf-record[1], linkperf:perf-script[1], linkperf:perf-report[1],
189 linkperf:perf-mem[1], linkperf:perf-c2c[1]