Lines Matching +full:pcie +full:- +full:root +full:- +full:port +full:- +full:1

9 * NVLink-C2C0
10 * NVLink-C2C1
12 * PCIE
15 ----------
19 PMUs are managed by a common driver "arm-cs-arch-pmu". This driver describes
29 -------
32 strongly-ordered (SO) PCIE write traffic to local/remote memory. Please see
37 see /sys/bus/event_source/devices/nvidia_scf_pmu_<socket-id>.
43 perf stat -a -e nvidia_scf_pmu_0/event=0x0/
45 * Count event id 0x0 in socket 1::
47 perf stat -a -e nvidia_scf_pmu_1/event=0x0/
49 NVLink-C2C0 PMU
50 --------------------
52 The NVLink-C2C0 PMU monitors incoming traffic from a GPU/CPU connected with
53 NVLink-C2C (Chip-2-Chip) interconnect. The type of traffic captured by this PMU
63 PCIE device of the remote SoC.
69 see /sys/bus/event_source/devices/nvidia_nvlink_c2c0_pmu_<socket-id>.
75 perf stat -a -e nvidia_nvlink_c2c0_pmu_0/event=0x0/
77 * Count event id 0x0 from the GPU/CPU connected with socket 1::
79 perf stat -a -e nvidia_nvlink_c2c0_pmu_1/event=0x0/
83 perf stat -a -e nvidia_nvlink_c2c0_pmu_2/event=0x0/
87 perf stat -a -e nvidia_nvlink_c2c0_pmu_3/event=0x0/
89 The NVLink-C2C has two ports that can be connected to one GPU (occupying both
90 ports) or to two GPUs (one GPU per port). The user can use "port" bitmap
91 parameter to select the port(s) to monitor. Each bit represents the port number,
92 e.g. "port=0x1" corresponds to port 0 and "port=0x3" is for port 0 and 1. The
95 Example for port filtering:
97 * Count event id 0x0 from the GPU connected with socket 0 on port 0::
99 perf stat -a -e nvidia_nvlink_c2c0_pmu_0/event=0x0,port=0x1/
101 * Count event id 0x0 from the GPUs connected with socket 0 on port 0 and port 1::
103 perf stat -a -e nvidia_nvlink_c2c0_pmu_0/event=0x0,port=0x3/
105 NVLink-C2C1 PMU
106 -------------------
108 The NVLink-C2C1 PMU monitors incoming traffic from a GPU connected with
109 NVLink-C2C (Chip-2-Chip) interconnect. This PMU captures untranslated GPU
110 traffic, in contrast with NvLink-C2C0 PMU that captures ATS translated traffic.
115 see /sys/bus/event_source/devices/nvidia_nvlink_c2c1_pmu_<socket-id>.
121 perf stat -a -e nvidia_nvlink_c2c1_pmu_0/event=0x0/
123 * Count event id 0x0 from the GPU connected with socket 1::
125 perf stat -a -e nvidia_nvlink_c2c1_pmu_1/event=0x0/
129 perf stat -a -e nvidia_nvlink_c2c1_pmu_2/event=0x0/
133 perf stat -a -e nvidia_nvlink_c2c1_pmu_3/event=0x0/
135 The NVLink-C2C has two ports that can be connected to one GPU (occupying both
136 ports) or to two GPUs (one GPU per port). The user can use "port" bitmap
137 parameter to select the port(s) to monitor. Each bit represents the port number,
138 e.g. "port=0x1" corresponds to port 0 and "port=0x3" is for port 0 and 1. The
141 Example for port filtering:
143 * Count event id 0x0 from the GPU connected with socket 0 on port 0::
145 perf stat -a -e nvidia_nvlink_c2c1_pmu_0/event=0x0,port=0x1/
147 * Count event id 0x0 from the GPUs connected with socket 0 on port 0 and port 1::
149 perf stat -a -e nvidia_nvlink_c2c1_pmu_0/event=0x0,port=0x3/
152 ---------------
154 The CNVLink PMU monitors traffic from GPU and PCIE device on remote sockets
155 to local memory. For PCIE traffic, this PMU captures read and relaxed ordered
160 see /sys/bus/event_source/devices/nvidia_cnvlink_pmu_<socket-id>.
165 socket 1 to 3. The PMU will monitor all remote sockets by default if not
167 /sys/bus/event_source/devices/nvidia_cnvlink_pmu_<socket-id>/format/rem_socket
172 traffic from remote GPU and PCIE devices.
176 * Count event id 0x0 for the traffic from remote socket 1, 2, and 3 to socket 0::
178 perf stat -a -e nvidia_cnvlink_pmu_0/event=0x0,rem_socket=0xE/
180 * Count event id 0x0 for the traffic from remote socket 0, 2, and 3 to socket 1::
182 perf stat -a -e nvidia_cnvlink_pmu_1/event=0x0,rem_socket=0xD/
184 * Count event id 0x0 for the traffic from remote socket 0, 1, and 3 to socket 2::
186 perf stat -a -e nvidia_cnvlink_pmu_2/event=0x0,rem_socket=0xB/
188 * Count event id 0x0 for the traffic from remote socket 0, 1, and 2 to socket 3::
190 perf stat -a -e nvidia_cnvlink_pmu_3/event=0x0,rem_socket=0x7/
193 PCIE PMU
194 ------------
196 The PCIE PMU monitors all read/write traffic from PCIE root ports to
201 see /sys/bus/event_source/devices/nvidia_pcie_pmu_<socket-id>.
203 Each SoC socket can support multiple root ports. The user can use
204 "root_port" bitmap parameter to select the port(s) to monitor, i.e.
205 "root_port=0xF" corresponds to root port 0 to 3. The PMU will monitor all root
207 /sys/bus/event_source/devices/nvidia_pcie_pmu_<socket-id>/format/root_port
212 * Count event id 0x0 from root port 0 and 1 of socket 0::
214 perf stat -a -e nvidia_pcie_pmu_0/event=0x0,root_port=0x3/
216 * Count event id 0x0 from root port 0 and 1 of socket 1::
218 perf stat -a -e nvidia_pcie_pmu_1/event=0x0,root_port=0x3/
223 ----------------
232 * SOCKET-A * * SOCKET-B *
235 * : PCIE : * * : PCIE : *
241 * : GPU :<--NVLink-->: Grace :<---CNVLink--->: Grace :<--NVLink-->: GPU : *
256 | Following table contains traffic coverage of Grace SoC PMU in socket-A:
260 +--------------+-------+-----------+-----------+-----+----------+----------+
262 + +-------+-----------+-----------+-----+----------+----------+
263 | Destination | |GPU ATS |GPU Not-ATS| | Socket-B | Socket-B |
267 | Local | PCIE |NVLink-C2C0|NVLink-C2C1| SCF | SCF PMU | CNVLink |
269 +--------------+-------+-----------+-----------+-----+----------+----------+
270 | Local GMEM | PCIE | N/A |NVLink-C2C1| SCF | SCF PMU | CNVLink |
272 +--------------+-------+-----------+-----------+-----+----------+----------+
273 | Remote | PCIE |NVLink-C2C0|NVLink-C2C1| SCF | | |
276 +--------------+-------+-----------+-----------+-----+----------+----------+
277 | Remote GMEM | PCIE |NVLink-C2C0|NVLink-C2C1| SCF | | |
279 +--------------+-------+-----------+-----------+-----+----------+----------+
289 * SOCKET-A * * SOCKET-B *
292 * : PCIE : * * : PCIE : *
298 * : Grace :<--------NVLink------->: Grace : *
313 | Following table contains traffic coverage of Grace SoC PMU in socket-A:
317 +-----------------+-----------+---------+----------+-------------+
319 + +-----------+---------+----------+-------------+
320 | Destination | | | Socket-B | Socket-B |
324 | Local | PCIE PMU | SCF PMU | SCF PMU | NVLink-C2C0 |
326 +-----------------+-----------+---------+----------+-------------+
328 | SYSRAM/CMEM | PCIE PMU | SCF PMU | N/A | N/A |
329 | over NVLink-C2C | | | | |
330 +-----------------+-----------+---------+----------+-------------+