xref: /aosp_15_r20/external/clpeak/results/NVIDIA_CUDA/GeForce_GTX_960.log (revision 1cd03ba3888297bc945f2c84574e105e3ced3e34)
1
2Platform: NVIDIA CUDA
3  Device: GeForce GTX 960
4    Driver version  : 355.11 (Linux x64)
5    Compute units   : 8
6    Clock frequency : 1329 MHz
7
8    Global memory bandwidth (GBPS)
9      float   : 82.67
10      float2  : 85.63
11      float4  : 87.22
12      float8  : 81.16
13      float16 : 83.39
14
15    Single-precision compute (GFLOPS)
16      float   : 2550.71
17      float2  : 2747.97
18      float4  : 2793.35
19      float8  : 2728.88
20      float16 : 2760.22
21
22    Double-precision compute (GFLOPS)
23      double   : 89.67
24      double2  : 89.63
25      double4  : 89.46
26      double8  : 89.10
27      double16 : 88.42
28
29    Integer compute (GIOPS)
30      int   : 761.99
31      int2  : 803.24
32      int4  : 816.24
33      int8  : 815.58
34      int16 : 826.16
35
36    Transfer bandwidth (GBPS)
37      enqueueWriteBuffer         : 6.58
38      enqueueReadBuffer          : 6.56
39      enqueueMapBuffer(for read) : 6.27
40        memcpy from mapped ptr   : 7.07
41      enqueueUnmap(after write)  : 6.76
42        memcpy to mapped ptr     : 7.12
43
44    Kernel launch latency : 5.16 us
45
46