xref: /aosp_15_r20/external/clpeak/results/NVIDIA_CUDA/GeForce_GTX_660.log (revision 1cd03ba3888297bc945f2c84574e105e3ced3e34)
1
2Platform: NVIDIA CUDA
3  Device: GeForce GTX 660
4    Driver version : 331.20 (Linux x86)
5    Compute units  : 5
6
7    Global memory bandwidth (GBPS)
8      float   : 107.96
9      float2  : 111.36
10      float4  : 113.08
11      float8  : 57.77
12      float16 : 37.33
13
14    Single-precision compute (GFLOPS)
15      float   : 1412.18
16      float2  : 1862.79
17      float4  : 1785.61
18      float8  : 1832.08
19      float16 : 1784.82
20
21    Double-precision compute (GFLOPS)
22      double   : 89.72
23      double2  : 89.60
24      double4  : 89.42
25      double8  : 89.10
26      double16 : 88.38
27
28    Integer compute (GIOPS)
29      int   : 358.32
30      int2  : 358.40
31      int4  : 358.11
32      int8  : 358.62
33      int16 : 358.41
34
35    Transfer bandwidth (GBPS)
36      enqueueWriteBuffer         : 6.53
37      enqueueReadBuffer          : 6.58
38      enqueueMapBuffer(for read) : 2.07
39        memcpy from mapped ptr   : 10.12
40      enqueueUnmap(after write)  : 3.78
41        memcpy to mapped ptr     : 10.29
42
43    Kernel launch latency : 6.89 us
44
45