1 2Platform: AMD Accelerated Parallel Processing 3 Device: Loveland 4 Driver version : 1113.2 (Linux x86) 5 Compute units : 2 6 7 Global memory bandwidth (GBPS) 8 float : 6.92 9 float2 : 7.16 10 float4 : 7.03 11 float8 : 3.62 12 float16 : 1.91 13 14 Single-precision compute (GFLOPS) 15 float : 15.74 16 float2 : 31.40 17 float4 : 62.36 18 float8 : 62.60 19 float16 : 63.09 20 21 No double precision support! Skipped 22 23 Integer compute (GIOPS) 24 int : 7.87 25 int2 : 15.72 26 int4 : 15.71 27 int8 : 15.70 28 int16 : 15.71 29 30 Transfer bandwidth (GBPS) 31 enqueueWriteBuffer : 2.94 32 enqueueReadBuffer : 1.93 33 enqueueMapBuffer(for read) : 1462.07 34 memcpy from mapped ptr : 1.91 35 enqueueUnmap(after write) : 263.46 36 memcpy to mapped ptr : 1.98 37 38 Kernel launch latency : 460.72 us 39 40 Device: AMD E-350 Processor 41 Driver version : 1113.2 (sse2) (Linux x86) 42 Compute units : 2 43 44 Global memory bandwidth (GBPS) 45 float : 2.41 46 float2 : 1.67 47 float4 : 2.40 48 float8 : 2.22 49 float16 : 2.19 50 51 Single-precision compute (GFLOPS) 52 float : 1.28 53 float2 : 1.28 54 float4 : 5.04 55 float8 : 5.03 56 float16 : 1.54 57 58 Double-precision compute (GFLOPS) 59 double : 0.91 60 double2 : 1.82 61 double4 : 1.82 62 double8 : 1.82 63 double16 : 0.58 64 65 Integer compute (GIOPS) 66 int : 1.59 67 int2 : 0.66 68 int4 : 0.73 69 int8 : 0.73 70 int16 : 0.75 71 72 Transfer bandwidth (GBPS) 73 enqueueWriteBuffer : 3.00 74 enqueueReadBuffer : 1.54 75 enqueueMapBuffer(for read) : 9304.52 76 memcpy from mapped ptr : 1.55 77 enqueueUnmap(after write) : 6804.45 78 memcpy to mapped ptr : 1.48 79 80 Kernel launch latency : 183.70 us 81 82