------------------------------------------------------------- STREAM version $Revision: 5.10 $ ------------------------------------------------------------- This system uses 8 bytes per array element. ------------------------------------------------------------- Array size = 10000000 (elements), Offset = 0 (elements) Memory per array = 76.3 MiB (= 0.1 GiB). Total memory required = 228.9 MiB (= 0.2 GiB). Each kernel will be executed 10 times. The *best* time for each kernel (excluding the first iteration) will be used to compute the reported bandwidth. ------------------------------------------------------------- Your clock granularity/precision appears to be 10 microseconds. Each test below will take on the order of 19174 microseconds. (= 1917 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ------------------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ------------------------------------------------------------- Function Best Rate MB/s Avg time Min time Max time Copy: 10251.9 0.019098 0.015607 0.031235 Scale: 10242.5 0.024307 0.015621 0.031268 Add: 15376.9 0.020832 0.015608 0.031253 Triad: 15370.8 0.024548 0.015614 0.032512 ------------------------------------------------------------- Solution Validates: avg error less than 1.000000e-013 on all three arrays -------------------------------------------------------------