Excercises_GeorgMandl/BSP_3_A/stream_result.txt
g.mandl 14b51cb405 Upload files to "BSP_3_A"
The files needed to run this can be found via the exercise sheet.
2025-11-13 02:26:15 +01:00

-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 10000000 (elements), Offset = 0 (elements)
Memory per array = 76.3 MiB (= 0.1 GiB).
Total memory required = 228.9 MiB (= 0.2 GiB).
Each kernel will be executed 10 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Your clock granularity/precision appears to be 10 microseconds.
Each test below will take on the order of 19174 microseconds.
(= 1917 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function    Best Rate MB/s  Avg time     Min time     Max time
Copy:           10251.9     0.019098     0.015607     0.031235
Scale:          10242.5     0.024307     0.015621     0.031268
Add:            15376.9     0.020832     0.015608     0.031253
Triad:          15370.8     0.024548     0.015614     0.032512
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-013 on all three arrays
-------------------------------------------------------------
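
The figures in the log above can be cross-checked by hand. The following is a minimal Python sketch, not part of the STREAM distribution, that recomputes the memory footprint and the "Best Rate" column from the array size, the element size, and the minimum times printed in the table. It assumes the usual STREAM byte counting (Copy and Scale touch two arrays per iteration, Add and Triad touch three) and that MB/s means 10^6 bytes per second. The function and constant names are illustrative only. Because the log prints the minimum times rounded to six decimals, the recomputed rates can differ from the reported ones by a few tenths of a MB/s.

```python
# Values copied from the STREAM log above.
ARRAY_ELEMENTS = 10_000_000
BYTES_PER_ELEMENT = 8

# Assumption: number of arrays read or written per kernel iteration,
# following the usual STREAM byte counting.
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

# "Min time" column in seconds, as printed (rounded to 6 decimals).
MIN_TIME_S = {
    "Copy": 0.015607,
    "Scale": 0.015621,
    "Add": 0.015608,
    "Triad": 0.015614,
}


def memory_per_array_mib(elements=ARRAY_ELEMENTS, elem_bytes=BYTES_PER_ELEMENT):
    """Size of one array in MiB (2**20 bytes)."""
    return elements * elem_bytes / 2**20


def best_rate_mb_s(kernel):
    """Bandwidth in MB/s (10**6 bytes per second) from the minimum time."""
    bytes_moved = ARRAYS_TOUCHED[kernel] * BYTES_PER_ELEMENT * ARRAY_ELEMENTS
    return 1.0e-6 * bytes_moved / MIN_TIME_S[kernel]


if __name__ == "__main__":
    per_array = memory_per_array_mib()
    print(f"Memory per array: {per_array:.1f} MiB")
    print(f"Total (3 arrays): {3 * per_array:.1f} MiB")
    for kernel in ARRAYS_TOUCHED:
        print(f"{kernel + ':':<8}{best_rate_mb_s(kernel):10.1f} MB/s")
```

Under these assumptions, Add and Triad report roughly 1.5 times the rate of Copy and Scale at almost identical minimum times because they move three arrays per iteration instead of two.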