60 lines
2.6 KiB
Text
60 lines
2.6 KiB
Text
rm -f *.exe *.o
|
|
gcc -O3 -c -o mysecond.o mysecond.c
|
|
gcc -O3 -c mysecond.c
|
|
gfortran -O3 -DSTREAM_ARRAY_SIZE=80000000 -DNTIMES=20 -c stream.f
|
|
gfortran -O3 stream.o mysecond.o -o stream_f.exe
|
|
gcc -O3 -DSTREAM_ARRAY_SIZE=80000000 -DNTIMES=20 stream.c -o stream_c.exe
|
|
gcc -O3 -DUNIX flops.c -o flops.exe
|
|
./stream_c.exe
|
|
-------------------------------------------------------------
|
|
STREAM version $Revision: 5.10 $
|
|
-------------------------------------------------------------
|
|
This system uses 8 bytes per array element.
|
|
-------------------------------------------------------------
|
|
Array size = 80000000 (elements), Offset = 0 (elements)
|
|
Memory per array = 610.4 MiB (= 0.6 GiB).
|
|
Total memory required = 1831.1 MiB (= 1.8 GiB).
|
|
Each kernel will be executed 20 times.
|
|
The *best* time for each kernel (excluding the first iteration)
|
|
will be used to compute the reported bandwidth.
|
|
-------------------------------------------------------------
|
|
Your clock granularity/precision appears to be 1 microseconds.
|
|
Each test below will take on the order of 59858 microseconds.
|
|
(= 59858 clock ticks)
|
|
Increase the size of the arrays if this shows that
|
|
you are not getting at least 20 clock ticks per test.
|
|
-------------------------------------------------------------
|
|
WARNING -- The above is only a rough guideline.
|
|
For best results, please be sure you know the
|
|
precision of your system timer.
|
|
-------------------------------------------------------------
|
|
Function Best Rate MB/s Avg time Min time Max time
|
|
Copy: 23508.2 0.063228 0.054449 0.074427
|
|
Scale: 18323.7 0.089940 0.069855 0.116932
|
|
Add: 19762.6 0.276166 0.097153 3.054857
|
|
Triad: 19559.9 0.123390 0.098160 0.156530
|
|
-------------------------------------------------------------
|
|
Solution Validates: avg error less than 1.000000e-13 on all three arrays
|
|
-------------------------------------------------------------
|
|
./flops.exe
|
|
|
|
FLOPS C Program (Double Precision), V2.0 18 Dec 1992
|
|
|
|
Module Error RunTime MFLOPS
|
|
(usec)
|
|
1 4.0146e-13 0.0029 4831.3737
|
|
2 -1.4166e-13 0.0006 11258.2969
|
|
3 4.7184e-14 0.0031 5448.3769
|
|
4 -1.2557e-13 0.0030 5042.5895
|
|
5 -1.3800e-13 0.0060 4867.7339
|
|
6 3.2380e-13 0.0054 5322.4399
|
|
7 -8.4583e-11 0.0031 3907.7854
|
|
8 3.4867e-13 0.0056 5323.6214
|
|
|
|
Iterations = 512000000
|
|
NullTime (usec) = 0.0000
|
|
MFLOPS(1) = 8348.0311
|
|
MFLOPS(2) = 4650.8807
|
|
MFLOPS(3) = 5016.3434
|
|
MFLOPS(4) = 5297.2428
|
|
|