Ex8 and minor improvements
This commit is contained in:
parent
2195a9db0a
commit
77bc8c6aa3
50 changed files with 214845 additions and 43 deletions
|
|
@ -19,8 +19,8 @@ Each kernel will be executed 20 times.
|
|||
will be used to compute the reported bandwidth.
|
||||
-------------------------------------------------------------
|
||||
Your clock granularity/precision appears to be 1 microseconds.
|
||||
Each test below will take on the order of 46252 microseconds.
|
||||
(= 46252 clock ticks)
|
||||
Each test below will take on the order of 59858 microseconds.
|
||||
(= 59858 clock ticks)
|
||||
Increase the size of the arrays if this shows that
|
||||
you are not getting at least 20 clock ticks per test.
|
||||
-------------------------------------------------------------
|
||||
|
|
@ -29,10 +29,10 @@ For best results, please be sure you know the
|
|||
precision of your system timer.
|
||||
-------------------------------------------------------------
|
||||
Function Best Rate MB/s Avg time Min time Max time
|
||||
Copy: 28478.6 0.047858 0.044946 0.054333
|
||||
Scale: 20551.4 0.066044 0.062283 0.077807
|
||||
Add: 22534.2 0.089671 0.085204 0.099586
|
||||
Triad: 22709.5 0.088864 0.084546 0.098536
|
||||
Copy: 23508.2 0.063228 0.054449 0.074427
|
||||
Scale: 18323.7 0.089940 0.069855 0.116932
|
||||
Add: 19762.6 0.276166 0.097153 3.054857
|
||||
Triad: 19559.9 0.123390 0.098160 0.156530
|
||||
-------------------------------------------------------------
|
||||
Solution Validates: avg error less than 1.000000e-13 on all three arrays
|
||||
-------------------------------------------------------------
|
||||
|
|
@ -42,19 +42,19 @@ Solution Validates: avg error less than 1.000000e-13 on all three arrays
|
|||
|
||||
Module Error RunTime MFLOPS
|
||||
(usec)
|
||||
1 4.0146e-13 0.0021 6622.7552
|
||||
2 -1.4166e-13 0.0006 12723.3419
|
||||
3 4.7184e-14 0.0027 6253.2599
|
||||
4 -1.2557e-13 0.0026 5758.6323
|
||||
5 -1.3800e-13 0.0051 5740.4851
|
||||
6 3.2380e-13 0.0051 5674.2511
|
||||
7 -8.4583e-11 0.0031 3827.0478
|
||||
8 3.4867e-13 0.0053 5610.0203
|
||||
1 4.0146e-13 0.0029 4831.3737
|
||||
2 -1.4166e-13 0.0006 11258.2969
|
||||
3 4.7184e-14 0.0031 5448.3769
|
||||
4 -1.2557e-13 0.0030 5042.5895
|
||||
5 -1.3800e-13 0.0060 4867.7339
|
||||
6 3.2380e-13 0.0054 5322.4399
|
||||
7 -8.4583e-11 0.0031 3907.7854
|
||||
8 3.4867e-13 0.0056 5323.6214
|
||||
|
||||
Iterations = 512000000
|
||||
NullTime (usec) = 0.0000
|
||||
MFLOPS(1) = 9507.3864
|
||||
MFLOPS(2) = 5042.7572
|
||||
MFLOPS(3) = 5597.4972
|
||||
MFLOPS(4) = 5766.1547
|
||||
MFLOPS(1) = 8348.0311
|
||||
MFLOPS(2) = 4650.8807
|
||||
MFLOPS(3) = 5016.3434
|
||||
MFLOPS(4) = 5297.2428
|
||||
|
||||
Loading…
Add table
Add a link
Reference in a new issue