Jump to content

OpenCL Benchmark - CPU vs GPU / DO NOT USE ANYMORE !


mitch_de
 Share

100 posts in this topic

Recommended Posts

My test

*****

 

 

 

Last login: Wed Aug 26 16:57:16 on console

/Users/peterdavidanderson/Desktop/OpenCLBench_as_terminal_tool/OpenCL2_Bench_V020 ; exit;

noname:~ peterdavidanderson$ /Users/peterdavidanderson/Desktop/OpenCLBench_as_terminal_tool/OpenCL2_Bench_V020 ; exit;

...........................................................

.................. OpenCL Bench V 0.15 by mitch ...........

...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ......

....... .......

........My test code (simple adds) is cpu friedly..........

.more gpu friedly+complexer code (raytracing/video encod.).

... may give much more speed advantage - at least on C2Ds .

...........................................................

 

....CL_DEVICE_NAME: Intel® Xeon® CPU 5150 @ 2.66GHz .....

CL_DEVICE_VENDOR: Intel

CL_DEVICE_MAX_CLOCK_FREQUENCY: 2660 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 4

Now computing - please be patient....

time used: 16.817684

Number of elements computed: 2097152

 

....CL_DEVICE_NAME: GeForce 8800 GT .....

CL_DEVICE_VENDOR: NVIDIA

CL_DEVICE_MAX_CLOCK_FREQUENCY: 1500 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 112

Now computing - please be patient....

time used: 2.608059

Number of elements computed: 2097152

Now checking if results are valid - please be patient....

:( Validate results test passed - GPU=CPU :)

logout

 

[Prozess beendet]

Link to comment
Share on other sites

cpu + 2xgpu

 

...........................................................

.................. OpenCL Bench V 0.15 by mitch ...........

...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ......

....... .......

........My test code (simple adds) is cpu friedly..........

.more gpu friedly+complexer code (raytracing/video encod.).

... may give much more speed advantage - at least on C2Ds .

...........................................................

 

....CL_DEVICE_NAME: Intel® Core2 Quad CPU @ 2.40GHz .....

CL_DEVICE_VENDOR: Intel

CL_DEVICE_MAX_CLOCK_FREQUENCY: 2400 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 4

Now computing - please be patient....

time used: 28.956915

Number of elements computed: 2097152

 

....CL_DEVICE_NAME: GeForce 9600 GT .....

CL_DEVICE_VENDOR: NVIDIA

CL_DEVICE_MAX_CLOCK_FREQUENCY: 1750 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 64

Now computing - please be patient....

time used: 2.694709

Number of elements computed: 2097152

 

....CL_DEVICE_NAME: GeForce 9600 GT .....

CL_DEVICE_VENDOR: NVIDIA

CL_DEVICE_MAX_CLOCK_FREQUENCY: 1750 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 64

Now computing - please be patient....

time used: 2.797374

Number of elements computed: 2097152

Now checking if results are valid - please be patient....

:hysterical: Validate results test passed - GPU=CPU ;)

logout

Link to comment
Share on other sites

THANKS !

Question: Do you have 2 indentical GPUs´?

Also (will not change the times) always use the lastest BUILD, ist V020.

Has much error handling code for ATI users (NVIDIAs seems to run without errors so far :) )

Link to comment
Share on other sites

mbp/late 2008 result:

....CL_DEVICE_NAME: Intel(R) Core(TM)2 Duo CPU	 P8600  @ 2.40GHz .....
CL_DEVICE_VENDOR: Intel
CL_DEVICE_MAX_CLOCK_FREQUENCY: 2400 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 2
Now computing - please be patient....
time used: 56.190952
Number of elements computed: 2097152

....CL_DEVICE_NAME: GeForce 9600M GT .....
CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1250 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 32
Now computing - please be patient....
time used: 10.169043
Number of elements computed: 2097152

....CL_DEVICE_NAME: GeForce 9600M GT .....
CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1250 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 32
Now computing - please be patient....
time used: 10.120525
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:) Validate results test passed - GPU=CPU :)

 

seems to be a bug, you are testing the same gpu twice (or just printing out the info of the first gpu device twice?). the second gpu should be a 9400M.

 

This program seems to multi-thread very well according to SL's CPU Usage monitor.

Thanks for that detail !

I think the 10.6 changes "in the deep" will exspecially use much Cores better than 10.5 - even without special

source coding changes. But recompiling source with newest Xcode & using 10.6 dev framework needed , i think.

thats an opencl feature, or the purpose of opencl :) scale a small program/kernel well to many cores - be it cpu or gpu.

Link to comment
Share on other sites

Upgrade from Open CL Bench V 020.

 

...........................................................
.................. OpenCL Bench V 0.15 by mitch ...........
...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ......
.......											 .......
........My test code (simple adds) is cpu friedly..........
.more gpu friedly+complexer code (raytracing/video encod.).
... may give much more speed advantage - at least on C2Ds .
...........................................................

....CL_DEVICE_NAME: Pentium(R) Dual-Core  CPU	  E5200  @ 2.50GHz .....
CL_DEVICE_VENDOR: Intel
CL_DEVICE_MAX_CLOCK_FREQUENCY: 3129 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 2
Now computing - please be patient....
time used: 28.777699
Number of elements computed: 2097152

....CL_DEVICE_NAME: GeForce 8800 GT .....
CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1600 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 112
Now computing - please be patient....
time used: 2.618950
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:) Validate results test passed - GPU=CPU :)

Link to comment
Share on other sites

./OpenCL2_Bench_V020

...........................................................

.................. OpenCL Bench V 0.15 by mitch ...........

...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ......

....... .......

........My test code (simple adds) is cpu friedly..........

.more gpu friedly+complexer code (raytracing/video encod.).

... may give much more speed advantage - at least on C2Ds .

...........................................................

 

....CL_DEVICE_NAME: Intel� Coreâ„¢2 Quad CPU Q9450 @ 2.66GHz .....

CL_DEVICE_VENDOR: Intel

CL_DEVICE_MAX_CLOCK_FREQUENCY: 3072 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 4

Now computing - please be patient....

time used: 14.658403

Number of elements computed: 2097152

 

....CL_DEVICE_NAME: Radeon HD 4870 .....

CL_DEVICE_VENDOR: AMD

CL_DEVICE_MAX_CLOCK_FREQUENCY: 750 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 4

Now computing - please be patient....

///here gui freezes immedeatly

time used: 27.399342

Number of elements computed: 2097152

Now checking if results are valid - please be patient....

:) Validate results test passed - GPU=CPU :)

 

 

x2000.kexts dumps ** GPU Debug Info ** to dmesg

 

maybe its too much loops and so too much memory used by arrays, decreased loops number to 1000(edited source inside bin), now no crash.

Link to comment
Share on other sites

mbp/late 2008 result:

....CL_DEVICE_NAME: Intel(R) Core(TM)2 Duo CPU	 P8600  @ 2.40GHz .....
CL_DEVICE_VENDOR: Intel
CL_DEVICE_MAX_CLOCK_FREQUENCY: 2400 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 2
Now computing - please be patient....
time used: 56.190952
Number of elements computed: 2097152

....CL_DEVICE_NAME: GeForce 9600M GT .....
CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1250 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 32
Now computing - please be patient....
time used: 10.169043
Number of elements computed: 2097152

....CL_DEVICE_NAME: GeForce 9600M GT .....
CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1250 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 32
Now computing - please be patient....
time used: 10.120525
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:) Validate results test passed - GPU=CPU :)

 

seems to be a bug, you are testing the same gpu twice (or just printing out the info of the first gpu device twice?). the second gpu should be a 9400M.

 

Thanks.

I will fix that bug soon.

A workaround for that bug:

Please post result again after you disabled the 9600M GT ( so 9400M is the alone GPU).

The 10 sec results is for the 9600MGT - 9400M will run slower.

Link to comment
Share on other sites

This is my result:

...........................................................
.................. OpenCL Bench V 0.15 by mitch ...........
...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ......
.......											 .......
........My test code (simple adds) is cpu friedly..........
.more gpu friedly+complexer code (raytracing/video encod.).
... may give much more speed advantage - at least on C2Ds .
...........................................................

....CL_DEVICE_NAME: Intel(R) Core(TM)2 Quad CPU	Q9550  @ 2.83GHz .....
CL_DEVICE_VENDOR: Intel
CL_DEVICE_MAX_CLOCK_FREQUENCY: 2836 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 4
Now computing - please be patient....
time used: 15.836717
Number of elements computed: 2097152

....CL_DEVICE_NAME: GeForce 9600 GT .....
CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1625 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 64
Now computing - please be patient....
time used: 2.700367
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:) Validate results test passed - GPU=CPU :)

Link to comment
Share on other sites

./OpenCL2_Bench_V020

...........................................................

.................. OpenCL Bench V 0.15 by mitch ...........

...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ......

....... .......

........My test code (simple adds) is cpu friedly..........

.more gpu friedly+complexer code (raytracing/video encod.).

... may give much more speed advantage - at least on C2Ds .

...........................................................

 

....CL_DEVICE_NAME: Intel� Core�„�2 Quad CPU Q9450 @ 2.66GHz .....

CL_DEVICE_VENDOR: Intel

CL_DEVICE_MAX_CLOCK_FREQUENCY: 3072 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 4

Now computing - please be patient....

time used: 14.658403

Number of elements computed: 2097152

 

....CL_DEVICE_NAME: Radeon HD 4870 .....

CL_DEVICE_VENDOR: AMD

CL_DEVICE_MAX_CLOCK_FREQUENCY: 750 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 4

Now computing - please be patient....

///here gui freezes immedeatly

time used: 27.399342

Number of elements computed: 2097152

Now checking if results are valid - please be patient....

:D Validate results test passed - GPU=CPU :D

 

 

x2000.kexts dumps ** GPU Debug Info ** to dmesg

 

maybe its too much loops and so too much memory used by arrays, decreased loops number to 1000(edited source inside bin), now no crash.

 

 

Thanks, i also thougt about this memory problem.

Will compile an ATI_debug version soon and post it below other DL Link.

Sure, if i will decrease the loop down from 5000 to 1000, the time results of very fast GPUs like GTX 285 will also decrease from 0.8 down to 0,0xy :police:

I am working on an other solution, which does more complex work but not in such an huge loop.

Link to comment
Share on other sites

THANKS !

Question: Do you have 2 indentical GPUs´?

Also (will not change the times) always use the lastest BUILD, ist V020.

Has much error handling code for ATI users (NVIDIAs seems to run without errors so far :thumbsup_anim: )

 

yep 2 palit nvidia 9600 gt on 2 pciex x16 ports

Link to comment
Share on other sites

Doesn't seem to test my setup correctly.

 

CL_DEVICE_NAME: Intel® Xeon® CPU X5365 @ 3.00GHz

CL_DEVICE_VENDOR: Intel

Now computing - please be patient....

time used: 7.710562

Number of elements computed: 2097152

CL_DEVICE_NAME: GeForce 8800 GT

CL_DEVICE_VENDOR: NVIDIA

Now computing - please be patient....

time used: 2.492461

Number of elements computed: 2097152

CL_DEVICE_NAME: GeForce 8800 GT

CL_DEVICE_VENDOR: NVIDIA

Now computing - please be patient....

time used: 2.489143

Number of elements computed: 2097152

Now checking if results are valid - please be patient....

:thumbsup_anim: Validate results test passed - GPU=CPU :D

 

 

 

I have an 8800GT in Slot 3 working as the helper card to a GTX260. Looks like it's testing the 8800GT twice.

Link to comment
Share on other sites

Doesn't seem to test my setup correctly.

 

 

I have an 8800GT in Slot 3 working as the helper card to a GTX260. Looks like it's testing the 8800GT twice.

 

 

NEW VERSION on the road - DL V025, lots of changes (and hopefully fixed ATI + > 2 gpu tests)

Link to comment
Share on other sites

BINGO

 

 

Number of OpenCL devices found: 3

OpenCL Device # 0 = GeForce 8800 GT

Device 0 is an: GPU with max. 1500 MHz and 112 units/cores

Now computing - please be patient....

time used: 0.683 seconds

 

OpenCL Device # 1 = GeForce GTX 260

Device 1 is an: GPU with max. 1400 MHz and 216 units/cores

Now computing - please be patient....

time used: 0.365 seconds

 

OpenCL Device # 2 = Intel® Xeon® CPU X5365 @ 3.00GHz

Device 2 is an: CPU with max. 3000 MHz and 8 units/cores

Now computing - please be patient....

time used: 3.094 seconds

 

Now checking if results are valid - please be patient....

:thumbsup_anim: Validate test passed - GPU results=CPU results :D

Link to comment
Share on other sites

BINGO

 

 

Number of OpenCL devices found: 3

OpenCL Device # 0 = GeForce 8800 GT

Device 0 is an: GPU with max. 1500 MHz and 112 units/cores

Now computing - please be patient....

time used: 0.683 seconds

 

OpenCL Device # 1 = GeForce GTX 260

Device 1 is an: GPU with max. 1400 MHz and 216 units/cores

Now computing - please be patient....

time used: 0.365 seconds

 

OpenCL Device # 2 = Intel® Xeon® CPU X5365 @ 3.00GHz

Device 2 is an: CPU with max. 3000 MHz and 8 units/cores

Now computing - please be patient....

time used: 3.094 seconds

 

Now checking if results are valid - please be patient....

:) Validate test passed - GPU results=CPU results :)

 

YEAH !

Hope also ATI users will not get an freezed system anymore - their GPUs got overloaded with old code - NVIDIAs not :star_smile:

 

Remember : the V025 time used results of V025 cant be compared 1:1 with the old version ones.

Thats because of code changes for ATI users :wacko:

Link to comment
Share on other sites

Here are my scores from V0.25 script:

 

...........................................................

.................. OpenCL Bench V 0.25 by mitch ...........

...... C2D 3GHz = 12 sec vs Nvidia 9600GT = 0,93 sec ......

... time results are not comparable to older version! .....

...........................................................

 

Number of OpenCL devices found: 2

OpenCL Device # 0 = GeForce GTX 285

Device 0 is an: GPU with max. 1584 MHz and 240 units/cores

Now computing - please be patient....

time used: 0.231 seconds

 

OpenCL Device # 1 = Intel® Core i7 CPU 920 @ 2.67GHz

Device 1 is an: CPU with max. 4280 MHz and 8 units/cores

Now computing - please be patient....

time used: 1.296 seconds

 

Now checking if results are valid - please be patient....

:rolleyes: Validate test passed - GPU results=CPU results :)

Link to comment
Share on other sites

...........................................................
.................. OpenCL Bench V 0.25 by mitch ...........
...... C2D 3GHz = 12 sec vs Nvidia 9600GT = 0,93 sec ......
... time results are not comparable to older version! .....
...........................................................

Number of OpenCL devices found: 3
OpenCL Device # 0 = Radeon HD 4870
Device 0 is an: GPU with max. 750 MHz and 4 units/cores 
Now computing - please be patient....
time used:  4.126 seconds

OpenCL Device # 1 = GeForce GT 120
Device 1 is an: GPU with max. 1400 MHz and 32 units/cores 
Error: clBuildProgram for device # 1 
ERROR NUMBER = -11

Link to comment
Share on other sites

This just shows how SAD the ATI Drivers currently are!!! OUCH!!!!

 

 

_tool-1/OpenCL2_Bench_V025 ; exit;

...........................................................

.................. OpenCL Bench V 0.25 by mitch ...........

...... C2D 3GHz = 12 sec vs Nvidia 9600GT = 0,93 sec ......

... time results are not comparable to older version! .....

...........................................................

 

Number of OpenCL devices found: 2

OpenCL Device # 0 = Radeon HD 4870

Device 0 is an: GPU with max. 750 MHz and 4 units/cores

Now computing - please be patient....

time used: 4.065 seconds

 

OpenCL Device # 1 = Intel® Xeon® CPU 5160 @ 3.00GHz

Device 1 is an: CPU with max. 3000 MHz and 4 units/cores

Now computing - please be patient....

time used: 6.079 seconds

 

Now checking if results are valid - please be patient....

:thumbsup_anim: Validate test passed - GPU results=CPU results :bag:

logout

 

[Process completed]

Link to comment
Share on other sites

tool-1/OpenCL2_Bench_V025; exit;
...........................................................
.................. OpenCL Bench V 0.25 by mitch ...........
...... C2D 3GHz = 12 sec vs Nvidia 9600GT = 0,93 sec ......
... time results are not comparable to older version! .....
...........................................................

Number of OpenCL devices found: 2
OpenCL Device # 0 = GeForce GTX 260
Device 0 is an: GPU with max. 1242 MHz and 192 units/cores 
Now computing - please be patient....
time used:  0.357 seconds

OpenCL Device # 1 = Intel(R) Core(TM)2 Duo CPU	 E8400  @ 3.00GHz
Device 1 is an: CPU with max. 3600 MHz and 2 units/cores 
Now computing - please be patient....
time used: 10.433 seconds

Now checking if results are valid - please be patient....
:) Validate test passed - GPU results=CPU results :) 
logout

Link to comment
Share on other sites

Hello mitch, can you make available the source code? Thanks!! :P

 

 

My results:

Number of OpenCL devices found: 3

OpenCL Device # 0 = GeForce 9600M GT

Device 0 is an: GPU with max. 1250 MHz and 32 units/cores

Now computing - please be patient....

time used: 2.798 seconds

 

OpenCL Device # 1 = GeForce 9400M

Device 1 is an: GPU with max. 1100 MHz and 16 units/cores

Now computing - please be patient....

time used: 9.549 seconds

 

OpenCL Device # 2 = Intel® Core2 Duo CPU P8600 @ 2.40GHz

Device 2 is an: CPU with max. 2400 MHz and 2 units/cores

Now computing - please be patient....

time used: 15.800 seconds

 

Now checking if results are valid - please be patient....

:) Validate test passed - GPU results=CPU results <_<

Link to comment
Share on other sites

 Share

×
×
  • Create New...