Jump to content
100 posts in this topic

Recommended Posts

It didn't work on my ATI HD4870 after try to compute GPU. can move only mouse & got to press a reset button.

 

Also, see http://netkas.org/?p=164

 

Have you used the lastest V020 - which added lost of error handling code ?

Please look / write down reported errors / error messages.

My test

*****

 

 

 

Last login: Wed Aug 26 16:57:16 on console

/Users/peterdavidanderson/Desktop/OpenCLBench_as_terminal_tool/OpenCL2_Bench_V020 ; exit;

noname:~ peterdavidanderson$ /Users/peterdavidanderson/Desktop/OpenCLBench_as_terminal_tool/OpenCL2_Bench_V020 ; exit;

...........................................................

.................. OpenCL Bench V 0.15 by mitch ...........

...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ......

....... .......

........My test code (simple adds) is cpu friedly..........

.more gpu friedly+complexer code (raytracing/video encod.).

... may give much more speed advantage - at least on C2Ds .

...........................................................

 

....CL_DEVICE_NAME: Intel® Xeon® CPU 5150 @ 2.66GHz .....

CL_DEVICE_VENDOR: Intel

CL_DEVICE_MAX_CLOCK_FREQUENCY: 2660 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 4

Now computing - please be patient....

time used: 16.817684

Number of elements computed: 2097152

 

....CL_DEVICE_NAME: GeForce 8800 GT .....

CL_DEVICE_VENDOR: NVIDIA

CL_DEVICE_MAX_CLOCK_FREQUENCY: 1500 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 112

Now computing - please be patient....

time used: 2.608059

Number of elements computed: 2097152

Now checking if results are valid - please be patient....

:( Validate results test passed - GPU=CPU :)

logout

 

[Prozess beendet]

cpu + 2xgpu

 

...........................................................

.................. OpenCL Bench V 0.15 by mitch ...........

...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ......

....... .......

........My test code (simple adds) is cpu friedly..........

.more gpu friedly+complexer code (raytracing/video encod.).

... may give much more speed advantage - at least on C2Ds .

...........................................................

 

....CL_DEVICE_NAME: Intel® Core2 Quad CPU @ 2.40GHz .....

CL_DEVICE_VENDOR: Intel

CL_DEVICE_MAX_CLOCK_FREQUENCY: 2400 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 4

Now computing - please be patient....

time used: 28.956915

Number of elements computed: 2097152

 

....CL_DEVICE_NAME: GeForce 9600 GT .....

CL_DEVICE_VENDOR: NVIDIA

CL_DEVICE_MAX_CLOCK_FREQUENCY: 1750 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 64

Now computing - please be patient....

time used: 2.694709

Number of elements computed: 2097152

 

....CL_DEVICE_NAME: GeForce 9600 GT .....

CL_DEVICE_VENDOR: NVIDIA

CL_DEVICE_MAX_CLOCK_FREQUENCY: 1750 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 64

Now computing - please be patient....

time used: 2.797374

Number of elements computed: 2097152

Now checking if results are valid - please be patient....

:hysterical: Validate results test passed - GPU=CPU ;)

logout

THANKS !

Question: Do you have 2 indentical GPUs´?

Also (will not change the times) always use the lastest BUILD, ist V020.

Has much error handling code for ATI users (NVIDIAs seems to run without errors so far :) )

mbp/late 2008 result:

....CL_DEVICE_NAME: Intel(R) Core(TM)2 Duo CPU	 P8600  @ 2.40GHz .....
CL_DEVICE_VENDOR: Intel
CL_DEVICE_MAX_CLOCK_FREQUENCY: 2400 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 2
Now computing - please be patient....
time used: 56.190952
Number of elements computed: 2097152

....CL_DEVICE_NAME: GeForce 9600M GT .....
CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1250 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 32
Now computing - please be patient....
time used: 10.169043
Number of elements computed: 2097152

....CL_DEVICE_NAME: GeForce 9600M GT .....
CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1250 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 32
Now computing - please be patient....
time used: 10.120525
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:) Validate results test passed - GPU=CPU :)

 

seems to be a bug, you are testing the same gpu twice (or just printing out the info of the first gpu device twice?). the second gpu should be a 9400M.

 

This program seems to multi-thread very well according to SL's CPU Usage monitor.

Thanks for that detail !

I think the 10.6 changes "in the deep" will exspecially use much Cores better than 10.5 - even without special

source coding changes. But recompiling source with newest Xcode & using 10.6 dev framework needed , i think.

thats an opencl feature, or the purpose of opencl :) scale a small program/kernel well to many cores - be it cpu or gpu.

Upgrade from Open CL Bench V 020.

 

...........................................................
.................. OpenCL Bench V 0.15 by mitch ...........
...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ......
.......											 .......
........My test code (simple adds) is cpu friedly..........
.more gpu friedly+complexer code (raytracing/video encod.).
... may give much more speed advantage - at least on C2Ds .
...........................................................

....CL_DEVICE_NAME: Pentium(R) Dual-Core  CPU	  E5200  @ 2.50GHz .....
CL_DEVICE_VENDOR: Intel
CL_DEVICE_MAX_CLOCK_FREQUENCY: 3129 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 2
Now computing - please be patient....
time used: 28.777699
Number of elements computed: 2097152

....CL_DEVICE_NAME: GeForce 8800 GT .....
CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1600 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 112
Now computing - please be patient....
time used: 2.618950
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:) Validate results test passed - GPU=CPU :)

./OpenCL2_Bench_V020

...........................................................

.................. OpenCL Bench V 0.15 by mitch ...........

...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ......

....... .......

........My test code (simple adds) is cpu friedly..........

.more gpu friedly+complexer code (raytracing/video encod.).

... may give much more speed advantage - at least on C2Ds .

...........................................................

 

....CL_DEVICE_NAME: Intel� Coreâ„¢2 Quad CPU Q9450 @ 2.66GHz .....

CL_DEVICE_VENDOR: Intel

CL_DEVICE_MAX_CLOCK_FREQUENCY: 3072 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 4

Now computing - please be patient....

time used: 14.658403

Number of elements computed: 2097152

 

....CL_DEVICE_NAME: Radeon HD 4870 .....

CL_DEVICE_VENDOR: AMD

CL_DEVICE_MAX_CLOCK_FREQUENCY: 750 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 4

Now computing - please be patient....

///here gui freezes immedeatly

time used: 27.399342

Number of elements computed: 2097152

Now checking if results are valid - please be patient....

:) Validate results test passed - GPU=CPU :)

 

 

x2000.kexts dumps ** GPU Debug Info ** to dmesg

 

maybe its too much loops and so too much memory used by arrays, decreased loops number to 1000(edited source inside bin), now no crash.

mbp/late 2008 result:

....CL_DEVICE_NAME: Intel(R) Core(TM)2 Duo CPU	 P8600  @ 2.40GHz .....
CL_DEVICE_VENDOR: Intel
CL_DEVICE_MAX_CLOCK_FREQUENCY: 2400 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 2
Now computing - please be patient....
time used: 56.190952
Number of elements computed: 2097152

....CL_DEVICE_NAME: GeForce 9600M GT .....
CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1250 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 32
Now computing - please be patient....
time used: 10.169043
Number of elements computed: 2097152

....CL_DEVICE_NAME: GeForce 9600M GT .....
CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1250 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 32
Now computing - please be patient....
time used: 10.120525
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:) Validate results test passed - GPU=CPU :)

 

seems to be a bug, you are testing the same gpu twice (or just printing out the info of the first gpu device twice?). the second gpu should be a 9400M.

 

Thanks.

I will fix that bug soon.

A workaround for that bug:

Please post result again after you disabled the 9600M GT ( so 9400M is the alone GPU).

The 10 sec results is for the 9600MGT - 9400M will run slower.

This is my result:

...........................................................
.................. OpenCL Bench V 0.15 by mitch ...........
...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ......
.......											 .......
........My test code (simple adds) is cpu friedly..........
.more gpu friedly+complexer code (raytracing/video encod.).
... may give much more speed advantage - at least on C2Ds .
...........................................................

....CL_DEVICE_NAME: Intel(R) Core(TM)2 Quad CPU	Q9550  @ 2.83GHz .....
CL_DEVICE_VENDOR: Intel
CL_DEVICE_MAX_CLOCK_FREQUENCY: 2836 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 4
Now computing - please be patient....
time used: 15.836717
Number of elements computed: 2097152

....CL_DEVICE_NAME: GeForce 9600 GT .....
CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1625 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 64
Now computing - please be patient....
time used: 2.700367
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:) Validate results test passed - GPU=CPU :)

./OpenCL2_Bench_V020

...........................................................

.................. OpenCL Bench V 0.15 by mitch ...........

...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ......

....... .......

........My test code (simple adds) is cpu friedly..........

.more gpu friedly+complexer code (raytracing/video encod.).

... may give much more speed advantage - at least on C2Ds .

...........................................................

 

....CL_DEVICE_NAME: Intel� Core�„�2 Quad CPU Q9450 @ 2.66GHz .....

CL_DEVICE_VENDOR: Intel

CL_DEVICE_MAX_CLOCK_FREQUENCY: 3072 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 4

Now computing - please be patient....

time used: 14.658403

Number of elements computed: 2097152

 

....CL_DEVICE_NAME: Radeon HD 4870 .....

CL_DEVICE_VENDOR: AMD

CL_DEVICE_MAX_CLOCK_FREQUENCY: 750 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 4

Now computing - please be patient....

///here gui freezes immedeatly

time used: 27.399342

Number of elements computed: 2097152

Now checking if results are valid - please be patient....

:D Validate results test passed - GPU=CPU :D

 

 

x2000.kexts dumps ** GPU Debug Info ** to dmesg

 

maybe its too much loops and so too much memory used by arrays, decreased loops number to 1000(edited source inside bin), now no crash.

 

 

Thanks, i also thougt about this memory problem.

Will compile an ATI_debug version soon and post it below other DL Link.

Sure, if i will decrease the loop down from 5000 to 1000, the time results of very fast GPUs like GTX 285 will also decrease from 0.8 down to 0,0xy :police:

I am working on an other solution, which does more complex work but not in such an huge loop.

THANKS !

Question: Do you have 2 indentical GPUs´?

Also (will not change the times) always use the lastest BUILD, ist V020.

Has much error handling code for ATI users (NVIDIAs seems to run without errors so far :thumbsup_anim: )

 

yep 2 palit nvidia 9600 gt on 2 pciex x16 ports

Doesn't seem to test my setup correctly.

 

CL_DEVICE_NAME: Intel® Xeon® CPU X5365 @ 3.00GHz

CL_DEVICE_VENDOR: Intel

Now computing - please be patient....

time used: 7.710562

Number of elements computed: 2097152

CL_DEVICE_NAME: GeForce 8800 GT

CL_DEVICE_VENDOR: NVIDIA

Now computing - please be patient....

time used: 2.492461

Number of elements computed: 2097152

CL_DEVICE_NAME: GeForce 8800 GT

CL_DEVICE_VENDOR: NVIDIA

Now computing - please be patient....

time used: 2.489143

Number of elements computed: 2097152

Now checking if results are valid - please be patient....

:thumbsup_anim: Validate results test passed - GPU=CPU :D

 

 

 

I have an 8800GT in Slot 3 working as the helper card to a GTX260. Looks like it's testing the 8800GT twice.

Doesn't seem to test my setup correctly.

 

 

I have an 8800GT in Slot 3 working as the helper card to a GTX260. Looks like it's testing the 8800GT twice.

 

 

NEW VERSION on the road - DL V025, lots of changes (and hopefully fixed ATI + > 2 gpu tests)

BINGO

 

 

Number of OpenCL devices found: 3

OpenCL Device # 0 = GeForce 8800 GT

Device 0 is an: GPU with max. 1500 MHz and 112 units/cores

Now computing - please be patient....

time used: 0.683 seconds

 

OpenCL Device # 1 = GeForce GTX 260

Device 1 is an: GPU with max. 1400 MHz and 216 units/cores

Now computing - please be patient....

time used: 0.365 seconds

 

OpenCL Device # 2 = Intel® Xeon® CPU X5365 @ 3.00GHz

Device 2 is an: CPU with max. 3000 MHz and 8 units/cores

Now computing - please be patient....

time used: 3.094 seconds

 

Now checking if results are valid - please be patient....

:thumbsup_anim: Validate test passed - GPU results=CPU results :D

BINGO

 

 

Number of OpenCL devices found: 3

OpenCL Device # 0 = GeForce 8800 GT

Device 0 is an: GPU with max. 1500 MHz and 112 units/cores

Now computing - please be patient....

time used: 0.683 seconds

 

OpenCL Device # 1 = GeForce GTX 260

Device 1 is an: GPU with max. 1400 MHz and 216 units/cores

Now computing - please be patient....

time used: 0.365 seconds

 

OpenCL Device # 2 = Intel® Xeon® CPU X5365 @ 3.00GHz

Device 2 is an: CPU with max. 3000 MHz and 8 units/cores

Now computing - please be patient....

time used: 3.094 seconds

 

Now checking if results are valid - please be patient....

:) Validate test passed - GPU results=CPU results :)

 

YEAH !

Hope also ATI users will not get an freezed system anymore - their GPUs got overloaded with old code - NVIDIAs not :star_smile:

 

Remember : the V025 time used results of V025 cant be compared 1:1 with the old version ones.

Thats because of code changes for ATI users :wacko:

Here are my scores from V0.25 script:

 

...........................................................

.................. OpenCL Bench V 0.25 by mitch ...........

...... C2D 3GHz = 12 sec vs Nvidia 9600GT = 0,93 sec ......

... time results are not comparable to older version! .....

...........................................................

 

Number of OpenCL devices found: 2

OpenCL Device # 0 = GeForce GTX 285

Device 0 is an: GPU with max. 1584 MHz and 240 units/cores

Now computing - please be patient....

time used: 0.231 seconds

 

OpenCL Device # 1 = Intel® Core i7 CPU 920 @ 2.67GHz

Device 1 is an: CPU with max. 4280 MHz and 8 units/cores

Now computing - please be patient....

time used: 1.296 seconds

 

Now checking if results are valid - please be patient....

:rolleyes: Validate test passed - GPU results=CPU results :)

...........................................................
.................. OpenCL Bench V 0.25 by mitch ...........
...... C2D 3GHz = 12 sec vs Nvidia 9600GT = 0,93 sec ......
... time results are not comparable to older version! .....
...........................................................

Number of OpenCL devices found: 3
OpenCL Device # 0 = Radeon HD 4870
Device 0 is an: GPU with max. 750 MHz and 4 units/cores 
Now computing - please be patient....
time used:  4.126 seconds

OpenCL Device # 1 = GeForce GT 120
Device 1 is an: GPU with max. 1400 MHz and 32 units/cores 
Error: clBuildProgram for device # 1 
ERROR NUMBER = -11

This just shows how SAD the ATI Drivers currently are!!! OUCH!!!!

 

 

_tool-1/OpenCL2_Bench_V025 ; exit;

...........................................................

.................. OpenCL Bench V 0.25 by mitch ...........

...... C2D 3GHz = 12 sec vs Nvidia 9600GT = 0,93 sec ......

... time results are not comparable to older version! .....

...........................................................

 

Number of OpenCL devices found: 2

OpenCL Device # 0 = Radeon HD 4870

Device 0 is an: GPU with max. 750 MHz and 4 units/cores

Now computing - please be patient....

time used: 4.065 seconds

 

OpenCL Device # 1 = Intel® Xeon® CPU 5160 @ 3.00GHz

Device 1 is an: CPU with max. 3000 MHz and 4 units/cores

Now computing - please be patient....

time used: 6.079 seconds

 

Now checking if results are valid - please be patient....

:thumbsup_anim: Validate test passed - GPU results=CPU results :bag:

logout

 

[Process completed]

tool-1/OpenCL2_Bench_V025; exit;
...........................................................
.................. OpenCL Bench V 0.25 by mitch ...........
...... C2D 3GHz = 12 sec vs Nvidia 9600GT = 0,93 sec ......
... time results are not comparable to older version! .....
...........................................................

Number of OpenCL devices found: 2
OpenCL Device # 0 = GeForce GTX 260
Device 0 is an: GPU with max. 1242 MHz and 192 units/cores 
Now computing - please be patient....
time used:  0.357 seconds

OpenCL Device # 1 = Intel(R) Core(TM)2 Duo CPU	 E8400  @ 3.00GHz
Device 1 is an: CPU with max. 3600 MHz and 2 units/cores 
Now computing - please be patient....
time used: 10.433 seconds

Now checking if results are valid - please be patient....
:) Validate test passed - GPU results=CPU results :) 
logout

Hello mitch, can you make available the source code? Thanks!! :P

 

 

My results:

Number of OpenCL devices found: 3

OpenCL Device # 0 = GeForce 9600M GT

Device 0 is an: GPU with max. 1250 MHz and 32 units/cores

Now computing - please be patient....

time used: 2.798 seconds

 

OpenCL Device # 1 = GeForce 9400M

Device 1 is an: GPU with max. 1100 MHz and 16 units/cores

Now computing - please be patient....

time used: 9.549 seconds

 

OpenCL Device # 2 = Intel® Core2 Duo CPU P8600 @ 2.40GHz

Device 2 is an: CPU with max. 2400 MHz and 2 units/cores

Now computing - please be patient....

time used: 15.800 seconds

 

Now checking if results are valid - please be patient....

:) Validate test passed - GPU results=CPU results <_<

×
×
  • Create New...