Jump to content

OpenCL Benchmark - CPU vs GPU / DO NOT USE ANYMORE !


  • Please log in to reply
100 replies to this topic

#1
mitch_de

mitch_de

    InsanelyMacaholic

  • Retired
  • 2,885 posts
  • Gender:Male
  • Location:Stuttgart / Germany
:welcomeani:
The DL link is at the end of that posting.
- Mac OS X 10.6 - Snow Leopard ONLY ! (will not run in 10.5 / 10.4....)


VERY OLD THREAD !
Please use now the main OpenCL Thread
New MAIN openCL Thread

Attached Files



#2
STLVNUB

STLVNUB

    InsanelyMac Legend

  • Coders
  • 1,137 posts
  • Gender:Male
Here mine mitch...
[codebox]Last login: Tue Aug 25 19:34:34 on console
...........................................................
...................OpenCL Bench V 0.1 by mitch.............
.......C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec.......
....... .......
........My test code (simple adds) is cpu friedly..........
.more gpu friedly+complexer code (raytracing/video encod.).
....may give much more speed advantage - at least on C2Ds..
...........................................................
CL_DEVICE_NAME: Intel® Core™2 Duo CPU E8200 @ 2.66GHz
CL_DEVICE_VENDOR: Intel
Now computing - please be patient....
time used: 33.682335
Number of elements computed: 2097152
CL_DEVICE_NAME: GeForce 9800 GT
CL_DEVICE_VENDOR: NVIDIA
Now computing - please be patient....
time used: 2.639566
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:welcomeani: Validate results test passed - GPU=CPU ;)
[/codebox]

#3
mitch_de

mitch_de

    InsanelyMacaholic

  • Retired
  • 2,885 posts
  • Gender:Male
  • Location:Stuttgart / Germany
Thanks ! I hope all get SMILIES :) as the validate result!
PS: I cant test the dual GPU card bench - all cards should be benched. I hope some with 2 GPUs (like MacBookPro) didnt run in an error.

#4
macwanabe

macwanabe

    InsanelyMac Protégé

  • Members
  • PipPip
  • 77 posts
CL_DEVICE_NAME: Intel® Core™2 Quad CPU Q6600 @ 2.40GHz
CL_DEVICE_VENDOR: Intel
Now computing - please be patient....
time used: 15.900080
Number of elements computed: 2097152
CL_DEVICE_NAME: GeForce 8800 GT
CL_DEVICE_VENDOR: NVIDIA
Now computing - please be patient....
time used: 2.618529
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:thumbsup_anim: Validate results test passed - GPU=CPU :P

#5
Sherry Haibara

Sherry Haibara

    InsanelyMac Legend

  • Members
  • PipPipPipPipPipPipPip
  • 699 posts
Seems something isn't working here:

...........................................................
...................OpenCL Bench V 0.1 by mitch.............
.......C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec.......
....... .......
........My test code (simple adds) is cpu friedly..........
.more gpu friedly+complexer code (raytracing/video encod.).
....may give much more speed advantage - at least on C2Ds..
...........................................................
CL_DEVICE_NAME: Intel Core™2 Duo CPU P8700 @ 2.53GHz
CL_DEVICE_VENDOR: Intel
Now computing - please be patient....
time used: 37.822647
Number of elements computed: 2097152
CL_DEVICE_NAME: GeForce 9400M
CL_DEVICE_VENDOR: NVIDIA
Now computing - please be patient....
time used: 12.428713
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:thumbsup_anim: Validate results test - results compute on gpu <> compute cpu

Sherry Haibara

EDIT: Second run:
...........................................................
...................OpenCL Bench V 0.1 by mitch.............
.......C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec.......
....... .......
........My test code (simple adds) is cpu friedly..........
.more gpu friedly+complexer code (raytracing/video encod.).
....may give much more speed advantage - at least on C2Ds..
...........................................................
CL_DEVICE_NAME: Intel Core™2 Duo CPU P8700 @ 2.53GHz
CL_DEVICE_VENDOR: Intel
Now computing - please be patient....
time used: 37.613495
Number of elements computed: 2097152
CL_DEVICE_NAME: GeForce 9400M
CL_DEVICE_VENDOR: NVIDIA
Now computing - please be patient....
time used: 15.683911
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:P Validate results test passed - GPU=CPU :)


By the way, am I supposed to run this with no applications open?

#6
morfy

morfy

    InsanelyMac Legend

  • Members
  • PipPipPipPipPipPipPip
  • 890 posts
...........................................................
...................OpenCL Bench V 0.1 by mitch.............
.......C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec.......
....... .......
........My test code (simple adds) is cpu friedly..........
.more gpu friedly+complexer code (raytracing/video encod.).
....may give much more speed advantage - at least on C2Ds..
...........................................................
CL_DEVICE_NAME: Pentium® Dual-Core CPU E5200 @ 2.50GHz (overclock 3.11ghz)
CL_DEVICE_VENDOR: Intel
Now computing - please be patient....
time used: 28.961924
Number of elements computed: 2097152
CL_DEVICE_NAME: GeForce 8800 GT
CL_DEVICE_VENDOR: NVIDIA
Now computing - please be patient....
time used: 2.580805
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:P Validate results test passed - GPU=CPU :)
logout

#7
cparm

cparm

    InsanelyMac Legend

  • Retired
  • 997 posts
  • Gender:Male
CL_DEVICE_NAME: Intel Core™2 Duo CPU E8500 @ 3.16GHz
CL_DEVICE_VENDOR: Intel
Now computing - please be patient....
time used: 28.509935
Number of elements computed: 2097152
CL_DEVICE_NAME: GeForce 8800 GT
CL_DEVICE_VENDOR: NVIDIA
Now computing - please be patient....
time used: 2.507916
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:) Validate results test passed - GPU=CPU ;)

i suppose that the bench stat are value in red , but what really mean valid results GPU=CPU , mitch can you explain ?

#8
johan

johan

    Nuke ATWT fan

  • Members
  • PipPipPipPipPipPip
  • 427 posts
  • Gender:Male
  • Location:The Netherlands
CL_DEVICE_NAME: Intel® Core™2 Quad CPU @ 2.40GHz
CL_DEVICE_VENDOR: Intel
Now computing - please be patient....
time used: 15.142966
Number of elements computed: 2097152
CL_DEVICE_NAME: GeForce 8800 GTX
CL_DEVICE_VENDOR: NVIDIA
Now computing - please be patient....
time used: 1.761477
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:rolleyes: Validate results test passed - GPU=CPU :wacko:

#9
uuid

uuid

    InsanelyMac Protégé

  • Members
  • Pip
  • 17 posts

i suppose that the bench stat are value in red , but what really mean valid results GPU=CPU , mitch can you explain ?



I guess that he does some benchmark computations in the gpu and in the cpu and then compares whether they gave the same result (as a number). It seems that in some cases, either because of lacking float precision or due to some flipped bit or whatnot, the results differ.

Also, another question to mitch: does this implementation of opencl use the cpu alongside the gpu? I thought I read somewhere that opencl was a rather generic abstraction platform where cpu cores are treated as just another computational unit. (That would mean that the gpu scores are a bit too fast to be real).

PS. Thanks for making the tool!!

#10
miketress

miketress

    InsanelyMac Protégé

  • Members
  • Pip
  • 22 posts
Number of OpenCL devices found: 2
OpenCL Device # 0 = GeForce 9600 GT
Device 0 is an: GPU with max. 1625 MHz and 64 units/cores
Now computing - please be patient....
time used: 0.753 seconds

OpenCL Device # 1 = Intel® Core™ i7 CPU 920 @ 2.67GHz
Device 1 is an: CPU with max. 3800 MHz and 8 units/cores
Now computing - please be patient....
time used: 3.137 seconds

EDIT: updated to v025

#11
mitch_de

mitch_de

    InsanelyMacaholic

  • Retired
  • 2,885 posts
  • Gender:Male
  • Location:Stuttgart / Germany
Updated to V015. Hope fixed output for > 1 GPU
same speed (sure a vary of 2-5% between runs are normal)

to Question1:
The validate of GPU=CPU says:
compared the results which GPU has computed with that what shoud be the result.
For example, 1+1 should be 2 , not 2,1 or 3 :)

to Q2:
Both beches are done by OpenCL - CPU and GPU.
I ony do an validate of the results by "Normal" cpu code.
Seems that OpenCL (running on CPU if no GPU there) does an good job !
i7920 runs really fast !!!
Maybe an real MacPro 2009 with 2 * XEON "i7" will be faster on CPU than GPU - at least with an GT120 (default gpu).

Hope we can see some ATI´s here ;)
And of course some Geforce GT285 !!!! :)

#12
cparm

cparm

    InsanelyMac Legend

  • Retired
  • 997 posts
  • Gender:Male

For example, 1+1 should be 2 , not 2,1 or 3 ;)

:hysterical:

thank you for that precision, I always thought that 1+1 was equal to 4 :D

edit:

last version work also

CL_DEVICE_NAME: Intel(R) Core(TM)2 Duo CPU	 E8500  @ 3.16GHz .....
CL_DEVICE_VENDOR: Intel
CL_DEVICE_MAX_CLOCK_FREQUENCY: 3166 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 2
Now computing - please be patient....
time used: 28.503862
Number of elements computed: 2097152

....CL_DEVICE_NAME: GeForce 8800 GT .....
CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1650 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 112
Now computing - please be patient....
time used: 2.525435
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:) Validate results test passed - GPU=CPU :)


#13
mitch_de

mitch_de

    InsanelyMacaholic

  • Retired
  • 2,885 posts
  • Gender:Male
  • Location:Stuttgart / Germany
"thank you for that precision, I always thought that 1+1 was equal to 4 "
Yes, but that happens only on Windows :(

#14
Ruben-P

Ruben-P

    InsanelyMac Protégé

  • Members
  • Pip
  • 18 posts
  • Gender:Male
....CL_DEVICE_NAME: Intel(R) Core(TM)2 CPU		  6600  @ 2.40GHz .....

CL_DEVICE_VENDOR: Intel

CL_DEVICE_MAX_CLOCK_FREQUENCY: 3096 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 2

Now computing - please be patient....

time used: 29.940746

Number of elements computed: 2097152



....CL_DEVICE_NAME: GeForce 9800 GTX/9800 GTX+ .....

CL_DEVICE_VENDOR: NVIDIA

CL_DEVICE_MAX_CLOCK_FREQUENCY: 1836 MHz

CL_DEVICE_MAX_COMPUTE_UNITS: 128

Now computing - please be patient....

time used: 2.056581

Number of elements computed: 2097152

Now checking if results are valid - please be patient....

:) Validate results test passed - GPU=CPU :)


#15
blackosx

blackosx

    InsanelyMacaholic

  • Coders
  • 3,102 posts
  • Gender:Male
  • Location:UK
Hi mitch. Nice tool :P
[codebox]...........................................................
.................. OpenCL Bench V 0.15 by mitch ...........
...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ......
....... .......
........My test code (simple adds) is cpu friedly..........
.more gpu friedly+complexer code (raytracing/video encod.).
... may give much more speed advantage - at least on C2Ds .
...........................................................

....CL_DEVICE_NAME: Intel® Core™2 Duo CPU E7300 @ 2.66GHz .....
CL_DEVICE_VENDOR: Intel
CL_DEVICE_MAX_CLOCK_FREQUENCY: 2666 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 2
Now computing - please be patient....
time used: 39.562576
Number of elements computed: 2097152

....CL_DEVICE_NAME: GeForce 8800 GT .....
CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1650 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 112
Now computing - please be patient....
time used: 2.386418
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:) Validate results test passed - GPU=CPU :) [/codebox]

#16
catboy

catboy

    InsanelyMac Protégé

  • Members
  • Pip
  • 46 posts
  • Gender:Male
  • Location:Portugal
...........................................................
.................. OpenCL Bench V 0.15 by mitch ...........
...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ......
....... .......
........My test code (simple adds) is cpu friedly..........
.more gpu friedly+complexer code (raytracing/video encod.).
... may give much more speed advantage - at least on C2Ds .
...........................................................

....CL_DEVICE_NAME: Intel® Core™2 CPU 6600 @ 2.40GHz .....
CL_DEVICE_VENDOR: Intel
CL_DEVICE_MAX_CLOCK_FREQUENCY: 2400 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 2
Now computing - please be patient....
time used: 38.881557
Number of elements computed: 2097152

....CL_DEVICE_NAME: GeForce 9800 GT .....
CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1715 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 112
Now computing - please be patient....
time used: 2.566827
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
:) Validate results test passed - GPU=CPU :)

#17
Beerkex'd

Beerkex'd

    Content Provider

  • Members
  • PipPipPipPipPipPipPipPipPipPipPip
  • 3,000 posts
  • Gender:Male
  • Location:Belo Horizonte - Brazil
Weird, V015 doesn't work here, this is the only output I get:

dyld: unknown required load command 0x80000022
Trace/BPT trap

10.5.8 vanilla, Core 2 Duo E8500, 9800GTX+ with latest drivers from Nvidia, NVEnabler.kext.

/Edit

Doh!

Failed the Snow Leopard test!!

#18
proengin

proengin

    InsanelyMac Protégé

  • Members
  • Pip
  • 13 posts
Here is my "updated" score from SL.

...........................................................
.................. OpenCL Bench V 0.15 by mitch ...........
...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ......
....... .......
........My test code (simple adds) is cpu friedly..........
.more gpu friedly+complexer code (raytracing/video encod.).
... may give much more speed advantage - at least on C2Ds .
...........................................................

....CL_DEVICE_NAME: Intel® Core™ i7 CPU 920 @ 2.67GHz .....
CL_DEVICE_VENDOR: Intel
CL_DEVICE_MAX_CLOCK_FREQUENCY: 4280 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 8
Now computing - please be patient....
time used: 3.834852
Number of elements computed: 2097152

....CL_DEVICE_NAME: GeForce GTX 285 .....
CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1584 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 240
Now computing - please be patient....
time used: 0.861248
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
;) Validate results test passed - GPU=CPU :D

This program seems to multi-thread very well according to SL's CPU Usage monitor.

#19
cparm

cparm

    InsanelyMac Legend

  • Retired
  • 997 posts
  • Gender:Male

Weird, V015 doesn't work here, this is the only output I get:

dyld: unknown required load command 0x80000022
Trace/BPT trap

10.5.8 vanilla, Core 2 Duo E8500, 9800GTX+ with latest drivers from Nvidia, NVEnabler.kext.


this tool is for 10.6 only

#20
miketress

miketress

    InsanelyMac Protégé

  • Members
  • Pip
  • 22 posts
Mitch,

I'm running a 9600GT 512Mb (like you) instead of the 8800GTx you wrote in your post.





0 user(s) are reading this topic

0 members, 0 guests, 0 anonymous users

© 2014 InsanelyMac  |   News  |   Forum  |   Downloads  |   OSx86 Wiki  |   Mac Netbook  |   PHP hosting by CatN  |   Designed by Ed Gain  |   Logo by irfan  |   Privacy Policy