Jump to content

OpenCL Oceanwave Bench and (new) CompuBench CL


mitch_de
 Share

367 posts in this topic

Recommended Posts

Great!

Some AMD have problems (other than the only cosmetic glitch) , they get OpenCL init failed.

I mad an new OceanWave_ML Mountain Lion ONLY!!! version. Wasnt easy, because it was not only OpenCL 10.8 SDK recompile action, also OpenGL seems to be changed much (also GLUT). Therefor i must enable multisampling to avoid that glitch (flashing + wrong OpenGL window).

The results with multisampling enabled cant be compared with the older version - the slower the gpu the more fps diff between both.

My 9600 GT gone down from 104 to 89 fps using multisampling. Faster gpu may not have that much diff. Diff comes because gpu has a bit more to do - beside OpenCL it must also do the opengl display - and multisampling does take some more gpu time - at least for slow gpus.

 

OceanWave_ML Mountain Lion ONLY!!! - wont run 10.6 / 10.7

you may need to setup mail account first, if you want to submit results.

OpenCL_OceanWave_ML.zip

Link to comment
Share on other sites

Great to see that 32 Bit Version is working on your ATOM / NV 9400M :)

Do you see any big fps diff comparing 64 Bit vs 32 Bit verson using i7 / GTX 570 ?

 

from cmf:

OpenCL 10.8 DP1: opencl works ootb on fermi , but apple seems to have removed the sm_20 code and hard setting it to 1.2 or 1.3 won't work either , means: luxmark and all other programs that want to use more complex opencl stuff (like local memory atomics) will fail)

Link to comment
Share on other sites

  • 2 weeks later...

Great to see that 32 Bit Version is working on your ATOM / NV 9400M :)

Do you see any big fps diff comparing 64 Bit vs 32 Bit verson using i7 / GTX 570 ?

 

from cmf:

OpenCL 10.8 DP1: opencl works ootb on fermi , but apple seems to have removed the sm_20 code and hard setting it to 1.2 or 1.3 won't work either , means: luxmark and all other programs that want to use more complex opencl stuff (like local memory atomics) will fail)

 

Then this luxmark test on 10.8 DP1 with GTX470 (sm_20) is confusing.. Score is ~10 times higher than the test applied in lion using similar cards.

 

http://i.imm.io/hPfv.png

 

hPfv.png

Link to comment
Share on other sites

Nope !

Look at the rendered window result and compare with the 10.6 / 10.7 screenshoots!

10.8 DP1 and Luxmark 2.0 didnt work really (rendered preview wrong + unsualbe results) and need to be updated in the future. Its OpenCL code itself AND also QT (the QT SDK from Nokia )

Will take much time, because actual QT SDK doesnt support 10.8.

Link to comment
Share on other sites

Nope !

Look at the rendered window result and compare with the 10.6 / 10.7 screenshoots!

10.8 DP1 and Luxmark 2.0 didnt work really (rendered preview wrong + unsualbe results) and need to be updated in the future. Its OpenCL code itself AND also QT (the QT SDK from Nokia )

Will take much time, because actual QT SDK doesnt support 10.8.

 

Yes you are right.

It also did not perform the test on GTX 560 (sm_21) which seems to be ok on 10.8 DP1.

Link to comment
Share on other sites

  • 1 month later...
  • 2 weeks later...
  • 3 weeks later...

UPDATE to Mai 2012 Version

Updated Apple changes in my Benchmark,

"

Version 1.3, 2012-05-22

Fix warnings pertaining to use of deprecated APIs Fix a bug related to redundant call to glutSwapBuffer"

added fullscreen mode (look first post), DL first post

Link to comment
Share on other sites

  • 3 weeks later...

GA-P35-DS3P (rev 1.1)

Q6700 @2.3 GHz 4GB RAM

GeForce GTX460 @ 1430 MHz - Netkas OpenCL patch applied to GeForceGLDriver.bundle and NVDAGF100Hal.kext in both 10.7.4 and 10.8DP4

 

Lion (10.7.4)

Windowed: 215.2 fps post-240958-0-08825400-1340001779_thumb.png

Full Screen: 149.9 fps post-240958-0-64091800-1340001755_thumb.png

 

Mountain Lion (10.8DP4)

Windowed: 241.1 fps post-240958-0-39557600-1340001804_thumb.png

Full Screen: 164.4 fps post-240958-0-37911600-1340001793_thumb.png

  • Like 1
Link to comment
Share on other sites

  • 1 month later...
  • 2 weeks later...

moved over to 10.8...can't run the benchmark now...tried the opencl fix from before...but still errors... :hammer:

Edit: actually installing the bundle after patching helps :thumbsup_anim:

here's my results

INITIALISING: 1 OpenCL platform found!
[Platform 0]
Name:   Apple
Vendor:  Apple
Version:  OpenCL 1.2 (Jun 20 2012 14:18:19)
Profile:  FULL_PROFILE
Extensions:    cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event

[OpenCL-only Context]
2 OpenCL devices found!
[Device 0]
Name:    Intel(R) Core(TM)2 Duo CPU	 E6550  @ 2.33GHz
Vendor:   Intel
Type:    CPU
Device Version:  OpenCL 1.2
Driver Version:  1.1
Compute Units:   2
Work Group Size:  1024
Clock:    2327 MHz
Global Memory (Total):  8192 MB
Global Memory (Host):  8192 MB
Global Memory (PCIe):  0 MB
Local Memory:   32 KB
Cache Size:   0.0625 KB
Cache Line Size:  4194304 Bytes
Available:   Yes
Double-Precision:  Yes
Extensions:
   cl_APPLE_SetMemObjectDestructor
   cl_APPLE_ContextLoggingFunctions
   cl_APPLE_clut
   cl_APPLE_query_kernel_names
   cl_APPLE_gl_sharing
   cl_khr_gl_event
   cl_khr_fp64
   cl_khr_global_int32_base_atomics
   cl_khr_global_int32_extended_atomics
   cl_khr_local_int32_base_atomics
   cl_khr_local_int32_extended_atomics
   cl_khr_byte_addressable_store
   cl_khr_int64_base_atomics
   cl_khr_int64_extended_atomics
   cl_khr_3d_image_writes
   cl_APPLE_fp64_basic_ops
   cl_APPLE_fixed_alpha_channel_orders
   cl_APPLE_biased_fixed_point_image_formats
[Device 1]
Name:    GeForce GT 520
Vendor:   NVIDIA
Type:    GPU
Device Version:  OpenCL 1.1
Driver Version:  CLH 1.0
Compute Units:   1
Work Group Size:  1024
Clock:    1620 MHz
Global Memory:   1024 MB
Local Memory:   48 KB
Cache Size:   0 KB
Cache Line Size:  0 Bytes
Available:   Yes
Double-Precision:  No
Extensions:
   cl_APPLE_SetMemObjectDestructor
   cl_APPLE_ContextLoggingFunctions
   cl_APPLE_clut
   cl_APPLE_query_kernel_names
   cl_APPLE_gl_sharing
   cl_khr_gl_event
   cl_khr_byte_addressable_store
   cl_khr_global_int32_base_atomics
   cl_khr_global_int32_extended_atomics
   cl_khr_local_int32_base_atomics
   cl_khr_local_int32_extended_atomics
   cl_APPLE_fp64_basic_ops
[shared OpenCL+OpenGL Context]
2 OpenCL devices found!
[Device 0]
Name:    GeForce GT 520
Vendor:   NVIDIA
Type:    GPU
Device Version:  OpenCL 1.1
Driver Version:  CLH 1.0
Compute Units:   1
Work Group Size:  1024
Clock:    1620 MHz
Global Memory:   1024 MB
Local Memory:   48 KB
Cache Size:   0 KB
Cache Line Size:  0 Bytes
Available:   Yes
Double-Precision:  No
Extensions:
   cl_APPLE_SetMemObjectDestructor
   cl_APPLE_ContextLoggingFunctions
   cl_APPLE_clut
   cl_APPLE_query_kernel_names
   cl_APPLE_gl_sharing
   cl_khr_gl_event
   cl_khr_byte_addressable_store
   cl_khr_global_int32_base_atomics
   cl_khr_global_int32_extended_atomics
   cl_khr_local_int32_base_atomics
   cl_khr_local_int32_extended_atomics
   cl_APPLE_fp64_basic_ops
[Device 1]
Name:    Intel(R) Core(TM)2 Duo CPU	 E6550  @ 2.33GHz
Vendor:   Intel
Type:    CPU
Device Version:  OpenCL 1.2
Driver Version:  1.1
Compute Units:   2
Work Group Size:  1024
Clock:    2327 MHz
Global Memory (Total):  8192 MB
Global Memory (Host):  8192 MB
Global Memory (PCIe):  0 MB
Local Memory:   32 KB
Cache Size:   0.0625 KB
Cache Line Size:  4194304 Bytes
Available:   Yes
Double-Precision:  Yes
Extensions:
   cl_APPLE_SetMemObjectDestructor
   cl_APPLE_ContextLoggingFunctions
   cl_APPLE_clut
   cl_APPLE_query_kernel_names
   cl_APPLE_gl_sharing
   cl_khr_gl_event
   cl_khr_fp64
   cl_khr_global_int32_base_atomics
   cl_khr_global_int32_extended_atomics
   cl_khr_local_int32_base_atomics
   cl_khr_local_int32_extended_atomics
   cl_khr_byte_addressable_store
   cl_khr_int64_base_atomics
   cl_khr_int64_extended_atomics
   cl_khr_3d_image_writes
   cl_APPLE_fp64_basic_ops
   cl_APPLE_fixed_alpha_channel_orders
   cl_APPLE_biased_fixed_point_image_formats

Edited by robertx
Link to comment
Share on other sites

  • 1 month later...
 Share

×
×
  • Create New...