
OpenCL Oceanwave Bench and (new) CompuBench CL


mitch_de

367 posts in this topic


HACKINTOSH MacPro3,1 Mac OS X 10.7.2 Q8200 @ 2.33GHz 2333 MHz GPU GeForce GTX 260 1350 MHz FPS= 164.96

Thanks. That is the range a GTX 260 should reach when AGPM works (the GPU clocks up under load). I mentioned this when another GTX 260 user reported a very low 44 fps (see the list and some earlier posts).

 

If your ATI GPU is listed as ATI Prototype XT, please add the real GPU type in the mail :)


HACKINTOSH

MacPro3,1

Mac OS X 10.7.2

Intel® Core2 Quad CPU Q8300 @ 2.50GHz 2500 MHz

GPU GeForce GTX 460 0 MHz

RESULT: FPS= 591.72

 

With the same hardware, only the SMBIOS changed to MacPro5,1:

 

HACKINTOSH

MacPro5,1

Mac OS X 10.7.2

Intel® Core2 Quad CPU Q8300 @ 2.50GHz 2500 MHz

GPU GeForce GTX 460 0 MHz

RESULT: FPS= 43.05

 

Képernyőfotó%202011-11-14%20-%2017.21.51.png


THANKS !

99% sure: the big difference between the two results comes from AGPM behaving differently depending on the Mac model type set in the SMBIOS.

It may be the same problem for the GTX 260 user with only 44 fps (140-170 fps is normal): the GPU stays in idle (slow) mode even under GPU load.

In this case (MacPro5,1), OpenGL should also run much slower, not only OpenCL.
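To put a number on the gap between the two GTX 460 runs above, here is a minimal Python sketch. The function names and the 3x threshold are my own assumptions for illustration, not something OceanWave itself uses.

```python
def slowdown_factor(reference_fps: float, observed_fps: float) -> float:
    """Return how many times slower the observed run is than the reference."""
    if reference_fps <= 0 or observed_fps <= 0:
        raise ValueError("fps values must be positive")
    return reference_fps / observed_fps


def looks_throttled(observed_fps: float, reference_fps: float,
                    threshold: float = 3.0) -> bool:
    """Flag a result that is at least `threshold` times slower than the
    reference for the same GPU. The threshold is an assumed value."""
    return slowdown_factor(reference_fps, observed_fps) >= threshold


# The two GTX 460 results posted above (MacPro3,1 vs. MacPro5,1 SMBIOS).
factor = slowdown_factor(591.72, 43.05)
print(f"MacPro5,1 run is {factor:.1f}x slower")
print("throttled?", looks_throttled(43.05, 591.72))
```

A result flagged this way only hints that the GPU may be stuck in a low power state; it does not prove that AGPM is the cause.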


Thanks, I got a lot of new fps submissions and added them to the fps list in the first post.

INFO: The GPU MHz value comes from the OpenCL driver. It may be 0 (GPU type not fully supported) or differ from the real clock of your (overclocked) card. Whether it is the real MHz or some fantasy value depends on the GPU type. I can't fix that.

 

INFO2: Some GPU types now appear many times in the list, like my GeForce 9600 GT, which is low end but not bad in OpenCL.
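If you want to collect the posted result lines yourself, a rough Python sketch like the one below could pull out the GPU name, clock and fps, treating a reported 0 MHz as "unknown". The regular expression is only my guess at the line layout seen in this thread, and `parse_result` is a made-up helper name.

```python
import re
from typing import NamedTuple, Optional

# Assumed layout: "... GPU <name> <clock> MHz ... FPS= <value>"
RESULT_RE = re.compile(
    r"GPU\s*(?:\d+:\s*)?(?P<gpu>.+?)\s+(?P<mhz>\d+)\s*MHz"
    r".*?FPS=\s*(?P<fps>\d+(?:\.\d+)?)"
)


class Result(NamedTuple):
    gpu: str
    gpu_mhz: Optional[int]  # None when the driver reported 0 MHz
    fps: float


def parse_result(line: str) -> Optional[Result]:
    """Parse one posted result line; return None if it does not match."""
    m = RESULT_RE.search(line)
    if m is None:
        return None
    mhz = int(m.group("mhz"))
    return Result(m.group("gpu"), mhz or None, float(m.group("fps")))


line = ("HACKINTOSH MacPro3,1 Mac OS X 10.7.2 Q8200 @ 2.33GHz 2333 MHzGPU "
        "GeForce GTX 260 1350 MHz FPS= 164.96")
print(parse_result(line))
```

Lines where the driver reports 0 MHz come back with `gpu_mhz` set to `None`, so they are not mistaken for a real clock.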


My results :)

 

iMac9,1 Mac OS X 10.7.2 Intel® Core2 CPU 6420 @ 2.13GHz 2800 MHz GPU GeForce GTX 550 Ti 0 MHz Mac OS X 10.7.2 RESULT: FPS= 223.59

 

Is it good? Or can I do something to improve my fps? Thanks!!


Thanks for the first GTX 550 Ti result. I think it's good; there is nothing more you can do.

Only when a fast GPU gets just 44 fps (1/4 to 1/5 of normal speed), like some GTX 260 cards, may it help to check AGPM or change the SMBIOS Mac model. But in those cases the user will already have noticed very slow OpenGL speed.


Hello,

 

As you can see, there is no way to select the GPU to use for this test. In slot 1 I have the stock 8800GT and in slot 2 an XFX flashed 5770 card.

Some parts of the system see one card and other parts see the other.

 

MacPro3,1 Mac OS X 10.7 Intel® Xeon® CPU E5462 @ 2.80GHz 2800 MHz GPU 2: GeForce 8800 GT 1500 MHz Mac OS X 10.7 RESULT: FPS= 102.34

OpenGL Extensions Viewer 4.0 has no issues in changing rendering device.

 

post-176965-1321386698_thumb.jpg


Yepp, GPU card selection isn't implemented in that version - maybe in V1.2 :)

 

Cool, not sure how hard that will be for you to code. Might you have any thoughts as to why some parts of OSX see the 8800GT and others see the 5770?

 

10.7.3 11D16

same hardware as before...

Now can you do a comparison to the MP3,1 value?


INITIALISING: 1 OpenCL platform found!

 

[Platform 0]

Name: Apple

Vendor: Apple

Version: OpenCL 1.1 (Jul 25 2011 15:56:07)

Profile: FULL_PROFILE

Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event

 

 

[OpenCL-only Context]

1 OpenCL device found!

 

[Device 0]

Name: Intel® Core i7 CPU 930 @ 2.80GHz

Vendor: Intel

Type: CPU

Device Version: OpenCL 1.1

Driver Version: 1.1

Compute Units: 8

Work Group Size: 1024

Clock: 2803 MHz

Global Memory (Total): 6144 MB

Global Memory (Host): 6144 MB

Global Memory (PCIe): 0 MB

Local Memory: 32 KB

Cache Size: 0.0625 KB

Cache Line Size: 8388608 Bytes

Available: Yes

Double-Precision: Yes

Extensions:

cl_APPLE_SetMemObjectDestructor

cl_APPLE_ContextLoggingFunctions

cl_APPLE_clut

cl_APPLE_query_kernel_names

cl_APPLE_gl_sharing

cl_khr_gl_event

cl_khr_fp64

cl_khr_global_int32_base_atomics

cl_khr_global_int32_extended_atomics

cl_khr_local_int32_base_atomics

cl_khr_local_int32_extended_atomics

cl_khr_byte_addressable_store

cl_khr_int64_base_atomics

cl_khr_int64_extended_atomics

cl_khr_3d_image_writes

cl_APPLE_fp64_basic_ops

cl_APPLE_fixed_alpha_channel_orders

cl_APPLE_biased_fixed_point_image_formats

 

[shared OpenCL+OpenGL Context]

1 OpenCL device found!

 

[Device 0]

Name: Intel® Core i7 CPU 930 @ 2.80GHz

Vendor: Intel

Type: CPU

Device Version: OpenCL 1.1

Driver Version: 1.1

Compute Units: 8

Work Group Size: 1024

Clock: 2803 MHz

Global Memory (Total): 6144 MB

Global Memory (Host): 6144 MB

Global Memory (PCIe): 0 MB

Local Memory: 32 KB

Cache Size: 0.0625 KB

Cache Line Size: 8388608 Bytes

Available: Yes

Double-Precision: Yes

Extensions:

cl_APPLE_SetMemObjectDestructor

cl_APPLE_ContextLoggingFunctions

cl_APPLE_clut

cl_APPLE_query_kernel_names

cl_APPLE_gl_sharing

cl_khr_gl_event

cl_khr_fp64

cl_khr_global_int32_base_atomics

cl_khr_global_int32_extended_atomics

cl_khr_local_int32_base_atomics

cl_khr_local_int32_extended_atomics

cl_khr_byte_addressable_store

cl_khr_int64_base_atomics

cl_khr_int64_extended_atomics

cl_khr_3d_image_writes

cl_APPLE_fp64_basic_ops

cl_APPLE_fixed_alpha_channel_orders

cl_APPLE_biased_fixed_point_image_formats

 

OpenCL OceanWave:-------------------------------------------------------------------------------

[CL_INVALID_DEVICE] : OpenCL Error : Failed to create context from share group! No CL device found!

 

Failed to initialize CL

 

 

On my gtx470 I get this error. My system runs smooth.

 

By the way, what does OpenCL give us?

 

10.7.2


First: OpenCL always detects at least one OpenCL device, the CPU.

The GPU may NOT be detected if it isn't supported at all (like ATI 3xxx or GeForce 7xxx GPUs) OR if it is unknown to the Apple drivers, which must then be patched for OpenCL. OpenGL may work normally even with unpatched drivers.

 

"Cool, not sure how hard that will be for you to code. Might you have any thoughts as to why some parts of OSX see the 8800GT and others see the 5770?"

OpenCL is a special case here compared to OpenGL.

The OpenCL code must be edited to use a device other than the default one (the first in the OpenCL info list in the app's log).
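The idea of picking a device other than the first one in the list can be sketched on the log text alone. This is not OceanWave's code and it does not call the OpenCL API; `parse_devices` and `pick_device` are hypothetical helpers that just read the `[Device N]` blocks of a log like the one posted above.

```python
import re

DEVICE_HEADER = re.compile(r"^\[Device (\d+)\]$")


def parse_devices(log_text):
    """Collect the 'Key: Value' lines under each '[Device N]' header."""
    devices = []
    current = None
    for raw in log_text.splitlines():
        line = raw.strip()
        header = DEVICE_HEADER.match(line)
        if header:
            current = {"Index": int(header.group(1))}
            devices.append(current)
        elif line.startswith("["):
            current = None  # another section, e.g. a context header
        elif current is not None and ":" in line:
            key, _, value = line.partition(":")
            current[key.strip()] = value.strip()
    return devices


def pick_device(devices, wanted_type="GPU"):
    """Return the first device of the wanted type, or None if there is none."""
    for device in devices:
        if device.get("Type") == wanted_type:
            return device
    return None


# Shortened excerpt of the log posted above: only the CPU is listed.
LOG = """[OpenCL-only Context]
1 OpenCL device found!
[Device 0]
Name: Intel Core i7 CPU 930 @ 2.80GHz
Vendor: Intel
Type: CPU
"""

devices = parse_devices(LOG)
print(pick_device(devices))          # no GPU in this log
print(pick_device(devices, "CPU"))
```

With the GTX 470 log above, the search for a GPU finds nothing, which fits the "No CL device found!" error: only the CPU was exposed as an OpenCL device.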


NEW Version V1.2

- error handling for GPUs that can't do OpenCL at all, OR for OpenCL errors caused by driver failures (Fermi with an unpatched driver). Users get a message box instead of unknown fps results

- updated results list

- no changes in the main OpenCL bench

Sorry, but there is no option to select the GPU for OpenCL. Adding that feature needs very complex Apple code.


When I first ran this I got about 14 fps..... then I modified AppleGraphicsPowerManagement.kext, and now I get 65 fps :unsure: But I don't know if that is good enough, see below.

 

I can play (high bitrate) MKV better in Windows 7 than in OS X. But perhaps that is because I installed codecs in W7. Do I need some codecs in OS X too?

 

 

HACKINTOSH MacBookPro6,1 Mac OS X 10.7.2 Intel(R) Core(TM) i7 CPU Q 720 @ 1.60GHz 1596 MHz GPU GeForce GT 230M 1100 MHz FPS= 65.97


Yep, video decoding speed depends on the codec software. MKV is a container; the codec inside can be anything, but is mostly MP4 or H.264 :D

On Windows some GPU drivers accelerate video decoding. On OS X only some do, and only if the software uses Apple's hardware-accelerated video decoding framework. Normally that doesn't matter (apart from higher CPU load), but it may be a problem for slower CPUs.


Added LUXMARK (OpenCL benchmark) info, a download link and a results link to the first post.

LUXMARK can bench ALL OpenCL GPUs. That may be interesting if you have more than one and none of them is low end.

Besides OpenCL it puts a much higher load on the CPU, so CPU speed matters, unlike OceanWave with its very low CPU load.

To get close to the LuxMark HEROs you need 8 * GTX 580 for results around 70000! :)

Bildschirmfoto_2011_11_21_um_10.48.43.jpg


HACKINTOSH MacPro5,1 Mac OS X 10.7.2 Intel® Core i7 CPU 920 @ 2.67GHz 3500 MHz GPU ATI Radeon Barts XT Prototype 930 MHz FPS= 386.16

(AMD Radeon HD 6870)

 

LuxMark:

 

post-413183-1321890506_thumb.png

 

8 * GTX 580? who needs that much power? o.o


Added many submitted FPS results to the list in the first post.

THANKS.

If you submit very different FPS values (like 25 and 80 fps), please comment on what you changed (Mac model in the .plist, AGPM changes for the GPU, ...).

 

With that many results we can see that OceanWave produces valid results: the same GPU type with a very different CPU gives the same result. :(

HACKINTOSH MacPro3,1 Mac OS X 10.7.2 CPU Q9400 @ 3400 MHz GPU GeForce 9800 GT 1500 MHz FPS= 98.68

HACKINTOSH MacPro3,1 Mac OS X 10.7.2 CPU Q6600 @ 2997 MHz GPU GeForce 9800 GT 1500 MHz FPS= 98.68

If your result is much slower than that of a similar GPU, check whether your GPU runs in slow mode (1/3 to 1/2 of the maximum GPU clock) even though it has a lot of work. In such cases OpenGL performance will also be much lower than expected. This has to do with the AGPM settings for that GPU and Mac model type.


  • 2 weeks later...

Motherboard: ASUS P8Z68-V PRO

BIOS: 1101 (All default settings. No Over-clocking, No XMP)

CPU: Intel Core i7 2600K 3.4 GHz

GPU: nVidia GeForce GTX 560 Ti (Gigabyte GV-N560OC-1GI 900 MHz 1024 MB)

RAM: 16 GB (4 x 4 GB) (G.Skill Sniper 1600 MHz)

OS: Mac OS X Lion 10.7.2

Boot: Chameleon 2.1 R1713

SMBios: Mac Mini Server (Mid 2011)

Kexts Installed: FakeSMC, ALC8xxHDA, HDAEnabler892, AppleHDA (10.6.2)

 

I'm not using any DSDT or similar. Vanilla system. AppleIntelCPUPowerManagement patched with SpeedStepper 1.2. Added my device ID in NVDAGF100Hal, patched GeForceGLDriver and libclh.dylib for OpenCL support. Edited AppleGraphicsPowerManagement with this:

 

<key>Macmini5,3</key>
<dict>
  <key>Vendor10deDevice1200</key>
  <dict>
    <key>BoostPState</key>
    <array>
      <integer>0</integer>
      <integer>1</integer>
      <integer>2</integer>
      <integer>3</integer>
    </array>
    <key>BoostTime</key>
    <array>
      <integer>3</integer>
      <integer>3</integer>
      <integer>3</integer>
      <integer>3</integer>
    </array>
    <key>Heuristic</key>
    <dict>
      <key>ID</key>
      <integer>0</integer>
      <key>IdleInterval</key>
      <integer>250</integer>
      <key>SensorOption</key>
      <integer>1</integer>
      <key>SensorSampleRate</key>
      <integer>4</integer>
      <key>TargetCount</key>
      <integer>5</integer>
      <key>Threshold_High</key>
      <array>
        <integer>60</integer>
        <integer>60</integer>
        <integer>60</integer>
        <integer>100</integer>
      </array>
      <key>Threshold_Low</key>
      <array>
        <integer>0</integer>
        <integer>75</integer>
        <integer>90</integer>
        <integer>98</integer>
      </array>
    </dict>
    <key>LogControl</key>
    <integer>1</integer>
    <key>control-id</key>
    <integer>17</integer>
  </dict>
  <key>LogControl</key>
  <integer>1</integer>
  <key>default-control-id</key>
  <integer>17</integer>
</dict>

 

AGPM works fine. It stays in state 2 most of the time when I'm using the computer and goes to state 0 when I run this test. And I get this:

 

opcyhd.jpg

 

But according to the first page I should get about 420 fps with this GPU and CPU. Am I doing something wrong? Thanks
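For anyone who wants to double-check an AGPM entry like the one above before editing the kext, a small Python sketch with the standard `plistlib` module can read the values back. The fragment below is a trimmed copy of the posted entry, wrapped in a `<plist>` root so it parses on its own. Pairing the four threshold entries with states 0 to 3 is my assumption based on the four `BoostPState` values.

```python
import plistlib

# Trimmed copy of the AGPM entry posted above, wrapped so plistlib accepts it.
FRAGMENT = b"""<?xml version="1.0" encoding="UTF-8"?>
<plist version="1.0">
<dict>
  <key>Macmini5,3</key>
  <dict>
    <key>Vendor10deDevice1200</key>
    <dict>
      <key>Heuristic</key>
      <dict>
        <key>Threshold_High</key>
        <array>
          <integer>60</integer><integer>60</integer>
          <integer>60</integer><integer>100</integer>
        </array>
        <key>Threshold_Low</key>
        <array>
          <integer>0</integer><integer>75</integer>
          <integer>90</integer><integer>98</integer>
        </array>
      </dict>
      <key>control-id</key>
      <integer>17</integer>
    </dict>
  </dict>
</dict>
</plist>"""

config = plistlib.loads(FRAGMENT)
gpu = config["Macmini5,3"]["Vendor10deDevice1200"]
high = gpu["Heuristic"]["Threshold_High"]
low = gpu["Heuristic"]["Threshold_Low"]

# Assumed: entry i of each array belongs to power state i.
for state, (lo, hi) in enumerate(zip(low, high)):
    print(f"state {state}: Threshold_Low={lo} Threshold_High={hi}")
```

This only reads the values; it says nothing about whether they are good settings for a GTX 560 Ti.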


You can try running the bench a few times. The faster the GPU, the more the results vary a little from run to run.

Your result is OK for that card. Other, faster results may come from a slightly faster system bus/PCIe, a slightly faster CPU and/or slightly faster VRAM / GPU {censored}.

AGPM state 0 (while running this test) is perfect: the GPU runs at its fastest clock.
