Jump to content

OpenCL Oceanwave Bench and (new) CompuBench CL


mitch_de
 Share

367 posts in this topic

Recommended Posts

looks OK!

Error only, if result pic shows no waves or no sun or wont run at all.

 

As all benches: Your fps result (compared to others) may get different ranking compared to ranking in galaxies or luxmark results.

OpenCL Code + usage of OpenGL (parallel) + usage of CPU give that wide range of different rankings of different OpenCL benches using different gpus+cpus.

Link to comment
Share on other sites

  • 2 weeks later...

OK, not exactly sure what I did here..Although I still have some cosmetic overclock settings entered in Clover's config.plist, I put all the BIOS overclock settings to bare stock except XMP for RAM...And just ran the best series of benchmarks I've ever done, both GPU ones and CPU ones. I'll post a few others in their threads.

OpenCL OceanWave Benchmark V1.2.jpg

Wow! I've never seen it say PS=xxxx before, but have never seen that big of a number either.

Edited by eep357
invalid benchmark, inflated by Clover config error
Link to comment
Share on other sites

It does not work anymore for me, the system is in the sig. The app opens, i click benchmark and then the submission form shows up, no benchmark, no sun ,no sea and no score. only the submission window.

 

Any clue as to why this happens ?

 

PS: i get this in log :

 

 

 

 

 

 

OpenCL OceanWave:bash: line 1: 310 Segmentation fault: 11 ./OpenCL_OceanWave

OpenCL OceanWave:bash: line 1: 313 Segmentation fault: 11 ./OpenCL_OceanWave

OpenCL OceanWave:bash: line 1: 319 Segmentation fault: 11 ./OpenCL_OceanWave

OpenCL OceanWave:bash: line 1: 324 Segmentation fault: 11 ./OpenCL_OceanWave

 

Link to comment
Share on other sites

It does not work anymore for me, the system is in the sig. The app opens, i click benchmark and then the submission form shows up, no benchmark, no sun ,no sea and no score. only the submission window.

 

Any clue as to why this happens ?

 

PS: i get this in log : seg fault 11

 

Never seen this before.

First try to download again (and unzip).

OpenCL_Oceanwave (the code called in the bash) is in 64 Bit only - maybe your system doenst run 64 Bit?

Link to comment
Share on other sites

Never seen this before.

First try to download again (and unzip).

OpenCL_Oceanwave (the code called in the bash) is in 64 Bit only - maybe your system doenst run 64 Bit?

 

 

"64-bit Kernel and Extensions: Yes"

 

i'm on OS X 10.7.4 , could it be that ?
Link to comment
Share on other sites

Acer 8930G core2duo / GT 9600m 512

 

 

Caractéristiques de l'objet Point d'origine: Guangdong China (Mainland)

 

Description

 

Carte vidéo GeForce 9600M GS 512MB, DDR3 graphiques II originaux des tous neufs de Nvidia MXM

 

Dispositifs :

 

Fabricant NVIDIA Série GeForce 9M Nom de code NB9P Canalisations 32 - unifié Vitesse de noyau * 430 mégahertz Vitesse de Shader * 1075 mégahertz Vitesse de mémoire * 800 mégahertz Largeur d'autobus de mémoire Bit 128 Type de mémoire GDDR2 Quantité maximale de mémoire Mb 256 Mémoire partagée non

DirectX DirectX 10, Shader 4.0 Transistors 314 millions technologie 65 nanomètre Dispositifs PCI-E 2.0, 400 mégahertz de Speichertakt d'und du bei GDDR2 bei GDDR3 de 800 mégahertz Taille de cahier moyen Date d'annonce 03.06.2008

 

 

 

 

 

 

957144Capturede769cran20130129a768234656.png

Link to comment
Share on other sites

 

i'm on OS X 10.7.4 , could it be that ?

 

Normally not, even i build it with 10.8 SDK.

Try OpenCL_Oceanwave ( the OpenCL code part, put unzipped into OpenCL_Oceanwave/Content/Resources) which was build using SDK 10.7

 

For me, 10.8.3 doenst matter in speed / working same using the 10.7 SDK build.

OpenCL_OceanWave.zip

Link to comment
Share on other sites

Yep, but small FPS diffs can happen between different CPu /Bus / PCIe speeds and different clocked (GPU/VRAM) speeds to.

Even on same system speed diffs between two runs can happen - in area of 400 FPS at least 10-20 fps diff between runs is normal.

 

Perhaps the OpenCLbandwidth test may help to find out such speed diffs (CPUGPU data transferspeed, GPUGPU(=devicetodevice) speeds)

 

Here are my results for 9600 GT:

Last value, device to device shows VRAM acess speed, limited by VRAM type + VRAM clock and VRAM bandwidth (how many Bits, like 64 bit for low end gpus like GT210/GT220, 128 bits, 192 bits or 256 Bits)

The first two values are CPUPCIGPU transferspeeds and limited by PCIe speed, CPU type and GPU type+clock

 

 

 

Developer/GPU Computing/OpenCL/bin/darwin/release/oclBandwidthTest Starting...

Running on...

GeForce 9600 GT

Quick Mode

Host to Device Bandwidth, 1 Device(s), Paged memory, direct access

Transfer Size (Bytes) Bandwidth(MB/s)

33554432 2200.0

 

Device to Host Bandwidth, 1 Device(s), Paged memory, direct access

Transfer Size (Bytes) Bandwidth(MB/s)

33554432 1784.8

 

Device to Device Bandwidth, 1 Device(s)

Transfer Size (Bytes) Bandwidth(MB/s)

33554432 27501.7

 

[oclBandwidthTest] test results...

PASSED > exiting in 3 seconds: 3...2...1...done! logout

oclBandwidthTest.zip

  • Like 1
Link to comment
Share on other sites

Very much true that there will always be slight differences when having same gpu with different cpu/bus/pci speeds etc but looking at his system with an i5 paired with the 7870, I'd only assume that my cpu overclock has to do with the higher fps from the gpu (just an assumption I'll test with stock clock settings to verify).

 

And thank you mitch_de the test proves quite interesting to see those differences in bandwidth. The last one involving the vram speed was the most interesting for me to see as I've never seen the actual speeds in any tests before.

 

I might pick at the other guy to run the test as well so I could see how the numbers look on his end.

 

Running on...
ATI Radeon Barts XT Prototype
Quick Mode
Host to Device Bandwidth, 1 Device(s), Paged memory, direct access
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 4531.3
Device to Host Bandwidth, 1 Device(s), Paged memory, direct access
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 5770.6
Device to Device Bandwidth, 1 Device(s)
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 102000.4
[oclBandwidthTest] test results...
PASSED
> exiting in 3 seconds: 3...2...1...done!

Link to comment
Share on other sites

Mh, here are my results:

 

/Users/alex/Downloads/oclBandwidthTest Starting...

Running on...

AMD Radeon HD Pitcairn XT Prototype Compute Engine

Quick Mode

Host to Device Bandwidth, 1 Device(s), Paged memory, direct access

Transfer Size (Bytes) Bandwidth(MB/s)

33554432 9859.1

Device to Host Bandwidth, 1 Device(s), Paged memory, direct access

Transfer Size (Bytes) Bandwidth(MB/s)

33554432 11576.8

Device to Device Bandwidth, 1 Device(s)

Transfer Size (Bytes) Bandwidth(MB/s)

33554432 84047.9

[oclBandwidthTest] test results...

PASSED

> exiting in 3 seconds: 3...2...1...done!

  • Like 1
Link to comment
Share on other sites

Talk about quick to post, thank you k3nny :)

 

As I somehow assumed, it's definitely only the gpu not so much the OC because your first 2 test scores no doubt beat mine since the cpu is in the party.

 

Both our cards share the same 256 bit interface so the difference lies within the cards independent of the other system components.

Link to comment
Share on other sites

Yep, interesting is the big speed diff for transferspeed CPU>PCIe bus> GPU (VRAM) and GPU (VRAM) > PCIe bus> CPU.

Twice as fast for the one system compared to other. 4500 vs 9800 MB/sand 5700 vs 11500 MB/s !

GPU VRAM >GPU VRAM transferspeed diff isnt that much 84000 vs 102000.

Because OpenCL Apps must always (more or less) submit data (for computing) to gpu and receive result data that transferspeed (CPUPCieGPU) can

make the difference even gpu+VRAM speed has less differences.

Thats why real (science) gpu computing tries to minimise transfered data or like PCie 3.0 (+ MB Chipset support + GPU chipset support) bus systems.

(Old AGP or PCI bus would be a no go for today gpu computing : only 1/4 - 1/10 of your transferspeeds over PCie)

  • Like 1
Link to comment
Share on other sites

DOes anyone know what the Hex edits are for OSX 10.8.2 in:

 

/System/Library/Extensions/

GeForceGLDriver.bundle/Contents/MacOS/GeForceGLDriver

 

and

 

/System/Library/Extensions/

GeForceGLDriver.bundle/Contents/MacOS/libclh.dylib

Link to comment
Share on other sites

*First Kepler score*

 

892.2 FPS

 

Brand new Mountain Lion 10.8.2 install

Nvidia 304.00.05f02 driver, CUDA 5.0.37

OpenCL unpatched

No AGPM edits

GraphicsEnabler=n - no injection!

MacPro3,1 system definition

EVGA Geforce GTX 660 2GB (vanilla model)

Core 2 Duo E8500 @ 3.16GHz

Asus P5Q-E (P45 Express/ICH10R) 4GB RAM

 

Screen Shot 2013-02-04 at 1.07.30 AM.png

 

Running on...

GeForce GTX 660

Quick Mode

Host to Device Bandwidth, 1 Device(s), Paged memory, direct access
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 4227.2

Device to Host Bandwidth, 1 Device(s), Paged memory, direct access
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 5542.3

Device to Device Bandwidth, 1 Device(s)
Transfer Size (Bytes) Bandwidth(MB/s)
33554432 81757.6

[oclBandwidthTest] test results...
PASSED

  • Like 1
Link to comment
Share on other sites

 Share

×
×
  • Create New...