All OpenCL Benches: RAYTRACING/Galaxies/Grass/qJulia/Displacement...



#41
cwestpha

    InsanelyMac Geek

  • Members
  • 161 posts
  • Gender:Male
  • Location:USA
Hmm, on my 2008 Mac Pro (2.8 GHz, 8-core) with a GTX 285, running the 32K Galaxies V2 under 10.6.1:
CPU: 11
multi core: 88
GPU: 329
hybrid: 123

#42
n00b32

    InsanelyMac Protégé

  • Members
  • 10 posts
Hi,

how come CUDA on Mac OS X? Where did you get these drivers from?

All my results are with cuda drivers.



#43
shoarthing

    InsanelyMac Legend

  • Members
  • 833 posts
  • Location:Blighty

. . . Nvidia downloads

NB: v2.3x (the current one) is 32-bit only, AFAIK. To get an idea of where this is at, see the relevant section of the NV forums.

#44
n00b32

    InsanelyMac Protégé

  • Members
  • 10 posts
Thanks, didn't know that.

Right now it's only the relevant apps that aren't there yet ;-)

#45
MarceloDub

    InsanelyMac Protégé

  • Members
  • 5 posts
good

#46
byronrock

    InsanelyMac Protégé

  • Just Joined
  • 4 posts
  • Gender:Male
  • Location:El Salvador!
Is there a way to run the other benchmarks on a Hackintosh?
I can only run Galaxies (by the way, I get 30G with my Athlon x4 720).

I want to run Displacement, but it says "bad cpu type in executable" and then "logout".
I have a 9400 GT.

Am I doing something wrong?

#47
@ROBASEFR

    InsanelyMac Geek

  • Members
  • 337 posts
  • Gender:Male
  • Location:France
Hi

I've had success with an ATI HD 4850 (Gainward GS 512) under OS X 10.6.2 on my Hackintosh.

All tests were run on a 1920x1080, 32-bit, 60 Hz LCD HD monitor.
Displacement: 43 fps.
Galaxies 32K V2.0 and 8K V2.0 both worked!

When I toggle with the S key I get:
2;13;47;69;52 Gflops

OpenCL Bench V 0.20 by mitch

....CL_DEVICE_NAME: Intel Core™ i7 CPU 920 @ 2.67GHz .....
CL_DEVICE_VENDOR: Intel
CL_DEVICE_MAX_CLOCK_FREQUENCY: 3096 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 8
Now computing - please be patient....
time used: 9.933120
Number of elements computed: 2097152
....CL_DEVICE_NAME: Radeon HD 4870 .....
CL_DEVICE_VENDOR: AMD
CL_DEVICE_MAX_CLOCK_FREQUENCY: 750 MHz
CL_DEVICE_MAX_COMPUTE_UNITS: 10
Now computing - please be patient....
time used: 16.656227
Number of elements computed: 2097152
Now checking if results are valid - please be patient....
;) Validate results test passed - GPU=CPU :P
logout
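
The two runs in the log above can be compared with a little arithmetic. This is a minimal Python sketch, assuming "time used" is reported in seconds (the log does not state the unit); the variable and function names are made up for illustration.

```python
# Figures copied from the OpenCL Bench V 0.20 log above.
# Assumption: "time used" is in seconds (the log gives no unit).
ELEMENTS = 2097152          # "Number of elements computed"
cpu_seconds = 9.933120      # run on the Core i7 920 device
gpu_seconds = 16.656227     # run on the Radeon device


def elements_per_second(elements, seconds):
    """Throughput of one run in elements per second."""
    return elements / seconds


cpu_rate = elements_per_second(ELEMENTS, cpu_seconds)
gpu_rate = elements_per_second(ELEMENTS, gpu_seconds)

# How many times longer the GPU run took than the CPU run.
gpu_slowdown = gpu_seconds / cpu_seconds

print(f"CPU: {cpu_rate:,.0f} elements/s")
print(f"GPU: {gpu_rate:,.0f} elements/s")
print(f"GPU run took {gpu_slowdown:.2f}x as long as the CPU run")
```

In this particular log the CPU device finishes the same workload sooner than the GPU device, so the ratio comes out above 1.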

And: Transpose bandwidth test
Tests/Open\ CL/OpenCL\ Tranpose\ Bandwidhttest/transpose
Performing Matrix Transpose [256 x 4096]...
Bandwidth Achieved = 2.755923 GB/sec
Results Validated!

:unsure:

#48
dudelolchris

    InsanelyMac Protégé

  • Just Joined
  • 2 posts
All the OpenCL demos crash on my brand new Late 2009 iMac with the ATi 4670 graphics.
This makes me sad.

#49
computergek80

    InsanelyMac Protégé

  • Members
  • 3 posts
They crash for me too, 27" iMac Radeon HD 4670. Anyone know what's up?

#50
mitch_de

    InsanelyMacaholic

  • Retired
  • 2,896 posts
  • Gender:Male
  • Location:Stuttgart / Germany
Be patient.
Apple will surely fix these OpenCL problems by spring 2010.
Even 3+ months after the 10.6 release there is NO application out that needs or uses OpenCL.
Apple also hasn't used OpenCL in any of its own apps (sure, it would have been a failure if they had).
Upcoming (spring 2010 and later) versions of iTunes, iMovie, iDVD, FCP, Logic, ... will have OpenCL speedups!

So these problems don't really hurt as long as it's only demos and benchmarks that won't work on your GPU.

I will update the benches soon with newer versions (updated Apple OpenCL demos).

#51
J the Ninja

    InsanelyMac Protégé

  • Just Joined
  • 1 posts
Hey, everybody. I wanted to post this link in here, it's to an iTunes Visualizer that makes an OpenCL fluid simulation dance to your music:

http://www.mutantquartz.com/?p=40


(No, I did not make it, that is not my blog. I just found it when the author posted it on the MacRumors forums).

Summary of the important-for-benchmarking controls for those who don't feel like referring to the link constantly:

F - Toggle framerate counter
X - Toggle hi-res mode (requires reset to apply)
R - Reset simulation
H - Toggle height-mapping mode


Oh, and it has a memory leak. Not that big of a deal, you just need to restart iTunes to clear it, but be forewarned. Also, it didn't use to work on ATI, but that may have changed in 10.6.2; I don't have an ATI card to test it with.

Some numbers from my MBP's 256MB 9600M GT on 10.6.2 (all of these are with music sensitivity, speed, and color set at the defaults):

Normal mode: ~18fps
Height-map: ~18fps
High-res: Slideshow (didn't even try height-map)

#52
dudelolchris

    InsanelyMac Protégé

  • Just Joined
  • 2 posts
On an ATI Radeon HD 4670: ~9 fps in 2D and heightmap. High-res is awful. CPU use is oddly high, ~177%.

#53
animeboy

    InsanelyMac Protégé

  • Members
  • 1 posts
Hi,

I just got a new upgrade for my Early 2008 Mac Pro, the EVGA GTX285 card.
2 x 2.8 GHz, 24 Gig Ram. 10.6.2
Using the 32k Galaxies, I'm getting 392 Gigaflops with the GPU.

Average about 90-98fps with Grass.

#54
nvidia2008

    InsanelyMac Protégé

  • Just Joined
  • 2 posts

MacBook Pro 2.4 GHz, 2 GB RAM, 9600M GT 256 MB

Earfluid 0.1
About 25 fps in 2D and heightmap. High-res kills it, dropping to less than 10 fps. CPU use is just under 50%.

Galaxies 32K V2
About 50 Gigaflops

MacBook (Aluminium) 2 GHz, 4 GB RAM, 9400M

Earfluid 0.1
About 11 fps in 2D and heightmap. High-res kills it, dropping to less than 5 fps. CPU use is just under 40%.
Not as bad as I thought. This 9400M ain't that shabby compared to the 9600M GT. Definitely better than anything from Intel for years.

.................
.................

Nice find ;) ...Not sure why your fps is lower than mine though.

#55
rushko

    InsanelyMac Protégé

  • Members
  • 9 posts
Hi,

Any news here?

My results for Galaxies 32K

nVidia 9600GSO - 174 Gflops

Core i5 - 40 Gflops

#56
mitch_de

    InsanelyMacaholic

  • Retired
  • 2,896 posts
  • Gender:Male
  • Location:Stuttgart / Germany
YEP !
Galaxies 32K, NV 8800GTX: 197 Gigaflops.

NEW OpenCL Raytracing Benchmark (updated in the 1st post)

smallluxGPU
Does raytracing on the GPU, GPU+CPU or CPU only.
Very complex (real-life) computing, so a weak GPU gains less of an advantage here than when running the more low-level OpenCL demos.
Does hybrid (CPU+GPU) much better than Galaxies: even an NV 9400M makes sense and supports the CPU with a 15-20% speed gain!
Uses ALL OpenCL GPUs it finds (up to 4).
Also works with ATI 48XX GPUs.

MORE in the 1st post!

#57
blackosx

    InsanelyMacaholic

  • Coders
  • 3,139 posts
  • Gender:Male
  • Location:UK
Hi mitch_de

Well done with keeping up to date with the latest benchmarks :(

Specs:
Video: 8800GT 512MB OC
CPU: E7300 @ 2.66Ghz

Results for the new OpenCL Raytracing benchmark:
GPU only: ./smallluxGPU 0 0 1 64 = Avg. rays/sec 2755K
CPU + GPU: ./smallluxGPU 2 0 1 64 = Avg. rays/sec 3160K
CPU only: ./smallluxGPU 2 0 0 64 = Avg. rays/sec 540K
Note: The CPU only test started at 707K but settled to 540K after 36 seconds.
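
For comparison with the 15-20% hybrid speed gain mentioned earlier in the thread, the three figures above can be turned into relative numbers. This is a small Python sketch using only the averages posted here; the helper name `percent_gain` is made up for illustration.

```python
# Avg. rays/sec (in thousands) from the results above.
gpu_only = 2755   # ./smallluxGPU 0 0 1 64
cpu_gpu = 3160    # ./smallluxGPU 2 0 1 64
cpu_only = 540    # ./smallluxGPU 2 0 0 64 (settled value)


def percent_gain(baseline, value):
    """Relative change of value over baseline, in percent."""
    return (value - baseline) / baseline * 100.0


hybrid_gain = percent_gain(gpu_only, cpu_gpu)
gpu_vs_cpu = gpu_only / cpu_only

print(f"CPU+GPU over GPU only: {hybrid_gain:+.1f}%")
print(f"GPU only over CPU only: {gpu_vs_cpu:.1f}x")
```

On this machine the hybrid run lands just below the quoted range, and the GPU alone is roughly five times faster than the CPU alone.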

#58
mitch_de

    InsanelyMacaholic

  • Retired
  • 2,896 posts
  • Gender:Male
  • Location:Stuttgart / Germany
It's normal that the rays/sec figure needs some time to stabilize (fewer changes in rays/sec).
Users may change the workgroup size from 64 to 128, 256 or 32. Workgroup size is an OpenCL parameter that depends on the GPU. A larger workgroup size may speed up the GPU-only part on faster / modern GPUs.
But mostly a bigger workgroup size will NOT change the GPU-only speed significantly, at least not on GPUs like mine (8800 GT), but a GTX 260 / GTX 275 / GTX 285 may perform better with 128 or 256.
Too big a workgroup size may slow down or even crash the OpenCL program.
Example:
./smallluxGPU 0 0 1 64
./smallluxGPU 0 0 1 256 (workgroupsize 64 > 256)
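
To try each of the workgroup sizes mentioned above in turn, the command lines can be generated rather than typed by hand. This is a rough Python sketch; it only prints the commands and does not launch anything, since the benchmark keeps running until it is aborted with CTRL + C. The function name `benchmark_command` is hypothetical, and the fixed "0 0 1" prefix is simply copied from the GPU-only examples in this thread.

```python
BINARY = "./smallluxGPU"
# Workgroup sizes suggested in the post above.
WORKGROUP_SIZES = [32, 64, 128, 256]


def benchmark_command(workgroup_size):
    """GPU-only benchmark command line for one workgroup size.

    The "0 0 1" prefix is taken verbatim from the examples in this thread.
    """
    return [BINARY, "0", "0", "1", str(workgroup_size)]


commands = [benchmark_command(size) for size in WORKGROUP_SIZES]

for cmd in commands:
    # Print one command per line; paste each into Terminal and compare rays/sec.
    print(" ".join(cmd))
```

Run each printed command from the app's main folder, let the rays/sec figure settle, and note it down before moving to the next size.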


Also, if you want to "see what's done", you can switch into the interactive mode:
./smallluxGPU 0 0 0 1 0 640 480 scenes/luxball.scn (GPUs only, workgroupsize=0=default)
./smallluxGPU 0 2 0 1 0 640 480 scenes/luxball.scn (CPU 2 Threads + GPUs , workgroupsize=0=default)
Interactive mode mostly gives slightly lower rays/sec than the benchmark mode, because the OpenCL app must also do all the screen output!

#59
dodusman

    InsanelyMac Protégé

  • Members
  • 48 posts
Hello

Snow Leopard 10.6.2, 64-bit, Nvidia GTS 250
I get this message when I try smallluxGPU:

<low latency mode enabled (0 or 1)> <native thread count> <use CPU device (0 or 1)> <use GPU device (0 or 1)> <GPU workgroup size (0=default value or anything > 0)> <window width> <window height> <halt time in secs> <scene file>
Reading scene: scenes/simple.scn
terminate called after throwing an instance of 'std::ios_base::failure'
  what():  basic_ios::clear
Abort trap


#60
mitch_de

    InsanelyMacaholic

  • Retired
  • 2,896 posts
  • Gender:Male
  • Location:Stuttgart / Germany
PLEASE READ THE "HOW TO RUN" file within the zip.
It's a command line / Terminal app and doesn't run by double-clicking it!!!

You must start the Terminal and change directory to the main folder of the command line app.
You must add command line options to the app!!!

Run benchmark, GPU only (CTRL + C to abort):
./smallluxGPU 0 0 1 64
You may also use 128 or 256 instead of the 64 above to get more rays/sec, but that only makes sense on very modern GPUs (GTX 275+).

run interactive GPU only
./smallluxGPU 0 0 0 1 0 640 480 scenes/luxball.scn
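
Since the interactive command is a string of bare positional numbers, it may help to build it from named values. The sketch below is only an illustration in Python: the argument order follows the usage message posted earlier in the thread, the function and parameter names are invented, and `halt_time` is left out by default because the examples here omit it even though that usage message lists it before the scene file.

```python
def interactive_command(native_threads=0, use_cpu_device=False,
                        use_gpu_device=True, workgroup_size=0,
                        width=640, height=480,
                        scene="scenes/luxball.scn",
                        low_latency=False, halt_time=None,
                        binary="./smallluxGPU"):
    """Build an interactive-mode command line as a string.

    Order assumed from the posted usage message:
    low latency, native threads, CPU device, GPU device,
    workgroup size, width, height, [halt time], scene file.
    """
    args = [binary, int(low_latency), native_threads,
            int(use_cpu_device), int(use_gpu_device),
            workgroup_size, width, height]
    if halt_time is not None:
        # Only some builds appear to expect this argument (assumption).
        args.append(halt_time)
    args.append(scene)
    return " ".join(str(a) for a in args)


# GPUs only, default workgroup size.
print(interactive_command())
# Two native CPU threads plus GPUs.
print(interactive_command(native_threads=2))
```

With the defaults this reproduces the GPU-only interactive command shown above, and `native_threads=2` reproduces the "CPU 2 Threads + GPUs" variant from the earlier post.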




