All OpenCL Benches: RAYTRACING/Galaxies/Grass/qJulia/Displacement..., UPDATED 20.03: smalllux 143: GUI fixes, slg 143, luxball standard |
Welcome to the OS X Snow Leopard (10.6) discussion.
Please note: This sub-forum is not for OSx86 questions! If you have an OSx86 Snow Leopard question, you should post your topic Here
- The InsanelyMac Staff.
All OpenCL Benches: RAYTRACING/Galaxies/Grass/qJulia/Displacement..., UPDATED 20.03: smalllux 143: GUI fixes, slg 143, luxball standard |
|
mitch_de
InsanelyMac Legend
|
![]() |
Aug 30 2009, 07:08 PM Post #1
|
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]()
|
All OpenCL GPU FPS Benches:
NEW OpenCL Raytraycing Benchmark (updated 1. Posting) smallluxGPU Does raytraycing by GPU, GPU+CPU or CPU only Very complex (real life) computing, so less advantage for weak GPU than running more low level OpenCL Demos. Does much better hybrid (CPU+GPU) than Galaxies. Uses ALL openCL GPUs (up to 4) parallel which it find. Also works with ATI 48XX GPUs . Update to V170 (always same link )[/b] http://www.macupdate.com/info.php/id/33632/smallluxgpu Major update with console tab (you can see informations the gui also shows but now even more + errors) Happy benching (times type 0 no changes, type 1 maybe little faster)
33632_scr.jpg ( 253.91K )
Number of downloads: 27************ Older stuff / mostly not much real world like smalllux ! Galaxies32K V2 + Galaxies 8K V2 + Grass + Displacement +AO (raytraycing CPU/GPU) + Transpose Bandwith [/size] Snow Leopard + Intel Macs ONLY ! ATI OpenCL GPUs (4850&4870) not really working! - i am in contact with ATI DEVs -problems with OpemCL Drivers/Framework - must+will be fixed with 10.6.1 or an ATI Driver Update HOT NEWS - always updated here - New Galaxies OpenCL Bench V2 build - Apple updated / fixed some OpenCL API usage (maybe help ATI) - little speed up (10% on my GT 9600) Now i build an 32K and an 8K Version - 32K use for fast/highend GPU/CPU and 8K for lowend CPU/GPUs. If GPU limits, there will be no difference in GPU Gigaflops. But on very fast GPUs 32K may give much higher GPU Gigaflops - more GPU load=less waste of OpenCL overhead time. DL Links on 1 post - OS X 10.6.1 updated ATI/Intel+Nvidia OpenGL drivers, but the OpenCL Framework stays same. So ATI will fail (i am sure) also with 10.6.1 running Apple OpenCL Demos (here listed). Apple does an rewrite of the GALAXIES OpenCL Demo/Bench - Nvidia GTX285 will rise from 280 GigaFlops to around 400 GigaFlops The CPU GigaFlops stays same around 28 GigaFlops C2D /100 GigaFlops MacPro 2009 . This new Version will compiled and shared here like the last one. Slower OpenCL GPUs, like mine 9600 GT should not expect such an big GigaFlop boost with that new Galaxies (N-Body) Apple Demo. OPENCL - Good to know : - OpenCL is an API for universal GPU(CPU) computing - main difference to CUDA / ATI STEAM is: both only working with their "own" gpu. an CUDA (NV) app like badaboom(h264 on GPU) cant work on an ATI gpu and vice versa - OpenCL is universal for different gpu vendors means: - Xcode / GCC compiles an code which includes the source (in C as an string) for the gpu programm that c source is , different to CUDA/ATI STEAM , is compiled later by OpenCL Framework at runtime ! So same App can run on complete different gpus and also , without/less codechange om CPU if no OpemCL gpu (newer ones) is found The source (example below) for the gpu programm will be really compiled at runtime, not only interpreted. So little differences between run of my bench may happen because of that compile on the run NEWS: form An Information from ATI OS X OpenCL divison dev team: Thank you for the quick response and I hope you extend the benchmark application since it’s a really good idea. Regarding the sample applications posted on the developer.apple.com website (eg. Galaxies, Qjulia, etc), we are aware that some fail (or even crash) on AMD hardware and working to track down all these issues. We suspect that most of these issues will be resolved for the next graphics driver update in Snow Leopard. BTW, I ran the demos on a iMac with a Radeon 4850 and I get the following results: OpenCL Benches: G A L A X I E S - an CPU vs GPU GigaFlop Bench UPDATE: Due to an hint from ATI, i increased the count of thing to compute from 4K to 16K New 16K version is available , shows 16K at the legend. Apple reduces that count for GPUs without discrete VRAM = 9400M / 8600M, so it is set to 4K, even legend shows 16K. You cant compare 9400M / 8600M 1:1 with the other, run 16K count Using of GALAXIES: Start Galaxy key s = switch compute Modes >CPU>Single/Multi, CPU-Vector/SSE Single/Multi>GPU> GPU+CPU> (bold=start Mode) key SPACE = Pause/go on key 6 = Reset Szene key Q = QUIT DOWNLOADs each 6 MB: NEW: http://rapidshare.com/files/286234291/Galaxies_32K_V2.zip NEW: http://rapidshare.com/files/286235157/Galaxies_8K_V2.zip mitch (C2D 3GHZ, NV 9600 GT , 1280x1024) V1 /V2 24 Gigaflops : CPU ( SIM: Vector Multi-Core CPU. Mode) 112 Gigaflops[/b] : NV 9600 GT 16K V2 142 Gigaflops : NV 9600 GT 32K[/b] Users results (new 32K V2 Version): CPU 18 G Nvidia 9600gt gpu 149 GigaFlops 32k v2.0 at 1680x1050 Gigabyte Ga-eg45m-ud2r, Intel e6750, Mac Pro Nehalem 8 core 2.93GHz:http://www.barefeats.com/index.html All tests 2500x1600! Nvidia GTX 285 ...... soon!! Mac Pro Nehalem 8 core CPU = ..... New OpenCL Transpose Bandwith - measures Bandwith of Matrix-Transpose DL Link at the end of posting (very small, run like all other terminal OpenCL Bench apps) mitch: Nvidia 9600GT: around 39 Gigabyte/Sec Mac Pro (1,1) 2.66Ghz 4GB RAM, 4870 1GB sapphire Performing Matrix Transpose [256 x 4096]... Bandwidth Achieved = 3.160816 GB/sec MacPro 2009, NVidia GTX 285 Mac Bandwidth Achieved = around 80 GB/sec UPDATE: New OpenCL AO Bench (512*512 insted of 256*256 barefeat = barefeat results / 2) DL Link at the end (very small, SSE4 optimized, 512*512 Window ) mitch NV 9600 GT : 8 FPS (512*512) C2D 3 GHZ : 0.8 FPS So ATI users may try the new compiled Procedural Geometric Displacement FPS Bench Download for ATI + Nvidia USERS: OpenCL_Displacement_Bench.zip - with step by step HOW TO RUN readme - Update: An new compiled Displacement (the app only) which was build with GCC 4.2 very less optimzed compiler settings seems to run on ATI 4870 more stable / reliable. If you have such problems with displacement, I attatched the small dl at the first post as displacement_ATi for overwrite+usage with the whole (normal) 7 MB dl. QJulia1024 Results (the qJulia with 1024x1024 window size) 9600 GT , around 13 FPS static - please let the bench first (wait a few seconds) show static FPS before you switch to animate (SPACE) 8-16 FPS when animating Rob GTX 285 Mac 1024x1024 = 44 fps eVGA GTX-285 1024MB 46.70fps 8800 GT 22.46 FPS qJulia Results (800x800 window) 9600GT : around 29 FPS static, 16 - 60 FPS when animating (key SPACE) eVGA GTX-285 1024MB 98.79 fps Rob GTX 285 Mac 800x800 = 93 fps 2.4GHz C2D, 4GB RAM, 8600M GT (256MB) qJulia (800x800): static shows 10 - 11 FPS animated shows 9 - 11 FPS MacBook Pro 13", GPU GeForce 9400M running at 6,25 fps (6-6,50) OpenCL Displacement FPS results (ATI should work !) mitch 9600 GT 80 FPS first (white background+shadow) / 102 FPS second (with texture in backround) ATI 4870 both around 90 FPS - but only 1/3 of start the bench was successfull - so also that bench didnt work 100% well - wait for OS X 10.6.1 Geforce GTX285 Mac : around 220 FPS, second shader test Radeon 4870 1GB (sapphire) Mac Pro (1,1) 2.66Ghz quad core, 4GB RAM both szenes near 90 FPS Most 4870 are near together between 84 and 90 FPS - but some test fail and some get bad result window graphics GRASS simulates an scene grass sticks moving in the wind Grass Results 4 Meg triangles + 170.000 Sticks to compute - big szene! ( 1024x1024 window size) 9600 GT , around 53 FPS 2.4GHz C2D, 4GB RAM, 8600M GT (256MB) 27 - 29 FPS 8800 GT 56.97 fps i920 Overclocked to 4.4Ghz, 1760Mhz DDR3, PCIE-100Mhz, eVGA GTX-285 1024MB 95.50 fps Rob ( barefeats! Test mule is Mac Pro Nehalem 2.93 Octo) GeForce GTX 285 Mac = 88 - 91 fps Quadro FX 4800 = 77 fps steady GeForce 8800 GT = 54 fps steady GeForce GT 120 = 35 fps steady DL for qJulia + Grass (has GUI) (at the end) Read the readme - you will ger an file not found error (loads the qjulia.cl OpemCL source, if you didnt changed terminal directory to the app folder before running the command line app. For all GLUT (Terminal Apps, Transpose+qJluia+AO) check the app preferences of SYNC is OFF (screenshoot OpenCL AO preferences)
Attached File(s)
OpenCL_Transpose__Bandwidh.zip ( 6.24K )
Number of downloads: 515
displacement_ATI.zip ( 22.07K )
Number of downloads: 414
opencl_aobench.zip ( 145.75K )
Number of downloads: 651
Grass.zip ( 429.8K )
Number of downloads: 868
qJulia1024.zip ( 12.68K )
Number of downloads: 498
OpenCL_Qjulia_GPU.zip ( 92.24K )
Number of downloads: 706
Bildschirmfoto_2009_09_01_um_09.02.58.jpg ( 102.32K )
Number of downloads: 198
Bildschirmfoto_2009_09_01_um_19.39.37.jpg ( 388.74K )
Number of downloads: 245
Bildschirmfoto_2009_09_04_um_14.46.23.jpg ( 88.81K )
Number of downloads: 306
FPS_8.1_Nvidia_9600GT.jpg ( 79.43K )
Number of downloads: 152
Bildschirmfoto_2009_09_06_um_11.20.42.jpg ( 69.81K )
Number of downloads: 216
Bildschirmfoto_2009_09_28_um_23.18.48.jpg ( 82.5K )
Number of downloads: 246
screen.jpg ( 170.61K )
Number of downloads: 55 |
mitch_de All OpenCL Benches: RAYTRACING/Galaxies/Grass/qJulia/Displacement... Aug 30 2009, 07:08 PM
GLXOZ QUOTE (mitch_de @ Aug 31 2009, 06:08 AM) ... Aug 30 2009, 07:25 PM
reinstaller 1600x1200
20 Gigaflops / around 59 U/sec : CPU ( S... Aug 30 2009, 08:41 PM
Sherry Haibara 1280x800
20 Gigaflops - around 60 U/sec (CPU: Core... Aug 30 2009, 09:00 PM
blackosx Mitch, this is lovely - Thanks
blackosx (C2D 2.66... Aug 30 2009, 09:12 PM
Chaz_UK I just did a little trial on my 20 inch iMac and i... Aug 30 2009, 09:45 PM
AppleIIGuy 9800GT
Core i7 920 stock speed
290 Updates/sec
98... Aug 31 2009, 12:03 AM
radov4n Great tool
1600x1200, i7
http://i28.tinypic.com... Aug 31 2009, 09:06 AM
johan why do result go up and down. in same session?
i ... Aug 31 2009, 02:52 PM
mitch_de QUOTE (johan @ Aug 31 2009, 04:52 PM) why... Aug 31 2009, 04:05 PM
Cyberdog ! don't work on my iMac Core Duo 1.83 Ghz + ATI ... Aug 31 2009, 07:34 PM
proengin W3520 overclocked to 4.1Ghz Turbo, PCIE 102Mhz, 12... Sep 1 2009, 05:46 AM
mitch_de QUOTE (proengin @ Sep 1 2009, 07:46 AM) W... Sep 1 2009, 10:38 AM
olaszvandor QUOTE (proengin @ Sep 1 2009, 06:46 AM) W... Feb 10 2010, 05:12 PM
rob-ART I will be posting some results on BareFeats.com to... Sep 1 2009, 01:54 PM
JBeed I'm a bit worried here.
My MacBook Pro (2.4GHz... Sep 1 2009, 06:44 PM
mitch_de Your results are normal - dont worry.
The 8600M GT... Sep 1 2009, 07:01 PM
reinstaller All my results are with cuda drivers. Sep 3 2009, 06:42 AM
n00b32 Hi,
how come CUDA on Mac OS X? Where did you get ... Oct 13 2009, 06:48 AM

shoarthing QUOTE (n00b32 @ Oct 13 2009, 07:48 AM) Hi... Oct 13 2009, 07:44 AM

n00b32 thnx, didn't know that
right now only the rel... Oct 13 2009, 11:52 AM
MarceloDub good Oct 30 2009, 04:09 AM
JBeed Well, a small update.
As of the new version, runni... Sep 5 2009, 12:39 AM
mitch_de UPDATE:
ATI devs give me an hint to increase the c... Sep 7 2009, 03:19 PM
proengin Here are my OpenCL_GALAXIES_16K_SSE4_VSYNC_OFF ben... Sep 10 2009, 05:47 AM
macguitarm Not sure if this is the correct topic / thread
Fo... Sep 16 2009, 06:12 PM
wesux QUOTE (macguitarm @ Sep 17 2009, 04:12 AM... Sep 17 2009, 11:59 PM
sch8mid QUOTE (wesux @ Sep 17 2009, 11:59 PM) Now... Sep 23 2009, 06:13 AM
mitch_de ATI Apple Dev told me that they didnt reached time... Sep 16 2009, 08:51 PM
mitch_de New Galaxies OpenCL Bench V2:
- Apple updated / fi... Sep 29 2009, 06:02 AM
Thijmus QUOTE (mitch_de @ Sep 29 2009, 08:02 AM) ... Sep 29 2009, 11:54 AM
mitch_de Thanks !
Have you tried also the 8K Version ? ... Sep 29 2009, 04:54 PM
Thijmus QUOTE (mitch_de @ Sep 29 2009, 06:54 PM) ... Sep 30 2009, 06:14 PM
shoarthing QUOTE (mitch_de @ Sep 29 2009, 05:54 PM) ... Oct 6 2009, 09:31 AM
osssua QUOTE (shoarthing @ Oct 6 2009, 09:31 AM)... Oct 6 2009, 01:37 PM
lamer0 Mirror for galaxies, I hate rapidshare with a pass... Sep 30 2009, 06:05 AM
mitch_de QUOTE (lamer0 @ Sep 30 2009, 08:05 AM) q8... Sep 30 2009, 10:12 AM
shoarthing Zotac ION-ITX with its integrated 9400M alone mana... Oct 7 2009, 06:49 AM
nvidia2008 QUOTE (shoarthing @ Oct 7 2009, 02:49 PM)... Oct 10 2009, 01:08 PM
shoarthing QUOTE (nvidia2008 @ Oct 10 2009, 02:08 PM... Oct 10 2009, 02:16 PM
Schenkenberg Awesome post!
Galaxies 32k running on 1900x12... Oct 7 2009, 01:22 PM
n00b32 Hi,
I added a table for better comparison of the ... Oct 9 2009, 08:44 AM
cwestpha Hmm on my 2008 Mac Pro 2.8 Ghz 8-core with 285 GTX... Oct 12 2009, 11:49 PM
byronrock Is there a way to run the other benchmark in a hac... Nov 1 2009, 07:08 AM
AROBASEFR Hi
I've got success with ATI HD4850 Gainward ... Nov 11 2009, 02:27 PM
dudelolchris All the OpenCL demos crash on my brand new Late 20... Nov 13 2009, 03:55 AM
computergek80 They crash for me too, 27" iMac Radeon HD 467... Nov 16 2009, 11:06 PM
mitch_de Be pattient.
Apple will for sure fix that problem... Nov 18 2009, 06:32 AM
J the Ninja Hey, everybody. I wanted to post this link in here... Nov 29 2009, 09:42 PM
dudelolchris On ati radeon HD 4670: ~ 9 fps in 2D and heightmap... Dec 1 2009, 04:15 AM
nvidia2008 QUOTE (dudelolchris @ Dec 1 2009, 12:15 P... Dec 22 2009, 12:12 PM
animeboy Hi,
I just got a new upgrade for my Early 2008 ... Dec 1 2009, 04:25 AM
rushko Hi,
Any news here?
My results for Galaxies 32K
... Jan 22 2010, 08:49 PM
mitch_de YEP !
Galaxies 32K , NV 8800GTX : 197 Gigaflop... Jan 24 2010, 07:11 PM
blackosx Hi mitch_de
Well done with keeping up to date wit... Jan 25 2010, 07:28 AM
mitch_de Ist normal that the ray/sec needs some time to sta... Jan 25 2010, 01:09 PM
dodusman Hello
snow lepoard 10.6.2 64 bits nvidia 250 gt... Jan 26 2010, 11:55 AM
mitch_de PLEASE READ THE "HOW TO RUN" file within... Jan 26 2010, 12:58 PM
rushko Hi Mitch_de,
I am tryin to start mandelbrot and n... Jan 30 2010, 03:56 PM
mitch_de Thanks sharing your results !
Big OpenCL advan... Jan 30 2010, 10:39 PM
rushko Hi,
That's good news Mitch, I just wonder whe... Jan 31 2010, 02:05 AM
mitch_de QUOTE (rushko @ Jan 31 2010, 03:05 AM) Hi... Jan 31 2010, 07:40 AM
mitch_de New Version of smalllux (OpenCL raytraycing) . Now... Feb 16 2010, 07:19 PM
mrheat great stuff mitch,
finally reaching 447W Total Po... Feb 17 2010, 08:01 AM
mitch_de QUOTE (mrheat @ Feb 17 2010, 09:01 AM) gr... Feb 17 2010, 12:29 PM
mikoffski Core i7 920 2.66Ghz + Radeon HD4850
Dragon scene ... Feb 24 2010, 01:29 AM
mitch_de THANKS for your values !
Even 4850 performes ... Mar 2 2010, 06:43 PM
jeanlain Hi, I tried the latest version :
With the little... Mar 5 2010, 12:28 PM
mitch_de Thanks for your results of GPU only !
Indeed 4... Mar 5 2010, 12:39 PM
jeanlain Thanks for the info.
With GPU + 4 CPUs I get 1480... Mar 5 2010, 03:53 PM
mitch_de You see that highly optimized smallux does hybrid ... Mar 5 2010, 07:04 PM
jeanlain QUOTE (mitch_de @ Mar 5 2010, 08:04 PM) Y... Mar 6 2010, 11:37 AM
mitch_de GREAT classroom KSamples /Sec !
I dont think y... Mar 6 2010, 03:04 PM
mitch_de New Version uploaded (link #1 post) !
V1.4.3
... Mar 21 2010, 10:44 AM
jeanlain Hey, seems like 10.6.3 has improved speed a bit.
O... Mar 29 2010, 09:01 PM
mitch_de The newest always you will find since now more eas... Mar 30 2010, 07:28 AM
jeanlain 1400 Ksamples/sec and 2900 Krays/s (luxball), GPU ... Mar 30 2010, 09:19 PM
mitch_de QUOTE (jeanlain @ Mar 30 2010, 10:19 PM) ... Mar 31 2010, 06:41 AM
mitch_de Uploaded 1.4.6 at macupdate, with feature to incre... Mar 31 2010, 08:01 AM
nissefar Anyone got results from a Macbook Pro with 9600m i... Apr 4 2010, 10:42 PM
mitch_de Would be interesting !
Newest Version 1.5.2, ... Apr 6 2010, 05:36 PM
mm67 My Geforce 250 GTS and 9600 GT results:
Apr 6 2010, 09:18 PM
mitch_de Thanks will be referenced in next version
An GTX 2... Apr 6 2010, 10:09 PM
mastershredder Im sorry to say pwned XFX 4870 1Gb default clocks... Apr 8 2010, 01:25 AM
jeanlain midrange GPU: 9.2 sec
highend GPU: 16.6 sec
radeo... Apr 10 2010, 08:48 PM
mitch_de No, it is not optimized for ATI gpus.
With OpenCl ... Apr 15 2010, 05:48 AM
AROBASEFR My results with Smallux 1.5.7
OpenCL GPU 0: Radeo... Apr 25 2010, 12:06 PM
mitch_de Thanks !
SLG updated to 1.5.8 !
all times ... May 5 2010, 05:25 AM
AROBASEFR QUOTE (mitch_de @ May 5 2010, 05:25 AM) T... May 6 2010, 05:47 PM
mitch_de Updated to 1.6.0
Bugfix for some gpu only results.... May 7 2010, 11:14 AM
AROBASEFR QUOTE (mitch_de @ May 7 2010, 11:14 AM) U... May 7 2010, 05:22 PM
mitch_de Many thanks !
Did the new pixelfilter bench g... May 8 2010, 04:16 AM
AROBASEFR QUOTE (mitch_de @ May 8 2010, 04:16 AM) M... May 8 2010, 10:00 AM
mitch_de QUOTE (AROBASEFR @ May 8 2010, 12:00 PM) ... May 8 2010, 01:07 PM
mitch_de Updated to 1.6.1 : an gap line between OpenCL pixe... May 8 2010, 08:14 PM
AROBASEFR QUOTE (mitch_de @ May 8 2010, 08:14 PM) U... May 9 2010, 06:30 PM
mitch_de Thanks very much.
Great OpenCL pixeldevice filter... May 10 2010, 03:18 AM ![]() |
|
Lo-Fi Version | Time is now: 31st July 2010 - 02:05 PM |