Jump to content

CUDA-Z Info+Bench (Nvidia only) - updated Oct 2012


  • Please log in to reply
74 replies to this topic

#1
mitch_de

mitch_de

    InsanelyMacaholic

  • Local Moderators
  • 2,884 posts
  • Gender:Male
  • Location:Stuttgart / Germany
Hi,
some OS X Apps already use CUDA (must be installed extra, Nvidia gpus only!!) , like Squeeze 7, Mathematica, Toast 11 for h.264 encoding export also.

I found an great CUDA-Z Tool which shows much informations and also has an small benchmarks (VRAM speed, PCI-E <> VRAM Speed,..) within.

EDIT: New Beta Version available!
Added DL Link and new screenshoots of my 9600GT / 10.8.2
New version shows more details about CUDA and gpu card.
GPU GHz + VRAM GHz are max+fixed values (like using OpenCL info) and can´t be used to check AGPM.


http://sourceforge.n...es/cuda-z/Beta/

Attached Files



#2
blackosx

blackosx

    InsanelyMacaholic

  • Coders
  • 3,085 posts
  • Gender:Male
  • Location:UK
Nice one mitch. Good to see you're still keeping an eye on the video drivers / benchmarking. I'll give this a go on the weekend.

#3
mitch_de

mitch_de

    InsanelyMacaholic

  • Local Moderators
  • 2,884 posts
  • Gender:Male
  • Location:Stuttgart / Germany
In diff to OpenCL , GTX4xx card should run CUDA (withz newest CUDA drivers) also with OS X 10.6.x. Would be nice to see some CUDA GTX4xx SL basic benchmark values (Gigaflops + memory copy speeds / VRAM speed (dev to dev speed).

#4
morfy

morfy

    InsanelyMac Legend

  • Members
  • PipPipPipPipPipPipPip
  • 890 posts
Good very thx :rolleyes:

Update:
Attached File  Schermata_2011_04_09_a_21.33.18.png   55.49KB   521 downloads
Attached File  Schermata_2011_04_09_a_21.33.25_1.png   46.53KB   335 downloads
Attached File  Schermata_2011_04_09_a_21.33.33.png   61.66KB   332 downloads

#5
Gringo Vermelho

Gringo Vermelho

    The Jan Bird fix

  • Supervisors
  • 6,111 posts
  • Gender:Male
  • Location:Brazil
For mitch.

1GB ASUS ENGTX460 on 10.6.7, latest Quadro 4000 drivers, CUDA driver and SDK installed.

CUDA Device Query (Runtime API) version (CUDART static linking)

There is 1 device supporting CUDA

Device 0: "GeForce GTX 460"
  CUDA Driver Version / Runtime Version		  4.0 / 4.0
  CUDA Capability Major/Minor version number:	2.1
  Total amount of global memory:				 1024 MBytes (1073414144 bytes)
  ( 7) Multiprocessors x (48) CUDA Cores/MP:	 336 CUDA Cores
  GPU Clock Speed:							   1.35 GHz
  Memory Clock rate:							 1800.00 Mhz
  Memory Bus Width:							  256-bit
  L2 Cache Size:								 524288 bytes
  Max Texture Dimension Size (x,y,z)			 1D=(65536), 2D=(65536,65535), 3D=(2048,2048,2048)
  Max Layered Texture Size (dim) x layers		1D=(16384) x 2048, 2D=(16384,16384) x 2048
  Total amount of constant memory:			   65536 bytes
  Total amount of shared memory per block:	   49152 bytes
  Total number of registers available per block: 32768
  Warp size:									 32
  Maximum number of threads per block:		   1024
  Maximum sizes of each dimension of a block:	1024 x 1024 x 64
  Maximum sizes of each dimension of a grid:	 65535 x 65535 x 65535
  Maximum memory pitch:						  2147483647 bytes
  Texture alignment:							 512 bytes
  Concurrent copy and execution:				 Yes with 1 copy engine(s)
  Run time limit on kernels:					 Yes
  Integrated GPU sharing Host Memory:			No
  Support host page-locked memory mapping:	   Yes
  Concurrent kernel execution:				   Yes
  Alignment requirement for Surfaces:			Yes
  Device has ECC support enabled:				No
  Device is using TCC driver mode:			   No
  Device supports Unified Addressing (UVA):	  No
  Device PCI Bus ID / PCI location ID:		   1 / 0
  Compute Mode:
	 < Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 4.0, CUDA Runtime Version = 4.0, NumDevs = 1, Device = GeForce GTX 460
[./deviceQuery] test results...
PASSED

./bandwidthTest Starting...

Running on...

 Device 0: GeForce GTX 460
 Quick Mode

 Host to Device Bandwidth, 1 Device(s), Paged memory
   Transfer Size (Bytes)	Bandwidth(MB/s)
   33554432			2516.6

 Device to Host Bandwidth, 1 Device(s), Paged memory
   Transfer Size (Bytes)	Bandwidth(MB/s)
   33554432			2199.2

 Device to Device Bandwidth, 1 Device(s)
   Transfer Size (Bytes)	Bandwidth(MB/s)
   33554432			58782.2

[./bandwidthTest] test results...
PASSED


#6
vbetts

vbetts

    InsanelyMac Deity

  • Members
  • PipPipPipPipPipPipPipPipPipPip
  • 1,609 posts
  • Gender:Male
Dirty bit.
Posted Image
Posted Image
Posted Image

#7
Graebags

Graebags

    InsanelyMac Geek

  • Members
  • PipPipPip
  • 129 posts
  • Gender:Male
  • Location:Canberra
Hi & thanks,
with Lion DP2, I'm getting the Cuda 4 beta driver installing ok, and runs Galaxies, and Displacement bench, but Cuda-Z reports it as "Cuda not Found".

#8
iLeopod

iLeopod

    InsanelyMac Sage

  • Members
  • PipPipPipPipPipPip
  • 435 posts
  • Gender:Male
  • Location:Germany
  • Interests:ileopod.wordpress.com
Here on my Ion aka 9400m:Attached File  CUDA_Z.html   3.45KB   89 downloads

and 9800 Gt :Attached File  9800gt.html   3.46KB   77 downloads

#9
Chaz_UK

Chaz_UK

    InsanelyMac Protégé

  • Members
  • Pip
  • 29 posts
  • Gender:Male
  • Location:England
Here are some lowly 8500GT results:
Posted ImagePosted Image

Slowest so far! :o

#10
Graebags

Graebags

    InsanelyMac Geek

  • Members
  • PipPipPip
  • 129 posts
  • Gender:Male
  • Location:Canberra
My GTX460 results on SL 10.6.7

Attached Files



#11
Gringo Vermelho

Gringo Vermelho

    The Jan Bird fix

  • Supervisors
  • 6,111 posts
  • Gender:Male
  • Location:Brazil
Your GTX 460 is clocked higher than mine - compare:
Attached File  GTX460.jpg   136.62KB   176 downloads
Your extra 80 Mhz = nice g1g4fl0pz boost.

And look at your pageable memory copy, that's crazy...it's more than twice as fast as mine!

You: Core i7, memory controller is part of the CPU, DDR3

Me: Core 2 Duo, P45 chipset, DDR2

#12
Graebags

Graebags

    InsanelyMac Geek

  • Members
  • PipPipPip
  • 129 posts
  • Gender:Male
  • Location:Canberra
Hi,
wait till someone with an i7-2600 posts results, lol.
My GTX460 has factory OC at 715/1430 mhz. My i7 is boosted using the Gigabyte Quickboost utility to 3.45ghz. These are some interesting (but depressing) charts: http://www.cpubenchm...h_end_cpus.html

#13
Gringo Vermelho

Gringo Vermelho

    The Jan Bird fix

  • Supervisors
  • 6,111 posts
  • Gender:Male
  • Location:Brazil
*lmao* yeah that's depressing alright.

I just kept scrolling down and down and down.. until my E8500 finally appeared waaaaay down at the end. ;)

The fastest CPU on that list that would work in my motherboard is the Intel Core 2 Extreme X9750 - interestingly, its clock frequency is 3.16 GHz just like my E8500, but it scores in the low 5000s - almost twice as much as the E8500. I wonder if there's more to it than the two extra cores.

I can't find it for sale online, which is probably for the best, I'm sure it costs a million dollah..

#14
MacFanatic76

MacFanatic76

    InsanelyMac Protégé

  • Members
  • PipPip
  • 62 posts
  • Gender:Male
Thanx for sharing, Mitch !

Werde das mal etwa spaeter ausprobieren ! :blink:

#15
morfy

morfy

    InsanelyMac Legend

  • Members
  • PipPipPipPipPipPipPip
  • 890 posts
Not work on my Lion preview 2.(cuda driver 4.0.13rc)

#16
Flashe

Flashe

    Flashy ~ Flasheu

  • Members
  • PipPipPipPipPip
  • 298 posts
  • Location:93 Carats
Hi all,

Don't work for me too with Lion DP2.

My graphics card is 9800 GT.


Posted Image

#17
Gringo Vermelho

Gringo Vermelho

    The Jan Bird fix

  • Supervisors
  • 6,111 posts
  • Gender:Male
  • Location:Brazil
Have you installed the nvidia CUDA driver?

namaste.

#18
Carstiman

Carstiman

    InsanelyMac Geek

  • Members
  • PipPipPip
  • 116 posts
  • Gender:Male
hello, need some help to install the deviceQuery

xcode, cuda driver, toolkit and tools are installed.

i open cd /Developer/GPU\ Computing in terminal and execute make.


bash-3.2# make
 make -C src/alignedTypes/ 
 make -C src/asyncAPI/ 
 make -C src/bandwidthTest/ 
 make -C src/bicubicTexture/ 
 make -C src/bilateralFilter/ 
 make -C src/binomialOptions/ 
 make -C src/BlackScholes/ 
 make -C src/boxFilter/ 
 make -C src/clock/ 
 make -C src/concurrentKernels/ 
 make -C src/convolutionFFT2D/ 
 make -C src/convolutionSeparable/ 
 make -C src/convolutionTexture/ 
 make -C src/cppIntegration/ 
 make -C src/dct8x8/ 
 make -C src/deviceQuery/ 
 ld: can't open output file for writing: ../../bin/darwin/release/deviceQuery, errno=21
 collect2: ld returned 1 exit status
 make[2]: *** [../../bin/darwin/release/deviceQuery] Error 1
 make[1]: *** [src/deviceQuery/Makefile.ph_build] Error 2
 make: *** [all] Error 2

Cuda Z and Pyrit work with the cuda drive but can´t get the deviceQuery.

could you please help ?

#19
Flashe

Flashe

    Flashy ~ Flasheu

  • Members
  • PipPipPipPipPip
  • 298 posts
  • Location:93 Carats

Have you installed the nvidia CUDA driver?

namaste.


Hi Gringo Vermelho,

I installed:
- Developer Drivers for MacOS
- CUDA Toolkit
- CUDA Tools SDK
- GPU Computing SDK code samples

all that is in the link : http://developer.nvi...oolkit-40#MacOS

#20
Carstiman

Carstiman

    InsanelyMac Geek

  • Members
  • PipPipPip
  • 116 posts
  • Gender:Male

hello, need some help to install the deviceQuery

xcode, cuda driver, toolkit and tools are installed.

i open cd /Developer/GPU\ Computing in terminal and execute make.

Cuda Z and Pyrit work with the cuda drive but can´t get the deviceQuery.

could you please help ?


solved, installed all again and "make" works fine now ;)

bash-3.2# ./deviceQuery
[./deviceQuery] starting...
./deviceQuery Starting...

 CUDA Device Query (Runtime API) version (CUDART static linking)

There is 1 device supporting CUDA

Device 0: "GeForce GTS 450"
  CUDA Driver Version / Runtime Version		  4.0 / 4.0
  CUDA Capability Major/Minor version number:	2.1
  Total amount of global memory:				 1024 MBytes (1073283072 bytes)
  ( 4) Multiprocessors x (48) CUDA Cores/MP:	 192 CUDA Cores
  GPU Clock Speed:							   1.57 GHz
  Memory Clock rate:							 1804.00 Mhz
  Memory Bus Width:							  128-bit
  L2 Cache Size:								 262144 bytes
  Max Texture Dimension Size (x,y,z)			 1D=(65536), 2D=(65536,65535), 3D=(2048,2048,2048)
  Max Layered Texture Size (dim) x layers		1D=(16384) x 2048, 2D=(16384,16384) x 2048
  Total amount of constant memory:			   65536 bytes
  Total amount of shared memory per block:	   49152 bytes
  Total number of registers available per block: 32768
  Warp size:									 32
  Maximum number of threads per block:		   1024
  Maximum sizes of each dimension of a block:	1024 x 1024 x 64
  Maximum sizes of each dimension of a grid:	 65535 x 65535 x 65535
  Maximum memory pitch:						  2147483647 bytes
  Texture alignment:							 512 bytes
  Concurrent copy and execution:				 Yes with 2 copy engine(s)
  Run time limit on kernels:					 Yes
  Integrated GPU sharing Host Memory:			No
  Support host page-locked memory mapping:	   Yes
  Concurrent kernel execution:				   Yes
  Alignment requirement for Surfaces:			Yes
  Device has ECC support enabled:				No
  Device is using TCC driver mode:			   No
  Device supports Unified Addressing (UVA):	  No
  Device PCI Bus ID / PCI location ID:		   1 / 0
  Compute Mode:
	 < Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 4.0, CUDA Runtime Version = 4.0, NumDevs = 1, Device = GeForce GTS 450
[./deviceQuery] test results...
PASSED

Press ENTER to exit...

bash-3.2#






0 user(s) are reading this topic

0 members, 0 guests, 0 anonymous users

© 2014 InsanelyMac  |   News  |   Forum  |   Downloads  |   OSx86 Wiki  |   Mac Netbook  |   PHP hosting by CatN  |   Designed by Ed Gain  |   Logo by irfan  |   Privacy Policy