mitch_de Posted January 15, 2013 Share Posted January 15, 2013 Hi, smallluxgpu (SLG) is an OpenCL powered gpu raytraycing used in Luxrender and well known Luxmark. The devs work on SLG4 and need to know which OS X / GPU combinations work / not work (errors in terminal insted luxball window). EDIT: AMD OpenCL drivers seems to have an BUG which does overbrighten the result. EDIT: 18.02. new test version EDIT: try, new special scenes with slg4 test from post #25 We have collected AMD reports THANKS. You may check if older, non Fermi gpus work like my 9600GT. Please test it and report: 1. OS X version, 2. gpu type, 3. speed : xyM samples/sec how to: download the zip, unpack it somewhere, start the terminal, cd path_to_main folder_slg4 (hint: type cd SPACE and than drag& drop the folder into the terminal + ENTER), copy (from Textedit opened Instructions_for_testing.txt the commands into the terminal ENTER ESC key quits slg. EDIT: try, new special scenes with slg4 test from post #25 results: mostly gpu speed bound but also cpu speed has an minor effect for speed! Nvidia 9600 GT, OS X 10.8.3 D50, 0.13M samples/sec, WORKING AMD 5570, 10.8.2, 0.86M, BUG AMD 6870, 10.8.3 D50, 1.37M, BUG log - working, no errors: ... [sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms [sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=65536 -D PARAM_IMAGE_WIDTH=640 -D PARAM_IMAGE_HEIGHT=480 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=16 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_ENABLE_MAT_GLASS -D PARAM_HAS_PASSTHROUGH -D PARAM_DIRECT_LIGHT_SAMPLING -D PARAM_DL_LIGHT_COUNT=4 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=0 [sLG] [PathOCLRenderThread::0] Compiling kernels [sLG] [PathOCLRenderThread::0] Kernels not cached [sLG] [PathOCLRenderThread::0] Compiling Init Kernel [sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel [sLG] [PathOCLRenderThread::0] Kernels compilation time: 4ms [sLG] Done. Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/ Share on other sites More sharing options...
Wayang-NT Posted January 15, 2013 Share Posted January 15, 2013 12D50 - AMD6870 - samples/sec 1.37M but the image seem too bright ... Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1881111 Share on other sites More sharing options...
mitch_de Posted January 15, 2013 Author Share Posted January 15, 2013 OK, some AMD gpus have such probs. THANKS Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1881117 Share on other sites More sharing options...
iFIRE Posted January 15, 2013 Share Posted January 15, 2013 bash-3.2# cd /Users/iFIRE/Downloads/SLG4_alpha bash-3.2# /Users/iFIRE/Downloads/SLG4_alpha/slg4 [sLG] Reading configuration file: scenes/luxball/luxball.cfg [sLG] Configuration: [sLG] image.width = 640 [sLG] image.height = 480 [sLG] batch.halttime = 0 [sLG] scene.file = scenes/luxball/luxball.scn [sLG] scene.fieldofview = 45 [sLG] opencl.cpu.use = 0 [sLG] opencl.gpu.use = 1 [sLG] opencl.gpu.workgroup.size = 64 [sLG] screen.refresh.interval = 100 [sLG] film.gamma = 2.2 [sLG] path.maxdepth = 16 [sLG] path.russianroulette.depth = 5 [sLG] light.maxdepth = 16 [sLG] light.russianroulette.depth = 5 [LuxRays::SDL] Reading scene: scenes/luxball/luxball.scn [LuxRays::SDL] Camera postion: Point[0.5, -1.4, 1.7] [LuxRays::SDL] Camera target: Point[0, 0, 0.4] [LuxRays::SDL] Material definition: shell [LuxRays::SDL] Material definition: whitematte [LuxRays::SDL] Material definition: luxtext [LuxRays::SDL] Material definition: blacktext [LuxRays::SDL] Material definition: whitelight [LuxRays::SDL] The l1 object is a light sources with 2 triangles [LuxRays::SDL] The l2 object is a light sources with 2 triangles [LuxRays::SDL] PLY object count: 7 [LuxRays] OpenCL Platform 0: Apple [LuxRays] Device 0 name: NativeThread-000 [LuxRays] Device 0 type: NATIVE_THREAD [LuxRays] Device 0 compute units: 1 [LuxRays] Device 0 max allocable memory: 0MBytes [LuxRays] Device 0 max allocable memory block size: 0MBytes [LuxRays] Device 1 name: NativeThread-001 [LuxRays] Device 1 type: NATIVE_THREAD [LuxRays] Device 1 compute units: 1 [LuxRays] Device 1 max allocable memory: 0MBytes [LuxRays] Device 1 max allocable memory block size: 0MBytes [LuxRays] Device 2 name: NativeThread-002 [LuxRays] Device 2 type: NATIVE_THREAD [LuxRays] Device 2 compute units: 1 [LuxRays] Device 2 max allocable memory: 0MBytes [LuxRays] Device 2 max allocable memory block size: 0MBytes [LuxRays] Device 3 name: NativeThread-003 [LuxRays] Device 3 type: NATIVE_THREAD [LuxRays] Device 3 compute units: 1 [LuxRays] Device 3 max allocable memory: 0MBytes [LuxRays] Device 3 max allocable memory block size: 0MBytes [LuxRays] Device 4 name: NativeThread-004 [LuxRays] Device 4 type: NATIVE_THREAD [LuxRays] Device 4 compute units: 1 [LuxRays] Device 4 max allocable memory: 0MBytes [LuxRays] Device 4 max allocable memory block size: 0MBytes [LuxRays] Device 5 name: NativeThread-005 [LuxRays] Device 5 type: NATIVE_THREAD [LuxRays] Device 5 compute units: 1 [LuxRays] Device 5 max allocable memory: 0MBytes [LuxRays] Device 5 max allocable memory block size: 0MBytes [LuxRays] Device 6 name: NativeThread-006 [LuxRays] Device 6 type: NATIVE_THREAD [LuxRays] Device 6 compute units: 1 [LuxRays] Device 6 max allocable memory: 0MBytes [LuxRays] Device 6 max allocable memory block size: 0MBytes [LuxRays] Device 7 name: NativeThread-007 [LuxRays] Device 7 type: NATIVE_THREAD [LuxRays] Device 7 compute units: 1 [LuxRays] Device 7 max allocable memory: 0MBytes [LuxRays] Device 7 max allocable memory block size: 0MBytes [LuxRays] Device 8 name: Intel® Core™ i7 CPU 860 @ 2.80GHz [LuxRays] Device 8 type: OPENCL_CPU [LuxRays] Device 8 compute units: 8 [LuxRays] Device 8 max allocable memory: 4096MBytes [LuxRays] Device 8 max allocable memory block size: 1024MBytes [LuxRays] Device 9 name: ATI Radeon HD 5770 [LuxRays] Device 9 type: OPENCL_GPU [LuxRays] Device 9 compute units: 10 [LuxRays] Device 9 max allocable memory: 1024MBytes [LuxRays] Device 9 max allocable memory block size: 256MBytes [LuxRays] Preprocessing DataSet [LuxRays] Total vertex count: 223380 [LuxRays] Total triangle count: 260794 [LuxRays] Total vertices memory usage: 2617Kbytes [LuxRays] Total triangles memory usage: 3056Kbytes [LuxRays] Building QBVH, primitives: 260794, initial nodes: 21734 [LuxRays] QBVH completed with 43467/43468 nodes [LuxRays] Total QBVH memory usage: 4754Kbytes [LuxRays] Total QBVH QuadTriangle count: 87103 [LuxRays] Max. QBVH Depth: 25 [LuxRays] Creating 1 intersection device(s) [LuxRays] Allocating intersection device 0: ATI Radeon HD 5770 (Type = OPENCL_GPU) [sLG] OpenCL Devices used: [sLG] [ATI Radeon HD 5770Intersect] [LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] Ray buffer size: 3072Kbytes [LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] Ray hits buffer size: 1024Kbytes [LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] OpenCL image support is available [LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] OpenCL max. image buffer size: 8192x8192 [LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] QBVH node image buffer size: 553x551 [LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] QBVH triangle image buffer size: 940x927 [LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] Enabled QBVH scene storage inside image [LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] QBVH max. stack size: 24 [LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] QBVH Image Storage kernel work group size: 256 [LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] Suggested work group size: 256 [LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] Forced work group size: 64 [sLG] Configuring 1 CPU render threads [sLG] [PathOCLRenderThread] OpenCL task count: 65536 [sLG] [PathOCLRenderThread] OpenCL max. page memory size: 262144Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Compile Camera [sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry [sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 14ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Materials [sLG] [PathOCLRenderThread::CompiledScene] Compile Textures [sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights [sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 1ms [sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight [sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight [sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight [sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps [sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 0 [sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 0ms [sLG] Starting 1 PathOCL render threads [sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4835Kbytes [sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes [sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] UVs buffer size: 1745Kbytes [sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes [sLG] [PathOCLRenderThread::0] Textures buffer size: 324bytes [sLG] [PathOCLRenderThread::0] Materials buffer size: 160bytes [sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 28bytes [sLG] [PathOCLRenderThread::0] Triangle AreaLights buffer size: 272bytes [sLG] [PathOCLRenderThread::0] Triangle AreaLights index buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Ray buffer size: 3072Kbytes [sLG] [PathOCLRenderThread::0] RayHit buffer size: 1024Kbytes [sLG] [PathOCLRenderThread::0] GPUTask buffer size: 23296Kbytes [sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes [sLG] [PathOCLRenderThread::0] SampleData buffer size: 512Kbytes [sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 256Kbytes [sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms [sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=65536 -D PARAM_IMAGE_WIDTH=640 -D PARAM_IMAGE_HEIGHT=480 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=16 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_ENABLE_MAT_GLASS -D PARAM_HAS_PASSTHROUGH -D PARAM_DIRECT_LIGHT_SAMPLING -D PARAM_DL_LIGHT_COUNT=4 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=0 [sLG] [PathOCLRenderThread::0] Compiling kernels [sLG] [PathOCLRenderThread::0] Kernels not cached [sLG] [PathOCLRenderThread::0] Compiling Init Kernel [sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel [sLG] [PathOCLRenderThread::0] Kernels compilation time: 743ms [sLG] Done. bash-3.2# HD 5770 Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1881130 Share on other sites More sharing options...
mitch_de Posted January 15, 2013 Author Share Posted January 15, 2013 Thanks, seems that AMD OpenCL drivers have problems in 10.8.2/3 (and before). Will see if some other (Fermi?) Nvidia gpus work already like mine. Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1881133 Share on other sites More sharing options...
Rampage Dev Posted January 15, 2013 Share Posted January 15, 2013 10.8.3 12D50 GTX 580 App comes up as black screen then locks up: [sLG] Reading configuration file: scenes/luxball/luxball.cfg [sLG] Configuration: [sLG] image.width = 640 [sLG] image.height = 480 [sLG] batch.halttime = 0 [sLG] scene.file = scenes/luxball/luxball.scn [sLG] scene.fieldofview = 45 [sLG] opencl.cpu.use = 0 [sLG] opencl.gpu.use = 1 [sLG] opencl.gpu.workgroup.size = 64 [sLG] screen.refresh.interval = 100 [sLG] film.gamma = 2.2 [sLG] path.maxdepth = 16 [sLG] path.russianroulette.depth = 5 [sLG] light.maxdepth = 16 [sLG] light.russianroulette.depth = 5 [LuxRays::SDL] Reading scene: scenes/luxball/luxball.scn [LuxRays::SDL] Camera postion: Point[0.5, -1.4, 1.7] [LuxRays::SDL] Camera target: Point[0, 0, 0.4] [LuxRays::SDL] Material definition: shell [LuxRays::SDL] Material definition: whitematte [LuxRays::SDL] Material definition: luxtext [LuxRays::SDL] Material definition: blacktext [LuxRays::SDL] Material definition: whitelight [LuxRays::SDL] The l1 object is a light sources with 2 triangles [LuxRays::SDL] The l2 object is a light sources with 2 triangles [LuxRays::SDL] PLY object count: 7 [LuxRays] OpenCL Platform 0: Apple [LuxRays] Device 0 name: NativeThread-000 [LuxRays] Device 0 type: NATIVE_THREAD [LuxRays] Device 0 compute units: 1 [LuxRays] Device 0 max allocable memory: 0MBytes [LuxRays] Device 0 max allocable memory block size: 0MBytes [LuxRays] Device 1 name: NativeThread-001 [LuxRays] Device 1 type: NATIVE_THREAD [LuxRays] Device 1 compute units: 1 [LuxRays] Device 1 max allocable memory: 0MBytes [LuxRays] Device 1 max allocable memory block size: 0MBytes [LuxRays] Device 2 name: NativeThread-002 [LuxRays] Device 2 type: NATIVE_THREAD [LuxRays] Device 2 compute units: 1 [LuxRays] Device 2 max allocable memory: 0MBytes [LuxRays] Device 2 max allocable memory block size: 0MBytes [LuxRays] Device 3 name: NativeThread-003 [LuxRays] Device 3 type: NATIVE_THREAD [LuxRays] Device 3 compute units: 1 [LuxRays] Device 3 max allocable memory: 0MBytes [LuxRays] Device 3 max allocable memory block size: 0MBytes [LuxRays] Device 4 name: NativeThread-004 [LuxRays] Device 4 type: NATIVE_THREAD [LuxRays] Device 4 compute units: 1 [LuxRays] Device 4 max allocable memory: 0MBytes [LuxRays] Device 4 max allocable memory block size: 0MBytes [LuxRays] Device 5 name: NativeThread-005 [LuxRays] Device 5 type: NATIVE_THREAD [LuxRays] Device 5 compute units: 1 [LuxRays] Device 5 max allocable memory: 0MBytes [LuxRays] Device 5 max allocable memory block size: 0MBytes [LuxRays] Device 6 name: NativeThread-006 [LuxRays] Device 6 type: NATIVE_THREAD [LuxRays] Device 6 compute units: 1 [LuxRays] Device 6 max allocable memory: 0MBytes [LuxRays] Device 6 max allocable memory block size: 0MBytes [LuxRays] Device 7 name: NativeThread-007 [LuxRays] Device 7 type: NATIVE_THREAD [LuxRays] Device 7 compute units: 1 [LuxRays] Device 7 max allocable memory: 0MBytes [LuxRays] Device 7 max allocable memory block size: 0MBytes [LuxRays] Device 8 name: Intel® Core i7-3820 CPU @ 3.60GHz [LuxRays] Device 8 type: OPENCL_CPU [LuxRays] Device 8 compute units: 8 [LuxRays] Device 8 max allocable memory: 65536MBytes [LuxRays] Device 8 max allocable memory block size: 16384MBytes [LuxRays] Device 9 name: GeForce GTX 580 [LuxRays] Device 9 type: OPENCL_GPU [LuxRays] Device 9 compute units: 16 [LuxRays] Device 9 max allocable memory: 1536MBytes [LuxRays] Device 9 max allocable memory block size: 384MBytes [LuxRays] Preprocessing DataSet [LuxRays] Total vertex count: 223380 [LuxRays] Total triangle count: 260794 [LuxRays] Total vertices memory usage: 2617Kbytes [LuxRays] Total triangles memory usage: 3056Kbytes [LuxRays] Building QBVH, primitives: 260794, initial nodes: 21734 [LuxRays] QBVH completed with 43467/43468 nodes [LuxRays] Total QBVH memory usage: 4754Kbytes [LuxRays] Total QBVH QuadTriangle count: 87103 [LuxRays] Max. QBVH Depth: 25 [LuxRays] Creating 1 intersection device(s) [LuxRays] Allocating intersection device 0: GeForce GTX 580 (Type = OPENCL_GPU) [sLG] OpenCL Devices used: [sLG] [GeForce GTX 580Intersect] [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Ray buffer size: 3072Kbytes [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Ray hits buffer size: 1024Kbytes [LuxRays] [OpenCL device::GeForce GTX 580Intersect] OpenCL image support is available [LuxRays] [OpenCL device::GeForce GTX 580Intersect] OpenCL max. image buffer size: 8192x8192 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH node image buffer size: 553x551 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH triangle image buffer size: 940x927 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Enabled QBVH scene storage inside image [LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH max. stack size: 24 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH Image Storage kernel work group size: 512 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Suggested work group size: 512 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Forced work group size: 64 [sLG] Configuring 1 CPU render threads [sLG] [PathOCLRenderThread] OpenCL task count: 65536 [sLG] [PathOCLRenderThread] OpenCL max. page memory size: 393216Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Compile Camera [sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry [sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 6ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Materials [sLG] [PathOCLRenderThread::CompiledScene] Compile Textures [sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights [sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight [sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight [sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight [sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps [sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 0 [sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 0ms [sLG] Starting 1 PathOCL render threads [sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4835Kbytes [sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes [sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] UVs buffer size: 1745Kbytes [sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes [sLG] [PathOCLRenderThread::0] Textures buffer size: 324bytes [sLG] [PathOCLRenderThread::0] Materials buffer size: 160bytes [sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 28bytes [sLG] [PathOCLRenderThread::0] Triangle AreaLights buffer size: 272bytes [sLG] [PathOCLRenderThread::0] Triangle AreaLights index buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Ray buffer size: 3072Kbytes [sLG] [PathOCLRenderThread::0] RayHit buffer size: 1024Kbytes [sLG] [PathOCLRenderThread::0] GPUTask buffer size: 23296Kbytes [sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes [sLG] [PathOCLRenderThread::0] SampleData buffer size: 512Kbytes [sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 256Kbytes [sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms [sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=65536 -D PARAM_IMAGE_WIDTH=640 -D PARAM_IMAGE_HEIGHT=480 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=16 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_ENABLE_MAT_GLASS -D PARAM_HAS_PASSTHROUGH -D PARAM_DIRECT_LIGHT_SAMPLING -D PARAM_DL_LIGHT_COUNT=4 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=0 [sLG] [PathOCLRenderThread::0] Compiling kernels [sLG] [PathOCLRenderThread::0] Kernels not cached [sLG] [PathOCLRenderThread::0] Compiling Init Kernel [sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel [sLG] [PathOCLRenderThread::0] Kernels compilation time: 240ms Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1881161 Share on other sites More sharing options...
stefano.85 Posted January 15, 2013 Share Posted January 15, 2013 for me (first system in rig with nvidia latest 10.8.2 retail drivers, latest cuda and opencl patch by netkas) it crashes before start... here's the log. Cheers slg4 log.rtf slg4 log terminal.rtf Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1881168 Share on other sites More sharing options...
mitch_de Posted January 16, 2013 Author Share Posted January 16, 2013 Thanks! To be sure that your GPU runs well with stable(older SLG), check if Luxmark (OpenCL Benchmark) works. If not it is an general prob with your OpenCL/Fermi combi. Normally Luxmark works also on Fermi gpus (if OpenCL works) http://www.luxrender..._OSXIntel64.zip Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1881196 Share on other sites More sharing options...
tora Posted January 16, 2013 Share Posted January 16, 2013 10.8.2 (12C60) GTX 560 Ti Last login: Wed Jan 16 03:12:52 on console localhost:~ victoriano$ cd /Users/victoriano/Downloads/SLG4_alpha localhost:SLG4_alpha victoriano$ /Users/victoriano/Downloads/SLG4_alpha/slg4 [sLG] Reading configuration file: scenes/luxball/luxball.cfg [sLG] Configuration: [sLG] image.width = 640 [sLG] image.height = 480 [sLG] batch.halttime = 0 [sLG] scene.file = scenes/luxball/luxball.scn [sLG] scene.fieldofview = 45 [sLG] opencl.cpu.use = 0 [sLG] opencl.gpu.use = 1 [sLG] opencl.gpu.workgroup.size = 64 [sLG] screen.refresh.interval = 100 [sLG] film.gamma = 2.2 [sLG] path.maxdepth = 16 [sLG] path.russianroulette.depth = 5 [sLG] light.maxdepth = 16 [sLG] light.russianroulette.depth = 5 [LuxRays::SDL] Reading scene: scenes/luxball/luxball.scn [LuxRays::SDL] Camera postion: Point[0.5, -1.4, 1.7] [LuxRays::SDL] Camera target: Point[0, 0, 0.4] [LuxRays::SDL] Material definition: shell [LuxRays::SDL] Material definition: whitematte [LuxRays::SDL] Material definition: luxtext [LuxRays::SDL] Material definition: blacktext [LuxRays::SDL] Material definition: whitelight [LuxRays::SDL] The l1 object is a light sources with 2 triangles [LuxRays::SDL] The l2 object is a light sources with 2 triangles [LuxRays::SDL] PLY object count: 7 [LuxRays] OpenCL Platform 0: Apple [LuxRays] Device 0 name: NativeThread-000 [LuxRays] Device 0 type: NATIVE_THREAD [LuxRays] Device 0 compute units: 1 [LuxRays] Device 0 max allocable memory: 0MBytes [LuxRays] Device 0 max allocable memory block size: 0MBytes [LuxRays] Device 1 name: NativeThread-001 [LuxRays] Device 1 type: NATIVE_THREAD [LuxRays] Device 1 compute units: 1 [LuxRays] Device 1 max allocable memory: 0MBytes [LuxRays] Device 1 max allocable memory block size: 0MBytes [LuxRays] Device 2 name: Intel(R) Core(TM)2 Duo CPU E8400 @ 3.00GHz [LuxRays] Device 2 type: OPENCL_CPU [LuxRays] Device 2 compute units: 2 [LuxRays] Device 2 max allocable memory: 4096MBytes [LuxRays] Device 2 max allocable memory block size: 1024MBytes [LuxRays] Device 3 name: GeForce GTX 560 Ti [LuxRays] Device 3 type: OPENCL_GPU [LuxRays] Device 3 compute units: 16 [LuxRays] Device 3 max allocable memory: 1024MBytes [LuxRays] Device 3 max allocable memory block size: 256MBytes [LuxRays] Preprocessing DataSet [LuxRays] Total vertex count: 223380 [LuxRays] Total triangle count: 260794 [LuxRays] Total vertices memory usage: 2617Kbytes [LuxRays] Total triangles memory usage: 3056Kbytes [LuxRays] Building QBVH, primitives: 260794, initial nodes: 21734 [LuxRays] QBVH completed with 43467/43468 nodes [LuxRays] Total QBVH memory usage: 4754Kbytes [LuxRays] Total QBVH QuadTriangle count: 87103 [LuxRays] Max. QBVH Depth: 25 [LuxRays] Creating 1 intersection device(s) [LuxRays] Allocating intersection device 0: GeForce GTX 560 Ti (Type = OPENCL_GPU) [sLG] OpenCL Devices used: [sLG] [GeForce GTX 560 TiIntersect] [LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] Ray buffer size: 3072Kbytes [LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] Ray hits buffer size: 1024Kbytes [LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] OpenCL image support is available [LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] OpenCL max. image buffer size: 8192x8192 [LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] QBVH node image buffer size: 553x551 [LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] QBVH triangle image buffer size: 940x927 [LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] Enabled QBVH scene storage inside image [LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] QBVH max. stack size: 24 [LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] QBVH Image Storage kernel work group size: 512 [LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] Suggested work group size: 512 [LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] Forced work group size: 64 [sLG] Configuring 1 CPU render threads [sLG] [PathOCLRenderThread] OpenCL task count: 65536 [sLG] [PathOCLRenderThread] OpenCL max. page memory size: 262144Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Compile Camera [sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry [sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 14ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Materials [sLG] [PathOCLRenderThread::CompiledScene] Compile Textures [sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights [sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 1ms [sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight [sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight [sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight [sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps [sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 0 [sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 0ms [sLG] Starting 1 PathOCL render threads [sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4835Kbytes [sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes [sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] UVs buffer size: 1745Kbytes [sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes [sLG] [PathOCLRenderThread::0] Textures buffer size: 324bytes [sLG] [PathOCLRenderThread::0] Materials buffer size: 160bytes [sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 28bytes [sLG] [PathOCLRenderThread::0] Triangle AreaLights buffer size: 272bytes [sLG] [PathOCLRenderThread::0] Triangle AreaLights index buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Ray buffer size: 3072Kbytes [sLG] [PathOCLRenderThread::0] RayHit buffer size: 1024Kbytes [sLG] [PathOCLRenderThread::0] GPUTask buffer size: 23296Kbytes [sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes [sLG] [PathOCLRenderThread::0] SampleData buffer size: 512Kbytes [sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 256Kbytes [sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms [sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=65536 -D PARAM_IMAGE_WIDTH=640 -D PARAM_IMAGE_HEIGHT=480 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=16 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_ENABLE_MAT_GLASS -D PARAM_HAS_PASSTHROUGH -D PARAM_DIRECT_LIGHT_SAMPLING -D PARAM_DL_LIGHT_COUNT=4 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=0 [sLG] [PathOCLRenderThread::0] Compiling kernels [sLG] [PathOCLRenderThread::0] Kernels not cached [sLG] [PathOCLRenderThread::0] Compiling Init Kernel [sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel [sLG] [PathOCLRenderThread::0] Kernels compilation time: 612ms [sLG] [PathOCLRenderThread::0] Rendering thread ERROR: clFlush(CL_INVALID_COMMAND_QUEUE) [sLG] ========================================================= [sLG] Unhandled exception [sLG] Obtained 5 stack frames. [sLG] 0 [sLG] 1 [sLG] 2 [sLG] 3 [sLG] 4 Abort trap: 6 localhost:SLG4_alpha victoriano$ Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1881208 Share on other sites More sharing options...
mitch_de Posted January 16, 2013 Author Share Posted January 16, 2013 Thanks, at least fermi OpenCL has , like AMD OpenCL probs with new OpenCL code (SLG4 uses newer OpenCL 1.1+1.2 features if available for the gpu type). 1. If you test SLG4 with fermi/kelper gpu please also crosstest if LUXMARK (SLG3 usage) works http://www.luxrender..._OSXIntel64.zip 2. Nvidia gpus like GTX 260, GTX 285 (less features for OpenCL) would be interesting, also 8800GT,9800GT 3. please also reprot OpenCL features (Opencl version usage) with attached OpenCLinfo. OpenCL_info.zip Looks like: GA_EP35:~ andreasm$ /Volumes/Programme_1/OPENCL/oclinfo ; exit; 1 OpenCL platform found! [Platform 0] Name: Apple Vendor: Apple Version: OpenCL 1.2 (Dec 4 2012 18:26:30) Profile: FULL_PROFILE Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event 2 OpenCL devices found! [Device 0] Name: Intel(R) Core(TM)2 Duo CPU E7300 @ 2.66GHz Vendor: Intel Type: CPU Device Version: OpenCL 1.2 ... [Device 1] Name: GeForce 9600 GT Vendor: NVIDIA Type: GPU Device Version: OpenCL 1.0 Driver Version: CLH 1.0 Compute Units: 8 Work Group Size: 512 Clock: 1680 MHz Global Memory: 512 MB Local Memory: 16 KB Cache Size: 0 KB Cache Line Size: 0 Bytes Available: Yes Double-Precision: No Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics logout Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1881260 Share on other sites More sharing options...
maximus Posted January 16, 2013 Share Posted January 16, 2013 It crashes with this: [sLG] [PathOCLRenderThread::0] Rendering thread ERROR: clFlush(CL_INVALID_COMMAND_QUEUE) [sLG] Unhandled exception [sLG] Obtained 5 stack frames. [sLG] 0 [sLG] 1 [sLG] 2 [sLG] 3 [sLG] 4 Abort trap: 6 Luxmark works ok. OpenCLinfo: 1 OpenCL platform found! [Platform 0] Name: Apple Vendor: Apple Version: OpenCL 1.2 (Aug 24 2012 00:53:09) Profile: FULL_PROFILE Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event 2 OpenCL devices found! [Device 0] Name: Intel(R) Core(TM) i7-3820 CPU @ 3.60GHz Vendor: Intel Type: CPU Device Version: OpenCL 1.2 Driver Version: 1.1 Compute Units: 8 Work Group Size: 1024 Clock: 4600 MHz Global Memory (Total): 8192 MB Global Memory (Host): 8192 MB Global Memory (PCIe): 0 MB Local Memory: 32 KB Cache Size: 0.0625 KB Cache Line Size: 10485760 Bytes Available: Yes Double-Precision: Yes Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_APPLE_fp64_basic_ops cl_APPLE_fixed_alpha_channel_orders [Device 1] Name: GeForce GTX 580 Vendor: NVIDIA Type: GPU Device Version: OpenCL 1.1 Driver Version: CLH 1.0 Compute Units: 64 Work Group Size: 1024 Clock: 1880 MHz Global Memory: 1536 MB Local Memory: 48 KB Cache Size: 0 KB Cache Line Size: 0 Bytes Available: Yes Double-Precision: No Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_APPLE_fp64_basic_ops Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1881268 Share on other sites More sharing options...
stefano.85 Posted January 16, 2013 Share Posted January 16, 2013 Hi mitch... so... i've attached oclinfo debug and a screenshot of luxmark test (i think it works fine) tnx and cheers! p.s. since at the moment i think there's a problem with attachments i've uploaded them to my dropbox, here they are... https://dl.dropbox.com/u/8697187/oclinfo.rtf https://dl.dropbox.com/u/8697187/Schermata%202013-01-16%20alle%2019.43.27.png Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1881344 Share on other sites More sharing options...
p.H Posted January 17, 2013 Share Posted January 17, 2013 10.8.2 (12C3006) ATI-Radeon 5650M Just as other ATI users report, too bright. Below is the log: [sLG] Reading configuration file: scenes/luxball/luxball.cfg [sLG] Configuration: [sLG] image.width = 640 [sLG] image.height = 480 [sLG] batch.halttime = 0 [sLG] scene.file = scenes/luxball/luxball.scn [sLG] scene.fieldofview = 45 [sLG] opencl.cpu.use = 0 [sLG] opencl.gpu.use = 1 [sLG] opencl.gpu.workgroup.size = 64 [sLG] screen.refresh.interval = 100 [sLG] film.gamma = 2.2 [sLG] path.maxdepth = 16 [sLG] path.russianroulette.depth = 5 [sLG] light.maxdepth = 16 [sLG] light.russianroulette.depth = 5 [LuxRays::SDL] Reading scene: scenes/luxball/luxball.scn [LuxRays::SDL] Camera postion: Point[0.5, -1.4, 1.7] [LuxRays::SDL] Camera target: Point[0, 0, 0.4] [LuxRays::SDL] Material definition: shell [LuxRays::SDL] Material definition: whitematte [LuxRays::SDL] Material definition: luxtext [LuxRays::SDL] Material definition: blacktext [LuxRays::SDL] Material definition: whitelight [LuxRays::SDL] The l1 object is a light sources with 2 triangles [LuxRays::SDL] The l2 object is a light sources with 2 triangles [LuxRays::SDL] PLY object count: 7 [LuxRays] OpenCL Platform 0: Apple [LuxRays] Device 0 name: NativeThread-000 [LuxRays] Device 0 type: NATIVE_THREAD [LuxRays] Device 0 compute units: 1 [LuxRays] Device 0 max allocable memory: 0MBytes [LuxRays] Device 0 max allocable memory block size: 0MBytes [LuxRays] Device 1 name: NativeThread-001 [LuxRays] Device 1 type: NATIVE_THREAD [LuxRays] Device 1 compute units: 1 [LuxRays] Device 1 max allocable memory: 0MBytes [LuxRays] Device 1 max allocable memory block size: 0MBytes [LuxRays] Device 2 name: NativeThread-002 [LuxRays] Device 2 type: NATIVE_THREAD [LuxRays] Device 2 compute units: 1 [LuxRays] Device 2 max allocable memory: 0MBytes [LuxRays] Device 2 max allocable memory block size: 0MBytes [LuxRays] Device 3 name: NativeThread-003 [LuxRays] Device 3 type: NATIVE_THREAD [LuxRays] Device 3 compute units: 1 [LuxRays] Device 3 max allocable memory: 0MBytes [LuxRays] Device 3 max allocable memory block size: 0MBytes [LuxRays] Device 4 name: Intel® Core i3 CPU M 380 @ 2.53GHz [LuxRays] Device 4 type: OPENCL_CPU [LuxRays] Device 4 compute units: 4 [LuxRays] Device 4 max allocable memory: 8192MBytes [LuxRays] Device 4 max allocable memory block size: 2048MBytes [LuxRays] Device 5 name: ATI Radeon HD 5650 [LuxRays] Device 5 type: OPENCL_GPU [LuxRays] Device 5 compute units: 5 [LuxRays] Device 5 max allocable memory: 1024MBytes [LuxRays] Device 5 max allocable memory block size: 256MBytes [LuxRays] Preprocessing DataSet [LuxRays] Total vertex count: 223380 [LuxRays] Total triangle count: 260794 [LuxRays] Total vertices memory usage: 2617Kbytes [LuxRays] Total triangles memory usage: 3056Kbytes [LuxRays] Building QBVH, primitives: 260794, initial nodes: 21734 [LuxRays] QBVH completed with 43467/43468 nodes [LuxRays] Total QBVH memory usage: 4754Kbytes [LuxRays] Total QBVH QuadTriangle count: 87103 [LuxRays] Max. QBVH Depth: 25 [LuxRays] Creating 1 intersection device(s) [LuxRays] Allocating intersection device 0: ATI Radeon HD 5650 (Type = OPENCL_GPU) [sLG] OpenCL Devices used: [sLG] [ATI Radeon HD 5650Intersect] [LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] Ray buffer size: 3072Kbytes [LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] Ray hits buffer size: 1024Kbytes [LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] OpenCL image support is available [LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] OpenCL max. image buffer size: 8192x8192 [LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] QBVH node image buffer size: 553x551 [LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] QBVH triangle image buffer size: 940x927 [LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] Enabled QBVH scene storage inside image [LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] QBVH max. stack size: 24 [LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] QBVH Image Storage kernel work group size: 256 [LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] Suggested work group size: 256 [LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] Forced work group size: 64 [sLG] Configuring 1 CPU render threads [sLG] [PathOCLRenderThread] OpenCL task count: 65536 [sLG] [PathOCLRenderThread] OpenCL max. page memory size: 262144Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Compile Camera [sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry [sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 13ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Materials [sLG] [PathOCLRenderThread::CompiledScene] Compile Textures [sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights [sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight [sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight [sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight [sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps [sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 0 [sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 0ms [sLG] Starting 1 PathOCL render threads [sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4835Kbytes [sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes [sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] UVs buffer size: 1745Kbytes [sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes [sLG] [PathOCLRenderThread::0] Textures buffer size: 324bytes [sLG] [PathOCLRenderThread::0] Materials buffer size: 160bytes [sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 28bytes [sLG] [PathOCLRenderThread::0] Triangle AreaLights buffer size: 272bytes [sLG] [PathOCLRenderThread::0] Triangle AreaLights index buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Ray buffer size: 3072Kbytes [sLG] [PathOCLRenderThread::0] RayHit buffer size: 1024Kbytes [sLG] [PathOCLRenderThread::0] GPUTask buffer size: 23296Kbytes [sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes [sLG] [PathOCLRenderThread::0] SampleData buffer size: 512Kbytes [sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 256Kbytes [sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms [sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=65536 -D PARAM_IMAGE_WIDTH=640 -D PARAM_IMAGE_HEIGHT=480 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=16 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_ENABLE_MAT_GLASS -D PARAM_HAS_PASSTHROUGH -D PARAM_DIRECT_LIGHT_SAMPLING -D PARAM_DL_LIGHT_COUNT=4 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=0 [sLG] [PathOCLRenderThread::0] Compiling kernels [sLG] [PathOCLRenderThread::0] Kernels not cached [sLG] [PathOCLRenderThread::0] Compiling Init Kernel [sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel [sLG] [PathOCLRenderThread::0] Kernels compilation time: 988ms [sLG] Done. Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1881430 Share on other sites More sharing options...
Regi Yassin Posted January 17, 2013 Share Posted January 17, 2013 it crashed Dwis-Mac-Pro:SLG4_alpha regae$ ./slg4 [sLG] Reading configuration file: scenes/luxball/luxball.cfg [sLG] Configuration: [sLG] image.width = 640 [sLG] image.height = 480 [sLG] batch.halttime = 0 [sLG] scene.file = scenes/luxball/luxball.scn [sLG] scene.fieldofview = 45 [sLG] opencl.cpu.use = 0 [sLG] opencl.gpu.use = 1 [sLG] opencl.gpu.workgroup.size = 64 [sLG] screen.refresh.interval = 100 [sLG] film.gamma = 2.2 [sLG] path.maxdepth = 16 [sLG] path.russianroulette.depth = 5 [sLG] light.maxdepth = 16 [sLG] light.russianroulette.depth = 5 [LuxRays::SDL] Reading scene: scenes/luxball/luxball.scn [LuxRays::SDL] Camera postion: Point[0.5, -1.4, 1.7] [LuxRays::SDL] Camera target: Point[0, 0, 0.4] [LuxRays::SDL] Material definition: shell [LuxRays::SDL] Material definition: whitematte [LuxRays::SDL] Material definition: luxtext [LuxRays::SDL] Material definition: blacktext [LuxRays::SDL] Material definition: whitelight [LuxRays::SDL] The l1 object is a light sources with 2 triangles [LuxRays::SDL] The l2 object is a light sources with 2 triangles [LuxRays::SDL] PLY object count: 7 [LuxRays] OpenCL Platform 0: Apple [LuxRays] Device 0 name: NativeThread-000 [LuxRays] Device 0 type: NATIVE_THREAD [LuxRays] Device 0 compute units: 1 [LuxRays] Device 0 max allocable memory: 0MBytes [LuxRays] Device 0 max allocable memory block size: 0MBytes [LuxRays] Device 1 name: NativeThread-001 [LuxRays] Device 1 type: NATIVE_THREAD [LuxRays] Device 1 compute units: 1 [LuxRays] Device 1 max allocable memory: 0MBytes [LuxRays] Device 1 max allocable memory block size: 0MBytes [LuxRays] Device 2 name: NativeThread-002 [LuxRays] Device 2 type: NATIVE_THREAD [LuxRays] Device 2 compute units: 1 [LuxRays] Device 2 max allocable memory: 0MBytes [LuxRays] Device 2 max allocable memory block size: 0MBytes [LuxRays] Device 3 name: NativeThread-003 [LuxRays] Device 3 type: NATIVE_THREAD [LuxRays] Device 3 compute units: 1 [LuxRays] Device 3 max allocable memory: 0MBytes [LuxRays] Device 3 max allocable memory block size: 0MBytes [LuxRays] Device 4 name: NativeThread-004 [LuxRays] Device 4 type: NATIVE_THREAD [LuxRays] Device 4 compute units: 1 [LuxRays] Device 4 max allocable memory: 0MBytes [LuxRays] Device 4 max allocable memory block size: 0MBytes [LuxRays] Device 5 name: NativeThread-005 [LuxRays] Device 5 type: NATIVE_THREAD [LuxRays] Device 5 compute units: 1 [LuxRays] Device 5 max allocable memory: 0MBytes [LuxRays] Device 5 max allocable memory block size: 0MBytes [LuxRays] Device 6 name: NativeThread-006 [LuxRays] Device 6 type: NATIVE_THREAD [LuxRays] Device 6 compute units: 1 [LuxRays] Device 6 max allocable memory: 0MBytes [LuxRays] Device 6 max allocable memory block size: 0MBytes [LuxRays] Device 7 name: NativeThread-007 [LuxRays] Device 7 type: NATIVE_THREAD [LuxRays] Device 7 compute units: 1 [LuxRays] Device 7 max allocable memory: 0MBytes [LuxRays] Device 7 max allocable memory block size: 0MBytes [LuxRays] Device 8 name: Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz [LuxRays] Device 8 type: OPENCL_CPU [LuxRays] Device 8 compute units: 8 [LuxRays] Device 8 max allocable memory: 8192MBytes [LuxRays] Device 8 max allocable memory block size: 2048MBytes [LuxRays] Device 9 name: GeForce GTX 650 Ti [LuxRays] Device 9 type: OPENCL_GPU [LuxRays] Device 9 compute units: 12 [LuxRays] Device 9 max allocable memory: 2048MBytes [LuxRays] Device 9 max allocable memory block size: 512MBytes [LuxRays] Preprocessing DataSet [LuxRays] Total vertex count: 223380 [LuxRays] Total triangle count: 260794 [LuxRays] Total vertices memory usage: 2617Kbytes [LuxRays] Total triangles memory usage: 3056Kbytes [LuxRays] Building QBVH, primitives: 260794, initial nodes: 21734 [LuxRays] QBVH completed with 43467/43468 nodes [LuxRays] Total QBVH memory usage: 4754Kbytes [LuxRays] Total QBVH QuadTriangle count: 87103 [LuxRays] Max. QBVH Depth: 25 [LuxRays] Creating 1 intersection device(s) [LuxRays] Allocating intersection device 0: GeForce GTX 650 Ti (Type = OPENCL_GPU) [sLG] OpenCL Devices used: [sLG] [GeForce GTX 650 TiIntersect] [LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] Ray buffer size: 3072Kbytes [LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] Ray hits buffer size: 1024Kbytes [LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] OpenCL image support is available [LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] OpenCL max. image buffer size: 8192x8192 [LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] QBVH node image buffer size: 553x551 [LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] QBVH triangle image buffer size: 940x927 [LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] Enabled QBVH scene storage inside image [LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] QBVH max. stack size: 24 [LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] QBVH Image Storage kernel work group size: 1024 [LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] Suggested work group size: 1024 [LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] Forced work group size: 64 [sLG] Configuring 1 CPU render threads [sLG] [PathOCLRenderThread] OpenCL task count: 65536 [sLG] [PathOCLRenderThread] OpenCL max. page memory size: 524288Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Compile Camera [sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry [sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 8ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Materials [sLG] [PathOCLRenderThread::CompiledScene] Compile Textures [sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights [sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight [sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight [sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight [sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps [sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 0 [sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 0ms [sLG] Starting 1 PathOCL render threads [sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4835Kbytes [sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes [sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] UVs buffer size: 1745Kbytes [sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes [sLG] [PathOCLRenderThread::0] Textures buffer size: 324bytes [sLG] [PathOCLRenderThread::0] Materials buffer size: 160bytes [sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 28bytes [sLG] [PathOCLRenderThread::0] Triangle AreaLights buffer size: 272bytes [sLG] [PathOCLRenderThread::0] Triangle AreaLights index buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Ray buffer size: 3072Kbytes [sLG] [PathOCLRenderThread::0] RayHit buffer size: 1024Kbytes [sLG] [PathOCLRenderThread::0] GPUTask buffer size: 23296Kbytes [sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes [sLG] [PathOCLRenderThread::0] SampleData buffer size: 512Kbytes [sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 256Kbytes [sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms [sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=65536 -D PARAM_IMAGE_WIDTH=640 -D PARAM_IMAGE_HEIGHT=480 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=16 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_ENABLE_MAT_GLASS -D PARAM_HAS_PASSTHROUGH -D PARAM_DIRECT_LIGHT_SAMPLING -D PARAM_DL_LIGHT_COUNT=4 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=0 [sLG] [PathOCLRenderThread::0] Compiling kernels [sLG] [PathOCLRenderThread::0] Kernels not cached [sLG] [PathOCLRenderThread::0] Compiling Init Kernel [sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel [sLG] [PathOCLRenderThread::0] Kernels compilation time: 294ms [sLG] [PathOCLRenderThread::0] Rendering thread ERROR: clFlush(CL_INVALID_COMMAND_QUEUE) [sLG] ========================================================= [sLG] Unhandled exception [sLG] Obtained 5 stack frames. [sLG] 0 [sLG] 1 [sLG] 2 [sLG] 3 [sLG] 4 Abort trap: 6 here is the openclinfo 1 OpenCL platform found! [Platform 0] Name: Apple Vendor: Apple Version: OpenCL 1.2 (Aug 24 2012 00:53:09) Profile: FULL_PROFILE Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event 2 OpenCL devices found! [Device 0] Name: Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz Vendor: Intel Type: CPU Device Version: OpenCL 1.2 Driver Version: 1.1 Compute Units: 8 Work Group Size: 1024 Clock: 3400 MHz Global Memory (Total): 8192 MB Global Memory (Host): 8192 MB Global Memory (PCIe): 0 MB Local Memory: 32 KB Cache Size: 0.0625 KB Cache Line Size: 8388608 Bytes Available: Yes Double-Precision: Yes Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_APPLE_fp64_basic_ops cl_APPLE_fixed_alpha_channel_orders [Device 1] Name: GeForce GTX 650 Ti Vendor: NVIDIA Type: GPU Device Version: OpenCL 1.1 Driver Version: CLH 1.0 Compute Units: 12 Work Group Size: 1024 Clock: 0 MHz Global Memory: 2048 MB Local Memory: 48 KB Cache Size: 0 KB Cache Line Size: 0 Bytes Available: Yes Double-Precision: No Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_APPLE_fp64_basic_ops logout Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1881491 Share on other sites More sharing options...
mitch_de Posted February 10, 2013 Author Share Posted February 10, 2013 Jens made an new special test build with test special test scene (green ball, green ground). http://www.jensverwi..._slg4_proof.zip unzip zip somewhere start terminal cd folderpath of main folder/OSX_AMD_slg4_proof test the 2 builds OK and NOT_OK: ./test_mac_fix_ok/slg4 -D opencl.devices.select 01 -D sampler.type SOBOL -D opencl.task.count 262144 -w 512 -e 512 scenes/luxball/luxball-hdr.cfg ./test_mac_fix_not_ok/slg4 -D opencl.devices.select 01 -D sampler.type SOBOL -D opencl.task.count 262144 -w 512 -e 512 scenes/luxball/luxball-hdr.cfg (you can drag & crop those command lines obove into the already opened terminal window to start the tests - dont forget to set working folder by cd before running) Normally - if scene renders correct - it looks like: Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1887610 Share on other sites More sharing options...
maximus Posted February 10, 2013 Share Posted February 10, 2013 OK renders correctly and produces: [sLG] Reading configuration file: scenes/luxball/luxball-hdr.cfg [sLG] Configuration: [sLG] image.width = 512 [sLG] image.height = 512 [sLG] batch.halttime = 0 [sLG] scene.file = scenes/luxball/luxball-hdr.scn [sLG] opencl.cpu.use = 0 [sLG] opencl.gpu.use = 1 [sLG] opencl.gpu.workgroup.size = 64 [sLG] screen.refresh.interval = 100 [sLG] path.maxdepth = 8 [sLG] path.russianroulette.depth = 5 [sLG] opencl.devices.select = 01 [sLG] sampler.type = SOBOL [sLG] opencl.task.count = 262144 [LuxRays::SDL] Reading scene: scenes/luxball/luxball-hdr.scn [LuxRays::SDL] Camera postion: Point[0.6, -1.7, 0.4] [LuxRays::SDL] Camera target: Point[0, 0, 0.4] [LuxRays::SDL] Material definition: shell [LuxRays::SDL] Material definition: whitematte [LuxRays::SDL] Material definition: luxtext [LuxRays::SDL] Material definition: blacktext [LuxRays::SDL] PLY object count: 5 [LuxRays::SDL] Reading texture map: scenes/simple-mat/arch.exr [LuxRays::SDL] HDR RGB (128bit) texture map size: 1024x512 (6144Kbytes) [LuxRays] OpenCL Platform 0: Apple [LuxRays] Device 0 name: NativeThread-000 [LuxRays] Device 0 type: NATIVE_THREAD [LuxRays] Device 0 compute units: 1 [LuxRays] Device 0 max allocable memory: 0MBytes [LuxRays] Device 0 max allocable memory block size: 0MBytes [LuxRays] Device 1 name: NativeThread-001 [LuxRays] Device 1 type: NATIVE_THREAD [LuxRays] Device 1 compute units: 1 [LuxRays] Device 1 max allocable memory: 0MBytes [LuxRays] Device 1 max allocable memory block size: 0MBytes [LuxRays] Device 2 name: NativeThread-002 [LuxRays] Device 2 type: NATIVE_THREAD [LuxRays] Device 2 compute units: 1 [LuxRays] Device 2 max allocable memory: 0MBytes [LuxRays] Device 2 max allocable memory block size: 0MBytes [LuxRays] Device 3 name: NativeThread-003 [LuxRays] Device 3 type: NATIVE_THREAD [LuxRays] Device 3 compute units: 1 [LuxRays] Device 3 max allocable memory: 0MBytes [LuxRays] Device 3 max allocable memory block size: 0MBytes [LuxRays] Device 4 name: NativeThread-004 [LuxRays] Device 4 type: NATIVE_THREAD [LuxRays] Device 4 compute units: 1 [LuxRays] Device 4 max allocable memory: 0MBytes [LuxRays] Device 4 max allocable memory block size: 0MBytes [LuxRays] Device 5 name: NativeThread-005 [LuxRays] Device 5 type: NATIVE_THREAD [LuxRays] Device 5 compute units: 1 [LuxRays] Device 5 max allocable memory: 0MBytes [LuxRays] Device 5 max allocable memory block size: 0MBytes [LuxRays] Device 6 name: NativeThread-006 [LuxRays] Device 6 type: NATIVE_THREAD [LuxRays] Device 6 compute units: 1 [LuxRays] Device 6 max allocable memory: 0MBytes [LuxRays] Device 6 max allocable memory block size: 0MBytes [LuxRays] Device 7 name: NativeThread-007 [LuxRays] Device 7 type: NATIVE_THREAD [LuxRays] Device 7 compute units: 1 [LuxRays] Device 7 max allocable memory: 0MBytes [LuxRays] Device 7 max allocable memory block size: 0MBytes [LuxRays] Device 8 name: Intel® Core™ i7-3820 CPU @ 3.60GHz [LuxRays] Device 8 type: OPENCL_CPU [LuxRays] Device 8 compute units: 8 [LuxRays] Device 8 max allocable memory: 8192MBytes [LuxRays] Device 8 max allocable memory block size: 2048MBytes [LuxRays] Device 9 name: GeForce GTX 580 [LuxRays] Device 9 type: OPENCL_GPU [LuxRays] Device 9 compute units: 16 [LuxRays] Device 9 max allocable memory: 1536MBytes [LuxRays] Device 9 max allocable memory block size: 384MBytes [LuxRays] Preprocessing DataSet [LuxRays] Total vertex count: 223352 [LuxRays] Total triangle count: 260780 [LuxRays] Total vertices memory usage: 2617Kbytes [LuxRays] Total triangles memory usage: 3056Kbytes [LuxRays] Building QBVH, primitives: 260780, initial nodes: 21733 [LuxRays] QBVH completed with 43609/86932 nodes [LuxRays] Total QBVH memory usage: 4769Kbytes [LuxRays] Total QBVH QuadTriangle count: 87045 [LuxRays] Max. QBVH Depth: 24 [LuxRays] Creating 1 intersection device(s) [LuxRays] Allocating intersection device 0: GeForce GTX 580 (Type = OPENCL_GPU) [sLG] OpenCL Devices used: [sLG] [GeForce GTX 580Intersect] [sLG] Device OpenCL version: OpenCL 1.1 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Ray buffer size: 3072Kbytes [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Ray hits buffer size: 1024Kbytes [LuxRays] [OpenCL device::GeForce GTX 580Intersect] OpenCL image support is available [LuxRays] [OpenCL device::GeForce GTX 580Intersect] OpenCL max. image buffer size: 8192x8192 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH node image buffer size: 553x553 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH triangle image buffer size: 940x927 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Enabled QBVH scene storage inside image [LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH max. stack size: 24 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH Image Storage kernel work group size: 512 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Suggested work group size: 512 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Forced work group size: 64 [sLG] Configuring 1 CPU render threads [sLG] [PathOCLRenderThread] OpenCL task count: 262144 [sLG] [PathOCLRenderThread] OpenCL max. page memory size: 393216Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Compile Camera [sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry [sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 6ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Materials [sLG] [PathOCLRenderThread::CompiledScene] Compile Textures [sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights [sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight [sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight [sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight [sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps [sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 1 [sLG] [PathOCLRenderThread::CompiledScene] RGB channel page 0 size: 6144Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 1ms [sLG] Starting 1 PathOCL render threads [sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4128Kbytes [sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes [sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] UVs buffer size: 1744Kbytes [sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes [sLG] [PathOCLRenderThread::0] ImageMaps description buffer size: 20bytes [sLG] [PathOCLRenderThread::0] ImageMaps buffer size: 6144Kbytes [sLG] [PathOCLRenderThread::0] Textures buffer size: 144bytes [sLG] [PathOCLRenderThread::0] Materials buffer size: 192bytes [sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 20bytes [sLG] [PathOCLRenderThread::0] InfiniteLight buffer size: 52bytes [sLG] [PathOCLRenderThread::0] Ray buffer size: 12288Kbytes [sLG] [PathOCLRenderThread::0] RayHit buffer size: 4096Kbytes [sLG] [PathOCLRenderThread::0] GPUTask buffer size: 53248Kbytes [sLG] [PathOCLRenderThread::0] Sample dimensions: 28 [sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes [sLG] [PathOCLRenderThread::0] SampleData buffer size: 2048Kbytes [sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 1024Kbytes [sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms [sLG] [PathOCLRenderThread::0] Sobol table size: 896 [sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=262144 -D PARAM_IMAGE_WIDTH=512 -D PARAM_IMAGE_HEIGHT=512 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=8 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_CAMERA_HAS_DOF -D PARAM_HAS_INFINITELIGHT -D PARAM_HAS_IMAGEMAPS -D PARAM_IMAGEMAPS_PAGE_0 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=2 -D PARAM_SAMPLER_SOBOL_MAXDEPTH=8 [sLG] [PathOCLRenderThread::0] Compiling kernels [sLG] [PathOCLRenderThread::0] Kernels not cached [sLG] [PathOCLRenderThread::0] Compiling Init Kernel [sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel [sLG] [PathOCLRenderThread::0] Kernels compilation time: 111ms This is NOT_OK: [sLG] Reading configuration file: scenes/luxball/luxball-hdr.cfg [sLG] Configuration: [sLG] image.width = 512 [sLG] image.height = 512 [sLG] batch.halttime = 0 [sLG] scene.file = scenes/luxball/luxball-hdr.scn [sLG] opencl.cpu.use = 0 [sLG] opencl.gpu.use = 1 [sLG] opencl.gpu.workgroup.size = 64 [sLG] screen.refresh.interval = 100 [sLG] path.maxdepth = 8 [sLG] path.russianroulette.depth = 5 [sLG] opencl.devices.select = 01 [sLG] sampler.type = SOBOL [sLG] opencl.task.count = 262144 [LuxRays::SDL] Reading scene: scenes/luxball/luxball-hdr.scn [LuxRays::SDL] Camera postion: Point[0.6, -1.7, 0.4] [LuxRays::SDL] Camera target: Point[0, 0, 0.4] [LuxRays::SDL] Material definition: shell [LuxRays::SDL] Material definition: whitematte [LuxRays::SDL] Material definition: luxtext [LuxRays::SDL] Material definition: blacktext [LuxRays::SDL] PLY object count: 5 [LuxRays::SDL] Reading texture map: scenes/simple-mat/arch.exr [LuxRays::SDL] HDR RGB (128bit) texture map size: 1024x512 (6144Kbytes) [LuxRays] OpenCL Platform 0: Apple [LuxRays] Device 0 name: NativeThread-000 [LuxRays] Device 0 type: NATIVE_THREAD [LuxRays] Device 0 compute units: 1 [LuxRays] Device 0 max allocable memory: 0MBytes [LuxRays] Device 0 max allocable memory block size: 0MBytes [LuxRays] Device 1 name: NativeThread-001 [LuxRays] Device 1 type: NATIVE_THREAD [LuxRays] Device 1 compute units: 1 [LuxRays] Device 1 max allocable memory: 0MBytes [LuxRays] Device 1 max allocable memory block size: 0MBytes [LuxRays] Device 2 name: NativeThread-002 [LuxRays] Device 2 type: NATIVE_THREAD [LuxRays] Device 2 compute units: 1 [LuxRays] Device 2 max allocable memory: 0MBytes [LuxRays] Device 2 max allocable memory block size: 0MBytes [LuxRays] Device 3 name: NativeThread-003 [LuxRays] Device 3 type: NATIVE_THREAD [LuxRays] Device 3 compute units: 1 [LuxRays] Device 3 max allocable memory: 0MBytes [LuxRays] Device 3 max allocable memory block size: 0MBytes [LuxRays] Device 4 name: NativeThread-004 [LuxRays] Device 4 type: NATIVE_THREAD [LuxRays] Device 4 compute units: 1 [LuxRays] Device 4 max allocable memory: 0MBytes [LuxRays] Device 4 max allocable memory block size: 0MBytes [LuxRays] Device 5 name: NativeThread-005 [LuxRays] Device 5 type: NATIVE_THREAD [LuxRays] Device 5 compute units: 1 [LuxRays] Device 5 max allocable memory: 0MBytes [LuxRays] Device 5 max allocable memory block size: 0MBytes [LuxRays] Device 6 name: NativeThread-006 [LuxRays] Device 6 type: NATIVE_THREAD [LuxRays] Device 6 compute units: 1 [LuxRays] Device 6 max allocable memory: 0MBytes [LuxRays] Device 6 max allocable memory block size: 0MBytes [LuxRays] Device 7 name: NativeThread-007 [LuxRays] Device 7 type: NATIVE_THREAD [LuxRays] Device 7 compute units: 1 [LuxRays] Device 7 max allocable memory: 0MBytes [LuxRays] Device 7 max allocable memory block size: 0MBytes [LuxRays] Device 8 name: Intel® Core™ i7-3820 CPU @ 3.60GHz [LuxRays] Device 8 type: OPENCL_CPU [LuxRays] Device 8 compute units: 8 [LuxRays] Device 8 max allocable memory: 8192MBytes [LuxRays] Device 8 max allocable memory block size: 2048MBytes [LuxRays] Device 9 name: GeForce GTX 580 [LuxRays] Device 9 type: OPENCL_GPU [LuxRays] Device 9 compute units: 16 [LuxRays] Device 9 max allocable memory: 1536MBytes [LuxRays] Device 9 max allocable memory block size: 384MBytes [LuxRays] Preprocessing DataSet [LuxRays] Total vertex count: 223352 [LuxRays] Total triangle count: 260780 [LuxRays] Total vertices memory usage: 2617Kbytes [LuxRays] Total triangles memory usage: 3056Kbytes [LuxRays] Building QBVH, primitives: 260780, initial nodes: 21733 [LuxRays] QBVH completed with 43609/86932 nodes [LuxRays] Total QBVH memory usage: 4769Kbytes [LuxRays] Total QBVH QuadTriangle count: 87045 [LuxRays] Max. QBVH Depth: 24 [LuxRays] Creating 1 intersection device(s) [LuxRays] Allocating intersection device 0: GeForce GTX 580 (Type = OPENCL_GPU) [sLG] OpenCL Devices used: [sLG] [GeForce GTX 580Intersect] [sLG] Device OpenCL version: OpenCL 1.1 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Ray buffer size: 3072Kbytes [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Ray hits buffer size: 1024Kbytes [LuxRays] [OpenCL device::GeForce GTX 580Intersect] OpenCL image support is available [LuxRays] [OpenCL device::GeForce GTX 580Intersect] OpenCL max. image buffer size: 8192x8192 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH node image buffer size: 553x553 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH triangle image buffer size: 940x927 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Enabled QBVH scene storage inside image [LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH max. stack size: 24 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH Image Storage kernel work group size: 512 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Suggested work group size: 512 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Forced work group size: 64 [sLG] Configuring 1 CPU render threads [sLG] [PathOCLRenderThread] OpenCL task count: 262144 [sLG] [PathOCLRenderThread] OpenCL max. page memory size: 393216Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Compile Camera [sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry [sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 6ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Materials [sLG] [PathOCLRenderThread::CompiledScene] Compile Textures [sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights [sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight [sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight [sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight [sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps [sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 1 [sLG] [PathOCLRenderThread::CompiledScene] RGB channel page 0 size: 6144Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 1ms [sLG] Starting 1 PathOCL render threads [sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4128Kbytes [sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes [sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] UVs buffer size: 1744Kbytes [sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes [sLG] [PathOCLRenderThread::0] ImageMaps description buffer size: 20bytes [sLG] [PathOCLRenderThread::0] ImageMaps buffer size: 6144Kbytes [sLG] [PathOCLRenderThread::0] Textures buffer size: 144bytes [sLG] [PathOCLRenderThread::0] Materials buffer size: 192bytes [sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 20bytes [sLG] [PathOCLRenderThread::0] InfiniteLight buffer size: 52bytes [sLG] [PathOCLRenderThread::0] Ray buffer size: 12288Kbytes [sLG] [PathOCLRenderThread::0] RayHit buffer size: 4096Kbytes [sLG] [PathOCLRenderThread::0] GPUTask buffer size: 53248Kbytes [sLG] [PathOCLRenderThread::0] Sample dimensions: 28 [sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes [sLG] [PathOCLRenderThread::0] SampleData buffer size: 2048Kbytes [sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 1024Kbytes [sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms [sLG] [PathOCLRenderThread::0] Sobol table size: 896 [sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=262144 -D PARAM_IMAGE_WIDTH=512 -D PARAM_IMAGE_HEIGHT=512 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=8 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_CAMERA_HAS_DOF -D PARAM_HAS_INFINITELIGHT -D PARAM_HAS_IMAGEMAPS -D PARAM_IMAGEMAPS_PAGE_0 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=2 -D PARAM_SAMPLER_SOBOL_MAXDEPTH=8 [sLG] [PathOCLRenderThread::0] Compiling kernels [sLG] [PathOCLRenderThread::0] Kernels not cached [sLG] [PathOCLRenderThread::0] Compiling Init Kernel [sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel [sLG] [PathOCLRenderThread::0] Kernels compilation time: 111ms Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1887628 Share on other sites More sharing options...
k3nny Posted February 10, 2013 Share Posted February 10, 2013 1) test_mac_fix_ok: OK 2) test_mac_fix_not_ok: NOT OK > alex@box [~/Downloads/OSX_AMD_slg4_proof] ./test_mac_fix_not_ok/slg4 -D opencl.devices.select 01 -D sampler.type SOBOL -D opencl.task.count 262144 -w 512 -e 512 scenes/luxball/luxball-hdr.cfg [sLG] Reading configuration file: scenes/luxball/luxball-hdr.cfg [sLG] Configuration: [sLG] image.width = 512 [sLG] image.height = 512 [sLG] batch.halttime = 0 [sLG] scene.file = scenes/luxball/luxball-hdr.scn [sLG] opencl.cpu.use = 0 [sLG] opencl.gpu.use = 1 [sLG] opencl.gpu.workgroup.size = 64 [sLG] screen.refresh.interval = 100 [sLG] path.maxdepth = 8 [sLG] path.russianroulette.depth = 5 [sLG] opencl.devices.select = 01 [sLG] sampler.type = SOBOL [sLG] opencl.task.count = 262144 [LuxRays::SDL] Reading scene: scenes/luxball/luxball-hdr.scn [LuxRays::SDL] Camera postion: Point[0.6, -1.7, 0.4] [LuxRays::SDL] Camera target: Point[0, 0, 0.4] [LuxRays::SDL] Material definition: shell [LuxRays::SDL] Material definition: whitematte [LuxRays::SDL] Material definition: luxtext [LuxRays::SDL] Material definition: blacktext [LuxRays::SDL] PLY object count: 5 [LuxRays::SDL] Reading texture map: scenes/simple-mat/arch.exr [LuxRays::SDL] HDR RGB (128bit) texture map size: 1024x512 (6144Kbytes) [LuxRays] OpenCL Platform 0: Apple [LuxRays] Device 0 name: NativeThread-000 [LuxRays] Device 0 type: NATIVE_THREAD [LuxRays] Device 0 compute units: 1 [LuxRays] Device 0 max allocable memory: 0MBytes [LuxRays] Device 0 max allocable memory block size: 0MBytes [LuxRays] Device 1 name: NativeThread-001 [LuxRays] Device 1 type: NATIVE_THREAD [LuxRays] Device 1 compute units: 1 [LuxRays] Device 1 max allocable memory: 0MBytes [LuxRays] Device 1 max allocable memory block size: 0MBytes [LuxRays] Device 2 name: NativeThread-002 [LuxRays] Device 2 type: NATIVE_THREAD [LuxRays] Device 2 compute units: 1 [LuxRays] Device 2 max allocable memory: 0MBytes [LuxRays] Device 2 max allocable memory block size: 0MBytes [LuxRays] Device 3 name: NativeThread-003 [LuxRays] Device 3 type: NATIVE_THREAD [LuxRays] Device 3 compute units: 1 [LuxRays] Device 3 max allocable memory: 0MBytes [LuxRays] Device 3 max allocable memory block size: 0MBytes [LuxRays] Device 4 name: Intel(R) Core(TM) i5-3570K CPU @ 3.40GHz [LuxRays] Device 4 type: OPENCL_CPU [LuxRays] Device 4 compute units: 4 [LuxRays] Device 4 max allocable memory: 8192MBytes [LuxRays] Device 4 max allocable memory block size: 2048MBytes [LuxRays] Device 5 name: AMD Radeon HD Pitcairn XT Prototype Compute Engine [LuxRays] Device 5 type: OPENCL_GPU [LuxRays] Device 5 compute units: 20 [LuxRays] Device 5 max allocable memory: 1024MBytes [LuxRays] Device 5 max allocable memory block size: 256MBytes [LuxRays] Preprocessing DataSet [LuxRays] Total vertex count: 223352 [LuxRays] Total triangle count: 260780 [LuxRays] Total vertices memory usage: 2617Kbytes [LuxRays] Total triangles memory usage: 3056Kbytes [LuxRays] Building QBVH, primitives: 260780, initial nodes: 21733 [LuxRays] QBVH completed with 43609/86932 nodes [LuxRays] Total QBVH memory usage: 4769Kbytes [LuxRays] Total QBVH QuadTriangle count: 87045 [LuxRays] Max. QBVH Depth: 24 [LuxRays] Creating 1 intersection device(s) [LuxRays] Allocating intersection device 0: AMD Radeon HD Pitcairn XT Prototype Compute Engine (Type = OPENCL_GPU) [sLG] OpenCL Devices used: [sLG] [AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] [sLG] Device OpenCL version: OpenCL 1.1 [LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] Ray buffer size: 3072Kbytes [LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] Ray hits buffer size: 1024Kbytes [LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] OpenCL image support is available [LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] OpenCL max. image buffer size: 8192x8192 [LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] QBVH node image buffer size: 553x553 [LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] QBVH triangle image buffer size: 940x927 [LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] Enabled QBVH scene storage inside image [LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] QBVH max. stack size: 24 [LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] QBVH Image Storage kernel work group size: 256 [LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] Suggested work group size: 256 [LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] Forced work group size: 64 [sLG] Configuring 1 CPU render threads [sLG] [PathOCLRenderThread] OpenCL task count: 262144 [sLG] [PathOCLRenderThread] OpenCL max. page memory size: 262144Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Compile Camera [sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry [sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 7ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Materials [sLG] [PathOCLRenderThread::CompiledScene] Compile Textures [sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights [sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight [sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight [sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight [sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps [sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 1 [sLG] [PathOCLRenderThread::CompiledScene] RGB channel page 0 size: 6144Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 1ms [sLG] Starting 1 PathOCL render threads [sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4128Kbytes [sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes [sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] UVs buffer size: 1744Kbytes [sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes [sLG] [PathOCLRenderThread::0] ImageMaps description buffer size: 20bytes [sLG] [PathOCLRenderThread::0] ImageMaps buffer size: 6144Kbytes [sLG] [PathOCLRenderThread::0] Textures buffer size: 144bytes [sLG] [PathOCLRenderThread::0] Materials buffer size: 192bytes [sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 20bytes [sLG] [PathOCLRenderThread::0] InfiniteLight buffer size: 52bytes [sLG] [PathOCLRenderThread::0] Ray buffer size: 12288Kbytes [sLG] [PathOCLRenderThread::0] RayHit buffer size: 4096Kbytes [sLG] [PathOCLRenderThread::0] GPUTask buffer size: 53248Kbytes [sLG] [PathOCLRenderThread::0] Sample dimensions: 28 [sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes [sLG] [PathOCLRenderThread::0] SampleData buffer size: 2048Kbytes [sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 1024Kbytes [sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms [sLG] [PathOCLRenderThread::0] Sobol table size: 896 [sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=262144 -D PARAM_IMAGE_WIDTH=512 -D PARAM_IMAGE_HEIGHT=512 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=8 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_CAMERA_HAS_DOF -D PARAM_HAS_INFINITELIGHT -D PARAM_HAS_IMAGEMAPS -D PARAM_IMAGEMAPS_PAGE_0 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=2 -D PARAM_SAMPLER_SOBOL_MAXDEPTH=8 [sLG] [PathOCLRenderThread::0] Compiling kernels [sLG] [PathOCLRenderThread::0] Kernels not cached [sLG] [PathOCLRenderThread::0] Compiling Init Kernel [sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel [sLG] [PathOCLRenderThread::0] Kernels compilation time: 2ms Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1887630 Share on other sites More sharing options...
mitch_de Posted February 11, 2013 Author Share Posted February 11, 2013 Thanks ! Especially AMD users are wanted (as i understood Jens). You can follow Jens tests here : http://www.luxrender.net/forum/viewtopic.php?f=34&t=9592&p=92345#p92345 Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1887778 Share on other sites More sharing options...
bcobco Posted February 11, 2013 Share Posted February 11, 2013 AMD ATHLON X2 7850BE NVIDIA GT440 compu:SLG4_alpha cc$ ./slg4 [sLG] Reading configuration file: scenes/luxball/luxball.cfg [sLG] Configuration: [sLG] image.width = 640 [sLG] image.height = 480 [sLG] batch.halttime = 0 [sLG] scene.file = scenes/luxball/luxball.scn [sLG] scene.fieldofview = 45 [sLG] opencl.cpu.use = 0 [sLG] opencl.gpu.use = 1 [sLG] opencl.gpu.workgroup.size = 64 [sLG] screen.refresh.interval = 100 [sLG] film.gamma = 2.2 [sLG] path.maxdepth = 16 [sLG] path.russianroulette.depth = 5 [sLG] light.maxdepth = 16 [sLG] light.russianroulette.depth = 5 [LuxRays::SDL] Reading scene: scenes/luxball/luxball.scn [LuxRays::SDL] Camera postion: Point[0.5, -1.4, 1.7] [LuxRays::SDL] Camera target: Point[0, 0, 0.4] [LuxRays::SDL] Material definition: shell [LuxRays::SDL] Material definition: whitematte [LuxRays::SDL] Material definition: luxtext [LuxRays::SDL] Material definition: blacktext [LuxRays::SDL] Material definition: whitelight [LuxRays::SDL] The l1 object is a light sources with 2 triangles [LuxRays::SDL] The l2 object is a light sources with 2 triangles [LuxRays::SDL] PLY object count: 7 [LuxRays] OpenCL Platform 0: Apple [LuxRays] Device 0 name: NativeThread-000 [LuxRays] Device 0 type: NATIVE_THREAD [LuxRays] Device 0 compute units: 1 [LuxRays] Device 0 max allocable memory: 0MBytes [LuxRays] Device 0 max allocable memory block size: 0MBytes [LuxRays] Device 1 name: NativeThread-001 [LuxRays] Device 1 type: NATIVE_THREAD [LuxRays] Device 1 compute units: 1 [LuxRays] Device 1 max allocable memory: 0MBytes [LuxRays] Device 1 max allocable memory block size: 0MBytes [LuxRays] Device 2 name: GeForce GT 440 [LuxRays] Device 2 type: OPENCL_GPU [LuxRays] Device 2 compute units: 2 [LuxRays] Device 2 max allocable memory: 1024MBytes [LuxRays] Device 2 max allocable memory block size: 256MBytes [LuxRays] Device 3 name: AMD Athlon 7850 Dual-Core Processor [LuxRays] Device 3 type: OPENCL_CPU [LuxRays] Device 3 compute units: 2 [LuxRays] Device 3 max allocable memory: 3072MBytes [LuxRays] Device 3 max allocable memory block size: 1024MBytes [LuxRays] Preprocessing DataSet [LuxRays] Total vertex count: 223380 [LuxRays] Total triangle count: 260794 [LuxRays] Total vertices memory usage: 2617Kbytes [LuxRays] Total triangles memory usage: 3056Kbytes [LuxRays] Building QBVH, primitives: 260794, initial nodes: 21734 [LuxRays] QBVH completed with 43467/43468 nodes [LuxRays] Total QBVH memory usage: 4754Kbytes [LuxRays] Total QBVH QuadTriangle count: 87103 [LuxRays] Max. QBVH Depth: 25 [LuxRays] Creating 1 intersection device(s) [LuxRays] Allocating intersection device 0: GeForce GT 440 (Type = OPENCL_GPU) [sLG] OpenCL Devices used: [sLG] [GeForce GT 440Intersect] [sLG] RUNTIME ERROR: OpenCL version 1.1 or better is required for device: GeForce GT 440Intersect compu:SLG4_alpha cc$ uname -a Darwin compu.local 10.8.0 Darwin Kernel Version 10.8.0: ma 22 aug 2011 22:12:25 CEST; legacy kernel v8 :xnu-1504.15.3/BUILD/obj/RELEASE_I386 i386 compu:SLG4_alpha cc$ /Users/cc/Desktop/oclinfo 1 OpenCL platform found! [Platform 0] Name: Apple Vendor: Apple Version: OpenCL 1.0 (Dec 23 2010 17:30:26) Profile: FULL_PROFILE 2 OpenCL devices found! [Device 0] Name: GeForce GT 440 Vendor: NVIDIA Type: GPU Device Version: OpenCL 1.0 Driver Version: CLH 1.0 Compute Units: 2 Work Group Size: 1024 Clock: 1620 MHz Global Memory: 1024 MB Local Memory: 48 KB Cache Size: 0 KB Cache Line Size: 0 Bytes Available: Yes Double-Precision: No Extensions: cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_APPLE_gl_sharing cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions [Device 1] Name: AMD Athlon 7850 Dual-Core Processor Vendor: Intel Type: CPU Device Version: OpenCL 1.0 Driver Version: 1.0 Compute Units: 2 Work Group Size: 1 Clock: 3000 MHz Global Memory (Total): 5120 MB Global Memory (Host): 3072 MB Global Memory (PCIe): 2048 MB Local Memory: 16 KB Cache Size: 4096 KB Cache Line Size: 64 Bytes Available: Yes Double-Precision: Yes Extensions: cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_APPLE_gl_sharing cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1887908 Share on other sites More sharing options...
mitch_de Posted February 14, 2013 Author Share Posted February 14, 2013 Thanks - AMD GPU developers found the bug in the MAD OpenCL part (which made overbrigthen in first test / green test wrong). They will fix it and submit changes to Apple. In the past i see that non other Nvidia gpu could run slg4 alpha without chrashing - at least the submitted Fermi based gpus crashed. Anyone else with nvidia (Fermi +non fermi ) may try the newer test (green ball) again? DL + Info look post #15 Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1888753 Share on other sites More sharing options...
RobertX Posted February 14, 2013 Share Posted February 14, 2013 tested on my GT 430 results: Last login: Wed Feb 13 20:18:45 on ttys000 Leslies-Mac-Pro:~ leslie$ cd /Users/leslie/Downloads/OSX_AMD_slg4_proof Leslies-Mac-Pro:OSX_AMD_slg4_proof leslie$ ./test_mac_fix_ok/slg4 -D opencl.devices.select 01 -D sampler.type SOBOL -D opencl.task.count 262144 -w 512 -e 512 scenes/luxball/luxball-hdr.cfg [sLG] Reading configuration file: scenes/luxball/luxball-hdr.cfg [sLG] Configuration: [sLG] image.width = 512 [sLG] image.height = 512 [sLG] batch.halttime = 0 [sLG] scene.file = scenes/luxball/luxball-hdr.scn [sLG] opencl.cpu.use = 0 [sLG] opencl.gpu.use = 1 [sLG] opencl.gpu.workgroup.size = 64 [sLG] screen.refresh.interval = 100 [sLG] path.maxdepth = 8 [sLG] path.russianroulette.depth = 5 [sLG] opencl.devices.select = 01 [sLG] sampler.type = SOBOL [sLG] opencl.task.count = 262144 [LuxRays::SDL] Reading scene: scenes/luxball/luxball-hdr.scn [LuxRays::SDL] Camera postion: Point[0.6, -1.7, 0.4] [LuxRays::SDL] Camera target: Point[0, 0, 0.4] [LuxRays::SDL] Material definition: shell [LuxRays::SDL] Material definition: whitematte [LuxRays::SDL] Material definition: luxtext [LuxRays::SDL] Material definition: blacktext [LuxRays::SDL] PLY object count: 5 [LuxRays::SDL] Reading texture map: scenes/simple-mat/arch.exr [LuxRays::SDL] HDR RGB (128bit) texture map size: 1024x512 (6144Kbytes) [LuxRays] OpenCL Platform 0: Apple [LuxRays] Device 0 name: NativeThread-000 [LuxRays] Device 0 type: NATIVE_THREAD [LuxRays] Device 0 compute units: 1 [LuxRays] Device 0 max allocable memory: 0MBytes [LuxRays] Device 0 max allocable memory block size: 0MBytes [LuxRays] Device 1 name: NativeThread-001 [LuxRays] Device 1 type: NATIVE_THREAD [LuxRays] Device 1 compute units: 1 [LuxRays] Device 1 max allocable memory: 0MBytes [LuxRays] Device 1 max allocable memory block size: 0MBytes [LuxRays] Device 2 name: Intel® Core™2 Duo CPU E6550 @ 2.33GHz [LuxRays] Device 2 type: OPENCL_CPU [LuxRays] Device 2 compute units: 2 [LuxRays] Device 2 max allocable memory: 8192MBytes [LuxRays] Device 2 max allocable memory block size: 2048MBytes [LuxRays] Device 3 name: GeForce GT 430 [LuxRays] Device 3 type: OPENCL_GPU [LuxRays] Device 3 compute units: 2 [LuxRays] Device 3 max allocable memory: 1024MBytes [LuxRays] Device 3 max allocable memory block size: 256MBytes [LuxRays] Preprocessing DataSet [LuxRays] Total vertex count: 223352 [LuxRays] Total triangle count: 260780 [LuxRays] Total vertices memory usage: 2617Kbytes [LuxRays] Total triangles memory usage: 3056Kbytes [LuxRays] Building QBVH, primitives: 260780, initial nodes: 21733 [LuxRays] QBVH completed with 43609/86932 nodes [LuxRays] Total QBVH memory usage: 4769Kbytes [LuxRays] Total QBVH QuadTriangle count: 87045 [LuxRays] Max. QBVH Depth: 24 [LuxRays] Creating 1 intersection device(s) [LuxRays] Allocating intersection device 0: GeForce GT 430 (Type = OPENCL_GPU) [sLG] OpenCL Devices used: [sLG] [GeForce GT 430Intersect] [sLG] Device OpenCL version: OpenCL 1.1 [LuxRays] [OpenCL device::GeForce GT 430Intersect] Ray buffer size: 3072Kbytes [LuxRays] [OpenCL device::GeForce GT 430Intersect] Ray hits buffer size: 1024Kbytes [LuxRays] [OpenCL device::GeForce GT 430Intersect] OpenCL image support is available [LuxRays] [OpenCL device::GeForce GT 430Intersect] OpenCL max. image buffer size: 8192x8192 [LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH node image buffer size: 553x553 [LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH triangle image buffer size: 940x927 [LuxRays] [OpenCL device::GeForce GT 430Intersect] Enabled QBVH scene storage inside image [LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH max. stack size: 24 [LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH Image Storage kernel work group size: 512 [LuxRays] [OpenCL device::GeForce GT 430Intersect] Suggested work group size: 512 [LuxRays] [OpenCL device::GeForce GT 430Intersect] Forced work group size: 64 [sLG] Configuring 1 CPU render threads [sLG] [PathOCLRenderThread] OpenCL task count: 262144 [sLG] [PathOCLRenderThread] OpenCL max. page memory size: 262144Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Compile Camera [sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry [sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 13ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Materials [sLG] [PathOCLRenderThread::CompiledScene] Compile Textures [sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights [sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight [sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight [sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight [sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps [sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 1 [sLG] [PathOCLRenderThread::CompiledScene] RGB channel page 0 size: 6144Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 2ms [sLG] Starting 1 PathOCL render threads [sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4128Kbytes [sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes [sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] UVs buffer size: 1744Kbytes [sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes [sLG] [PathOCLRenderThread::0] ImageMaps description buffer size: 20bytes [sLG] [PathOCLRenderThread::0] ImageMaps buffer size: 6144Kbytes [sLG] [PathOCLRenderThread::0] Textures buffer size: 144bytes [sLG] [PathOCLRenderThread::0] Materials buffer size: 192bytes [sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 20bytes [sLG] [PathOCLRenderThread::0] InfiniteLight buffer size: 52bytes [sLG] [PathOCLRenderThread::0] Ray buffer size: 12288Kbytes [sLG] [PathOCLRenderThread::0] RayHit buffer size: 4096Kbytes [sLG] [PathOCLRenderThread::0] GPUTask buffer size: 53248Kbytes [sLG] [PathOCLRenderThread::0] Sample dimensions: 28 [sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes [sLG] [PathOCLRenderThread::0] SampleData buffer size: 2048Kbytes [sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 1024Kbytes [sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms [sLG] [PathOCLRenderThread::0] Sobol table size: 896 [sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=262144 -D PARAM_IMAGE_WIDTH=512 -D PARAM_IMAGE_HEIGHT=512 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=8 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_CAMERA_HAS_DOF -D PARAM_HAS_INFINITELIGHT -D PARAM_HAS_IMAGEMAPS -D PARAM_IMAGEMAPS_PAGE_0 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=2 -D PARAM_SAMPLER_SOBOL_MAXDEPTH=8 [sLG] [PathOCLRenderThread::0] Compiling kernels [sLG] [PathOCLRenderThread::0] Kernels not cached [sLG] [PathOCLRenderThread::0] Compiling Init Kernel [sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel [sLG] [PathOCLRenderThread::0] Kernels compilation time: 258ms Leslies-Mac-Pro:OSX_AMD_slg4_proof leslie$ ./test_mac_fix_ok/slg4 -D opencl.devices.select 01 -D sampler.type SOBOL -D opencl.task.count 262144 -w 512 -e 512 scenes/luxball/luxball-hdr.cfg [sLG] Reading configuration file: scenes/luxball/luxball-hdr.cfg [sLG] Configuration: [sLG] image.width = 512 [sLG] image.height = 512 [sLG] batch.halttime = 0 [sLG] scene.file = scenes/luxball/luxball-hdr.scn [sLG] opencl.cpu.use = 0 [sLG] opencl.gpu.use = 1 [sLG] opencl.gpu.workgroup.size = 64 [sLG] screen.refresh.interval = 100 [sLG] path.maxdepth = 8 [sLG] path.russianroulette.depth = 5 [sLG] opencl.devices.select = 01 [sLG] sampler.type = SOBOL [sLG] opencl.task.count = 262144 [LuxRays::SDL] Reading scene: scenes/luxball/luxball-hdr.scn [LuxRays::SDL] Camera postion: Point[0.6, -1.7, 0.4] [LuxRays::SDL] Camera target: Point[0, 0, 0.4] [LuxRays::SDL] Material definition: shell [LuxRays::SDL] Material definition: whitematte [LuxRays::SDL] Material definition: luxtext [LuxRays::SDL] Material definition: blacktext [LuxRays::SDL] PLY object count: 5 [LuxRays::SDL] Reading texture map: scenes/simple-mat/arch.exr [LuxRays::SDL] HDR RGB (128bit) texture map size: 1024x512 (6144Kbytes) [LuxRays] OpenCL Platform 0: Apple [LuxRays] Device 0 name: NativeThread-000 [LuxRays] Device 0 type: NATIVE_THREAD [LuxRays] Device 0 compute units: 1 [LuxRays] Device 0 max allocable memory: 0MBytes [LuxRays] Device 0 max allocable memory block size: 0MBytes [LuxRays] Device 1 name: NativeThread-001 [LuxRays] Device 1 type: NATIVE_THREAD [LuxRays] Device 1 compute units: 1 [LuxRays] Device 1 max allocable memory: 0MBytes [LuxRays] Device 1 max allocable memory block size: 0MBytes [LuxRays] Device 2 name: Intel® Core™2 Duo CPU E6550 @ 2.33GHz [LuxRays] Device 2 type: OPENCL_CPU [LuxRays] Device 2 compute units: 2 [LuxRays] Device 2 max allocable memory: 8192MBytes [LuxRays] Device 2 max allocable memory block size: 2048MBytes [LuxRays] Device 3 name: GeForce GT 430 [LuxRays] Device 3 type: OPENCL_GPU [LuxRays] Device 3 compute units: 2 [LuxRays] Device 3 max allocable memory: 1024MBytes [LuxRays] Device 3 max allocable memory block size: 256MBytes [LuxRays] Preprocessing DataSet [LuxRays] Total vertex count: 223352 [LuxRays] Total triangle count: 260780 [LuxRays] Total vertices memory usage: 2617Kbytes [LuxRays] Total triangles memory usage: 3056Kbytes [LuxRays] Building QBVH, primitives: 260780, initial nodes: 21733 [LuxRays] QBVH completed with 43609/86932 nodes [LuxRays] Total QBVH memory usage: 4769Kbytes [LuxRays] Total QBVH QuadTriangle count: 87045 [LuxRays] Max. QBVH Depth: 24 [LuxRays] Creating 1 intersection device(s) [LuxRays] Allocating intersection device 0: GeForce GT 430 (Type = OPENCL_GPU) [sLG] OpenCL Devices used: [sLG] [GeForce GT 430Intersect] [sLG] Device OpenCL version: OpenCL 1.1 [LuxRays] [OpenCL device::GeForce GT 430Intersect] Ray buffer size: 3072Kbytes [LuxRays] [OpenCL device::GeForce GT 430Intersect] Ray hits buffer size: 1024Kbytes [LuxRays] [OpenCL device::GeForce GT 430Intersect] OpenCL image support is available [LuxRays] [OpenCL device::GeForce GT 430Intersect] OpenCL max. image buffer size: 8192x8192 [LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH node image buffer size: 553x553 [LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH triangle image buffer size: 940x927 [LuxRays] [OpenCL device::GeForce GT 430Intersect] Enabled QBVH scene storage inside image [LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH max. stack size: 24 [LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH Image Storage kernel work group size: 512 [LuxRays] [OpenCL device::GeForce GT 430Intersect] Suggested work group size: 512 [LuxRays] [OpenCL device::GeForce GT 430Intersect] Forced work group size: 64 [sLG] Configuring 1 CPU render threads [sLG] [PathOCLRenderThread] OpenCL task count: 262144 [sLG] [PathOCLRenderThread] OpenCL max. page memory size: 262144Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Compile Camera [sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry [sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 13ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Materials [sLG] [PathOCLRenderThread::CompiledScene] Compile Textures [sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights [sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight [sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight [sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight [sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps [sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 1 [sLG] [PathOCLRenderThread::CompiledScene] RGB channel page 0 size: 6144Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 2ms [sLG] Starting 1 PathOCL render threads [sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4128Kbytes [sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes [sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] UVs buffer size: 1744Kbytes [sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes [sLG] [PathOCLRenderThread::0] ImageMaps description buffer size: 20bytes [sLG] [PathOCLRenderThread::0] ImageMaps buffer size: 6144Kbytes [sLG] [PathOCLRenderThread::0] Textures buffer size: 144bytes [sLG] [PathOCLRenderThread::0] Materials buffer size: 192bytes [sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 20bytes [sLG] [PathOCLRenderThread::0] InfiniteLight buffer size: 52bytes [sLG] [PathOCLRenderThread::0] Ray buffer size: 12288Kbytes [sLG] [PathOCLRenderThread::0] RayHit buffer size: 4096Kbytes [sLG] [PathOCLRenderThread::0] GPUTask buffer size: 53248Kbytes [sLG] [PathOCLRenderThread::0] Sample dimensions: 28 [sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes [sLG] [PathOCLRenderThread::0] SampleData buffer size: 2048Kbytes [sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 1024Kbytes [sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms [sLG] [PathOCLRenderThread::0] Sobol table size: 896 [sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=262144 -D PARAM_IMAGE_WIDTH=512 -D PARAM_IMAGE_HEIGHT=512 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=8 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_CAMERA_HAS_DOF -D PARAM_HAS_INFINITELIGHT -D PARAM_HAS_IMAGEMAPS -D PARAM_IMAGEMAPS_PAGE_0 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=2 -D PARAM_SAMPLER_SOBOL_MAXDEPTH=8 [sLG] [PathOCLRenderThread::0] Compiling kernels [sLG] [PathOCLRenderThread::0] Kernels not cached [sLG] [PathOCLRenderThread::0] Compiling Init Kernel [sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel [sLG] [PathOCLRenderThread::0] Kernels compilation time: 6ms Leslies-Mac-Pro:OSX_AMD_slg4_proof leslie$ oclinfo: Last login: Thu Feb 14 07:03:03 on ttys000 /Applications/Utilities/oclinfo-0.2/oclinfo ; exit; Leslies-Mac-Pro:~ leslie$ /Applications/Utilities/oclinfo-0.2/oclinfo ; exit; 1 OpenCL platform found! [Platform 0] Name: Apple Vendor: Apple Version: OpenCL 1.2 (Aug 24 2012 00:53:09) Profile: FULL_PROFILE Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event [OpenCL-only Context] 2 OpenCL devices found! [Device 0] Name: Intel® Core2 Duo CPU E6550 @ 2.33GHz Vendor: Intel Type: CPU Device Version: OpenCL 1.2 Driver Version: 1.1 Compute Units: 2 Work Group Size: 1024 Clock: 2327 MHz Global Memory (Total): 8192 MB Global Memory (Host): 8192 MB Global Memory (PCIe): 0 MB Local Memory: 32 KB Cache Size: 0.0625 KB Cache Line Size: 4194304 Bytes Available: Yes Double-Precision: Yes Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_APPLE_fp64_basic_ops cl_APPLE_fixed_alpha_channel_orders cl_APPLE_biased_fixed_point_image_formats [Device 1] Name: GeForce GT 430 Vendor: NVIDIA Type: GPU Device Version: OpenCL 1.1 Driver Version: CLH 1.0 Compute Units: 2 Work Group Size: 1024 Clock: 1400 MHz Global Memory: 1024 MB Local Memory: 48 KB Cache Size: 0 KB Cache Line Size: 0 Bytes Available: Yes Double-Precision: No Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_APPLE_fp64_basic_ops [shared OpenCL+OpenGL Context] 2 OpenCL devices found! [Device 0] Name: GeForce GT 430 Vendor: NVIDIA Type: GPU Device Version: OpenCL 1.1 Driver Version: CLH 1.0 Compute Units: 2 Work Group Size: 1024 Clock: 1400 MHz Global Memory: 1024 MB Local Memory: 48 KB Cache Size: 0 KB Cache Line Size: 0 Bytes Available: Yes Double-Precision: No Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_APPLE_fp64_basic_ops [Device 1] Name: Intel® Core2 Duo CPU E6550 @ 2.33GHz Vendor: Intel Type: CPU Device Version: OpenCL 1.2 Driver Version: 1.1 Compute Units: 2 Work Group Size: 1024 Clock: 2327 MHz Global Memory (Total): 8192 MB Global Memory (Host): 8192 MB Global Memory (PCIe): 0 MB Local Memory: 32 KB Cache Size: 0.0625 KB Cache Line Size: 4194304 Bytes Available: Yes Double-Precision: Yes Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_APPLE_fp64_basic_ops cl_APPLE_fixed_alpha_channel_orders cl_APPLE_biased_fixed_point_image_formats logout [Process completed] Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1888787 Share on other sites More sharing options...
mitch_de Posted February 14, 2013 Author Share Posted February 14, 2013 Thanks. Does the slg4 ( first post DL ) also run on your GT 430? Fermi based gpus reported always crashing in OpenCL part. Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1888849 Share on other sites More sharing options...
RobertX Posted February 14, 2013 Share Posted February 14, 2013 ...hi Mitch, the file from post #1 crashes here... Leslies-Mac-Pro:SLG4_alpha leslie$ /Users/leslie/Downloads/SLG4_alpha/slg4 [sLG] Reading configuration file: scenes/luxball/luxball.cfg [sLG] Configuration: [sLG] image.width = 640 [sLG] image.height = 480 [sLG] batch.halttime = 0 [sLG] scene.file = scenes/luxball/luxball.scn [sLG] scene.fieldofview = 45 [sLG] opencl.cpu.use = 0 [sLG] opencl.gpu.use = 1 [sLG] opencl.gpu.workgroup.size = 64 [sLG] screen.refresh.interval = 100 [sLG] film.gamma = 2.2 [sLG] path.maxdepth = 16 [sLG] path.russianroulette.depth = 5 [sLG] light.maxdepth = 16 [sLG] light.russianroulette.depth = 5 [LuxRays::SDL] Reading scene: scenes/luxball/luxball.scn [LuxRays::SDL] Camera postion: Point[0.5, -1.4, 1.7] [LuxRays::SDL] Camera target: Point[0, 0, 0.4] [LuxRays::SDL] Material definition: shell [LuxRays::SDL] Material definition: whitematte [LuxRays::SDL] Material definition: luxtext [LuxRays::SDL] Material definition: blacktext [LuxRays::SDL] Material definition: whitelight [LuxRays::SDL] The l1 object is a light sources with 2 triangles [LuxRays::SDL] The l2 object is a light sources with 2 triangles [LuxRays::SDL] PLY object count: 7 [LuxRays] OpenCL Platform 0: Apple [LuxRays] Device 0 name: NativeThread-000 [LuxRays] Device 0 type: NATIVE_THREAD [LuxRays] Device 0 compute units: 1 [LuxRays] Device 0 max allocable memory: 0MBytes [LuxRays] Device 0 max allocable memory block size: 0MBytes [LuxRays] Device 1 name: NativeThread-001 [LuxRays] Device 1 type: NATIVE_THREAD [LuxRays] Device 1 compute units: 1 [LuxRays] Device 1 max allocable memory: 0MBytes [LuxRays] Device 1 max allocable memory block size: 0MBytes [LuxRays] Device 2 name: Intel® Core™2 Duo CPU E6550 @ 2.33GHz [LuxRays] Device 2 type: OPENCL_CPU [LuxRays] Device 2 compute units: 2 [LuxRays] Device 2 max allocable memory: 8192MBytes [LuxRays] Device 2 max allocable memory block size: 2048MBytes [LuxRays] Device 3 name: GeForce GT 430 [LuxRays] Device 3 type: OPENCL_GPU [LuxRays] Device 3 compute units: 2 [LuxRays] Device 3 max allocable memory: 1024MBytes [LuxRays] Device 3 max allocable memory block size: 256MBytes [LuxRays] Preprocessing DataSet [LuxRays] Total vertex count: 223380 [LuxRays] Total triangle count: 260794 [LuxRays] Total vertices memory usage: 2617Kbytes [LuxRays] Total triangles memory usage: 3056Kbytes [LuxRays] Building QBVH, primitives: 260794, initial nodes: 21734 [LuxRays] QBVH completed with 43467/43468 nodes [LuxRays] Total QBVH memory usage: 4754Kbytes [LuxRays] Total QBVH QuadTriangle count: 87103 [LuxRays] Max. QBVH Depth: 25 [LuxRays] Creating 1 intersection device(s) [LuxRays] Allocating intersection device 0: GeForce GT 430 (Type = OPENCL_GPU) [sLG] OpenCL Devices used: [sLG] [GeForce GT 430Intersect] [LuxRays] [OpenCL device::GeForce GT 430Intersect] Ray buffer size: 3072Kbytes [LuxRays] [OpenCL device::GeForce GT 430Intersect] Ray hits buffer size: 1024Kbytes [LuxRays] [OpenCL device::GeForce GT 430Intersect] OpenCL image support is available [LuxRays] [OpenCL device::GeForce GT 430Intersect] OpenCL max. image buffer size: 8192x8192 [LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH node image buffer size: 553x551 [LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH triangle image buffer size: 940x927 [LuxRays] [OpenCL device::GeForce GT 430Intersect] Enabled QBVH scene storage inside image [LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH max. stack size: 24 [LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH Image Storage kernel work group size: 512 [LuxRays] [OpenCL device::GeForce GT 430Intersect] Suggested work group size: 512 [LuxRays] [OpenCL device::GeForce GT 430Intersect] Forced work group size: 64 [sLG] Configuring 1 CPU render threads [sLG] [PathOCLRenderThread] OpenCL task count: 65536 [sLG] [PathOCLRenderThread] OpenCL max. page memory size: 262144Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Compile Camera [sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry [sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 12ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Materials [sLG] [PathOCLRenderThread::CompiledScene] Compile Textures [sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights [sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 1ms [sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight [sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight [sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight [sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps [sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 0 [sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 0ms [sLG] Starting 1 PathOCL render threads [sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4835Kbytes [sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes [sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] UVs buffer size: 1745Kbytes [sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes [sLG] [PathOCLRenderThread::0] Textures buffer size: 324bytes [sLG] [PathOCLRenderThread::0] Materials buffer size: 160bytes [sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 28bytes [sLG] [PathOCLRenderThread::0] Triangle AreaLights buffer size: 272bytes [sLG] [PathOCLRenderThread::0] Triangle AreaLights index buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Ray buffer size: 3072Kbytes [sLG] [PathOCLRenderThread::0] RayHit buffer size: 1024Kbytes [sLG] [PathOCLRenderThread::0] GPUTask buffer size: 23296Kbytes [sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes [sLG] [PathOCLRenderThread::0] SampleData buffer size: 512Kbytes [sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 256Kbytes [sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms [sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=65536 -D PARAM_IMAGE_WIDTH=640 -D PARAM_IMAGE_HEIGHT=480 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=16 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_ENABLE_MAT_GLASS -D PARAM_HAS_PASSTHROUGH -D PARAM_DIRECT_LIGHT_SAMPLING -D PARAM_DL_LIGHT_COUNT=4 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=0 [sLG] [PathOCLRenderThread::0] Compiling kernels [sLG] [PathOCLRenderThread::0] Kernels not cached [sLG] [PathOCLRenderThread::0] Compiling Init Kernel [sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel [sLG] [PathOCLRenderThread::0] Kernels compilation time: 5ms [sLG] [PathOCLRenderThread::0] Rendering thread ERROR: clFlush(CL_INVALID_COMMAND_QUEUE) [sLG] ========================================================= [sLG] Unhandled exception [sLG] Obtained 5 stack frames. [sLG] 0 [sLG] 1 [sLG] 2 [sLG] 3 [sLG] 4 Abort trap: 6 Leslies-Mac-Pro:SLG4_alpha leslie$ Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1888954 Share on other sites More sharing options...
npwski Posted February 15, 2013 Share Posted February 15, 2013 Fermi based gpus reported always crashing in OpenCL part. Yes, it crashes Fermi GTX580: SLG4_alpha np$ ./slg4 [sLG] Reading configuration file: scenes/luxball/luxball.cfg [sLG] Configuration: [sLG] image.width = 640 [sLG] image.height = 480 [sLG] batch.halttime = 0 [sLG] scene.file = scenes/luxball/luxball.scn [sLG] scene.fieldofview = 45 [sLG] opencl.cpu.use = 0 [sLG] opencl.gpu.use = 1 [sLG] opencl.gpu.workgroup.size = 64 [sLG] screen.refresh.interval = 100 [sLG] film.gamma = 2.2 [sLG] path.maxdepth = 16 [sLG] path.russianroulette.depth = 5 [sLG] light.maxdepth = 16 [sLG] light.russianroulette.depth = 5 [LuxRays::SDL] Reading scene: scenes/luxball/luxball.scn [LuxRays::SDL] Camera postion: Point[0.5, -1.4, 1.7] [LuxRays::SDL] Camera target: Point[0, 0, 0.4] [LuxRays::SDL] Material definition: shell [LuxRays::SDL] Material definition: whitematte [LuxRays::SDL] Material definition: luxtext [LuxRays::SDL] Material definition: blacktext [LuxRays::SDL] Material definition: whitelight [LuxRays::SDL] The l1 object is a light sources with 2 triangles [LuxRays::SDL] The l2 object is a light sources with 2 triangles [LuxRays::SDL] PLY object count: 7 [LuxRays] OpenCL Platform 0: Apple [LuxRays] Device 0 name: NativeThread-000 [LuxRays] Device 0 type: NATIVE_THREAD [LuxRays] Device 0 compute units: 1 [LuxRays] Device 0 max allocable memory: 0MBytes [LuxRays] Device 0 max allocable memory block size: 0MBytes [LuxRays] Device 1 name: NativeThread-001 [LuxRays] Device 1 type: NATIVE_THREAD [LuxRays] Device 1 compute units: 1 [LuxRays] Device 1 max allocable memory: 0MBytes [LuxRays] Device 1 max allocable memory block size: 0MBytes [LuxRays] Device 2 name: NativeThread-002 [LuxRays] Device 2 type: NATIVE_THREAD [LuxRays] Device 2 compute units: 1 [LuxRays] Device 2 max allocable memory: 0MBytes [LuxRays] Device 2 max allocable memory block size: 0MBytes [LuxRays] Device 3 name: NativeThread-003 [LuxRays] Device 3 type: NATIVE_THREAD [LuxRays] Device 3 compute units: 1 [LuxRays] Device 3 max allocable memory: 0MBytes [LuxRays] Device 3 max allocable memory block size: 0MBytes [LuxRays] Device 4 name: NativeThread-004 [LuxRays] Device 4 type: NATIVE_THREAD [LuxRays] Device 4 compute units: 1 [LuxRays] Device 4 max allocable memory: 0MBytes [LuxRays] Device 4 max allocable memory block size: 0MBytes [LuxRays] Device 5 name: NativeThread-005 [LuxRays] Device 5 type: NATIVE_THREAD [LuxRays] Device 5 compute units: 1 [LuxRays] Device 5 max allocable memory: 0MBytes [LuxRays] Device 5 max allocable memory block size: 0MBytes [LuxRays] Device 6 name: NativeThread-006 [LuxRays] Device 6 type: NATIVE_THREAD [LuxRays] Device 6 compute units: 1 [LuxRays] Device 6 max allocable memory: 0MBytes [LuxRays] Device 6 max allocable memory block size: 0MBytes [LuxRays] Device 7 name: NativeThread-007 [LuxRays] Device 7 type: NATIVE_THREAD [LuxRays] Device 7 compute units: 1 [LuxRays] Device 7 max allocable memory: 0MBytes [LuxRays] Device 7 max allocable memory block size: 0MBytes [LuxRays] Device 8 name: Intel® Core i7-2700K CPU @ 3.50GHz [LuxRays] Device 8 type: OPENCL_CPU [LuxRays] Device 8 compute units: 8 [LuxRays] Device 8 max allocable memory: 8192MBytes [LuxRays] Device 8 max allocable memory block size: 2048MBytes [LuxRays] Device 9 name: GeForce GTX 580 [LuxRays] Device 9 type: OPENCL_GPU [LuxRays] Device 9 compute units: 64 [LuxRays] Device 9 max allocable memory: 1536MBytes [LuxRays] Device 9 max allocable memory block size: 384MBytes [LuxRays] Preprocessing DataSet [LuxRays] Total vertex count: 223380 [LuxRays] Total triangle count: 260794 [LuxRays] Total vertices memory usage: 2617Kbytes [LuxRays] Total triangles memory usage: 3056Kbytes [LuxRays] Building QBVH, primitives: 260794, initial nodes: 21734 [LuxRays] QBVH completed with 43467/43468 nodes [LuxRays] Total QBVH memory usage: 4754Kbytes [LuxRays] Total QBVH QuadTriangle count: 87103 [LuxRays] Max. QBVH Depth: 25 [LuxRays] Creating 1 intersection device(s) [LuxRays] Allocating intersection device 0: GeForce GTX 580 (Type = OPENCL_GPU) [sLG] OpenCL Devices used: [sLG] [GeForce GTX 580Intersect] [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Ray buffer size: 3072Kbytes [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Ray hits buffer size: 1024Kbytes [LuxRays] [OpenCL device::GeForce GTX 580Intersect] OpenCL image support is available [LuxRays] [OpenCL device::GeForce GTX 580Intersect] OpenCL max. image buffer size: 8192x8192 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH node image buffer size: 553x551 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH triangle image buffer size: 940x927 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Enabled QBVH scene storage inside image [LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH max. stack size: 24 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH Image Storage kernel work group size: 512 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Suggested work group size: 512 [LuxRays] [OpenCL device::GeForce GTX 580Intersect] Forced work group size: 64 [sLG] Configuring 1 CPU render threads [sLG] [PathOCLRenderThread] OpenCL task count: 65536 [sLG] [PathOCLRenderThread] OpenCL max. page memory size: 393216Kbytes [sLG] [PathOCLRenderThread::CompiledScene] Compile Camera [sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry [sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 6ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Materials [sLG] [PathOCLRenderThread::CompiledScene] Compile Textures [sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights [sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight [sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms [sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight [sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight [sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps [sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 0 [sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 0ms [sLG] Starting 1 PathOCL render threads [sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4835Kbytes [sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes [sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] UVs buffer size: 1745Kbytes [sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes [sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes [sLG] [PathOCLRenderThread::0] Textures buffer size: 324bytes [sLG] [PathOCLRenderThread::0] Materials buffer size: 160bytes [sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 28bytes [sLG] [PathOCLRenderThread::0] Triangle AreaLights buffer size: 272bytes [sLG] [PathOCLRenderThread::0] Triangle AreaLights index buffer size: 1018Kbytes [sLG] [PathOCLRenderThread::0] Ray buffer size: 3072Kbytes [sLG] [PathOCLRenderThread::0] RayHit buffer size: 1024Kbytes [sLG] [PathOCLRenderThread::0] GPUTask buffer size: 23296Kbytes [sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes [sLG] [PathOCLRenderThread::0] SampleData buffer size: 512Kbytes [sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 256Kbytes [sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms [sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=65536 -D PARAM_IMAGE_WIDTH=640 -D PARAM_IMAGE_HEIGHT=480 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=16 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_ENABLE_MAT_GLASS -D PARAM_HAS_PASSTHROUGH -D PARAM_DIRECT_LIGHT_SAMPLING -D PARAM_DL_LIGHT_COUNT=4 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=0 [sLG] [PathOCLRenderThread::0] Compiling kernels [sLG] [PathOCLRenderThread::0] Kernels not cached [sLG] [PathOCLRenderThread::0] Compiling Init Kernel [sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel [sLG] [PathOCLRenderThread::0] Kernels compilation time: 220ms [sLG] [PathOCLRenderThread::0] Rendering thread ERROR: clFlush(CL_INVALID_COMMAND_QUEUE) [sLG] ========================================================= [sLG] Unhandled exception [sLG] Obtained 5 stack frames. [sLG] 0 [sLG] 1 [sLG] 2 [sLG] 3 [sLG] 4 Abort trap: 6 Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1889007 Share on other sites More sharing options...
mitch_de Posted February 17, 2013 Author Share Posted February 17, 2013 New version and tests. One test red ball and two with textures. Test run on CPU or GPU and should look same - and of course should not crash http://www.jensverwi...full_check2.zip Please read + open also my little changed instructions for testing. Instructions_for_testing.txt First start time of texture tests may take up to 30 sec compile time (of OpenCL part). Second runs takes 1 sec. PS: Results for textured and checkerboard test looks on GPU (compared to CPU) more dark on Nvidia. Thats GPU checkboard test (darker than running on CPU). Happy testing & hope not always crashing on Nvidia fermi. You can also submit your M samples/sec for those texture / chckerbaord GPU test. My 9600 GT is slow with 0.13 M samles/sec. My C2D CPU is a bit faster. Link to comment https://www.insanelymac.com/forum/topic/285685-smallluxgpu-4-alpha-nvidia-amd-tests-ended-thanks/#findComment-1889852 Share on other sites More sharing options...
Recommended Posts