Jump to content
29 posts in this topic

Recommended Posts

Hi,

smallluxgpu (SLG) is an OpenCL powered gpu raytraycing used in Luxrender and well known Luxmark.

The devs work on SLG4 and need to know which OS X / GPU combinations work / not work (errors in terminal insted luxball window).

 

EDIT: AMD OpenCL drivers seems to have an BUG which does overbrighten the result.

EDIT: 18.02. new test version :)

EDIT: try, new special scenes with slg4 test from post #25

 

 

 

We have collected AMD reports THANKS.

You may check if older, non Fermi gpus work like my 9600GT.

 

Bildschirmfoto 2013-01-15 um 16.34.59.jpg

 

Please test it and report:

1. OS X version, 2. gpu type, 3. speed : xyM samples/sec

 

how to:

download the zip, unpack it somewhere, start the terminal, cd path_to_main folder_slg4 (hint: type cd SPACE and than drag& drop the folder into the terminal + ENTER),

copy (from Textedit opened Instructions_for_testing.txt the commands into the terminal ENTER

 

ESC key quits slg.

 

EDIT: try, new special scenes with slg4 test from post #25

 

results: mostly gpu speed bound but also cpu speed has an minor effect for speed!

Nvidia 9600 GT, OS X 10.8.3 D50, 0.13M samples/sec, WORKING

AMD 5570, 10.8.2, 0.86M, BUG

AMD 6870, 10.8.3 D50, 1.37M, BUG

log - working, no errors:

...

[sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms

[sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=65536 -D PARAM_IMAGE_WIDTH=640 -D PARAM_IMAGE_HEIGHT=480 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=16 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_ENABLE_MAT_GLASS -D PARAM_HAS_PASSTHROUGH -D PARAM_DIRECT_LIGHT_SAMPLING -D PARAM_DL_LIGHT_COUNT=4 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=0

[sLG] [PathOCLRenderThread::0] Compiling kernels

[sLG] [PathOCLRenderThread::0] Kernels not cached

[sLG] [PathOCLRenderThread::0] Compiling Init Kernel

[sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel

[sLG] [PathOCLRenderThread::0] Kernels compilation time: 4ms

[sLG] Done.

bash-3.2# cd /Users/iFIRE/Downloads/SLG4_alpha

bash-3.2# /Users/iFIRE/Downloads/SLG4_alpha/slg4

[sLG] Reading configuration file: scenes/luxball/luxball.cfg

[sLG] Configuration:

[sLG] image.width = 640

[sLG] image.height = 480

[sLG] batch.halttime = 0

[sLG] scene.file = scenes/luxball/luxball.scn

[sLG] scene.fieldofview = 45

[sLG] opencl.cpu.use = 0

[sLG] opencl.gpu.use = 1

[sLG] opencl.gpu.workgroup.size = 64

[sLG] screen.refresh.interval = 100

[sLG] film.gamma = 2.2

[sLG] path.maxdepth = 16

[sLG] path.russianroulette.depth = 5

[sLG] light.maxdepth = 16

[sLG] light.russianroulette.depth = 5

[LuxRays::SDL] Reading scene: scenes/luxball/luxball.scn

[LuxRays::SDL] Camera postion: Point[0.5, -1.4, 1.7]

[LuxRays::SDL] Camera target: Point[0, 0, 0.4]

[LuxRays::SDL] Material definition: shell

[LuxRays::SDL] Material definition: whitematte

[LuxRays::SDL] Material definition: luxtext

[LuxRays::SDL] Material definition: blacktext

[LuxRays::SDL] Material definition: whitelight

[LuxRays::SDL] The l1 object is a light sources with 2 triangles

[LuxRays::SDL] The l2 object is a light sources with 2 triangles

[LuxRays::SDL] PLY object count: 7

[LuxRays] OpenCL Platform 0: Apple

[LuxRays] Device 0 name: NativeThread-000

[LuxRays] Device 0 type: NATIVE_THREAD

[LuxRays] Device 0 compute units: 1

[LuxRays] Device 0 max allocable memory: 0MBytes

[LuxRays] Device 0 max allocable memory block size: 0MBytes

[LuxRays] Device 1 name: NativeThread-001

[LuxRays] Device 1 type: NATIVE_THREAD

[LuxRays] Device 1 compute units: 1

[LuxRays] Device 1 max allocable memory: 0MBytes

[LuxRays] Device 1 max allocable memory block size: 0MBytes

[LuxRays] Device 2 name: NativeThread-002

[LuxRays] Device 2 type: NATIVE_THREAD

[LuxRays] Device 2 compute units: 1

[LuxRays] Device 2 max allocable memory: 0MBytes

[LuxRays] Device 2 max allocable memory block size: 0MBytes

[LuxRays] Device 3 name: NativeThread-003

[LuxRays] Device 3 type: NATIVE_THREAD

[LuxRays] Device 3 compute units: 1

[LuxRays] Device 3 max allocable memory: 0MBytes

[LuxRays] Device 3 max allocable memory block size: 0MBytes

[LuxRays] Device 4 name: NativeThread-004

[LuxRays] Device 4 type: NATIVE_THREAD

[LuxRays] Device 4 compute units: 1

[LuxRays] Device 4 max allocable memory: 0MBytes

[LuxRays] Device 4 max allocable memory block size: 0MBytes

[LuxRays] Device 5 name: NativeThread-005

[LuxRays] Device 5 type: NATIVE_THREAD

[LuxRays] Device 5 compute units: 1

[LuxRays] Device 5 max allocable memory: 0MBytes

[LuxRays] Device 5 max allocable memory block size: 0MBytes

[LuxRays] Device 6 name: NativeThread-006

[LuxRays] Device 6 type: NATIVE_THREAD

[LuxRays] Device 6 compute units: 1

[LuxRays] Device 6 max allocable memory: 0MBytes

[LuxRays] Device 6 max allocable memory block size: 0MBytes

[LuxRays] Device 7 name: NativeThread-007

[LuxRays] Device 7 type: NATIVE_THREAD

[LuxRays] Device 7 compute units: 1

[LuxRays] Device 7 max allocable memory: 0MBytes

[LuxRays] Device 7 max allocable memory block size: 0MBytes

[LuxRays] Device 8 name: Intel® Core™ i7 CPU 860 @ 2.80GHz

[LuxRays] Device 8 type: OPENCL_CPU

[LuxRays] Device 8 compute units: 8

[LuxRays] Device 8 max allocable memory: 4096MBytes

[LuxRays] Device 8 max allocable memory block size: 1024MBytes

[LuxRays] Device 9 name: ATI Radeon HD 5770

[LuxRays] Device 9 type: OPENCL_GPU

[LuxRays] Device 9 compute units: 10

[LuxRays] Device 9 max allocable memory: 1024MBytes

[LuxRays] Device 9 max allocable memory block size: 256MBytes

[LuxRays] Preprocessing DataSet

[LuxRays] Total vertex count: 223380

[LuxRays] Total triangle count: 260794

[LuxRays] Total vertices memory usage: 2617Kbytes

[LuxRays] Total triangles memory usage: 3056Kbytes

[LuxRays] Building QBVH, primitives: 260794, initial nodes: 21734

[LuxRays] QBVH completed with 43467/43468 nodes

[LuxRays] Total QBVH memory usage: 4754Kbytes

[LuxRays] Total QBVH QuadTriangle count: 87103

[LuxRays] Max. QBVH Depth: 25

[LuxRays] Creating 1 intersection device(s)

[LuxRays] Allocating intersection device 0: ATI Radeon HD 5770 (Type = OPENCL_GPU)

[sLG] OpenCL Devices used:

[sLG] [ATI Radeon HD 5770Intersect]

[LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] Ray buffer size: 3072Kbytes

[LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] Ray hits buffer size: 1024Kbytes

[LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] OpenCL image support is available

[LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] OpenCL max. image buffer size: 8192x8192

[LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] QBVH node image buffer size: 553x551

[LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] QBVH triangle image buffer size: 940x927

[LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] Enabled QBVH scene storage inside image

[LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] QBVH max. stack size: 24

[LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] QBVH Image Storage kernel work group size: 256

[LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] Suggested work group size: 256

[LuxRays] [OpenCL device::ATI Radeon HD 5770Intersect] Forced work group size: 64

[sLG] Configuring 1 CPU render threads

[sLG] [PathOCLRenderThread] OpenCL task count: 65536

[sLG] [PathOCLRenderThread] OpenCL max. page memory size: 262144Kbytes

[sLG] [PathOCLRenderThread::CompiledScene] Compile Camera

[sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry

[sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 14ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Materials

[sLG] [PathOCLRenderThread::CompiledScene] Compile Textures

[sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights

[sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 1ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight

[sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps

[sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 0

[sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 0ms

[sLG] Starting 1 PathOCL render threads

[sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4835Kbytes

[sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes

[sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes

[sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] UVs buffer size: 1745Kbytes

[sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes

[sLG] [PathOCLRenderThread::0] Textures buffer size: 324bytes

[sLG] [PathOCLRenderThread::0] Materials buffer size: 160bytes

[sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 28bytes

[sLG] [PathOCLRenderThread::0] Triangle AreaLights buffer size: 272bytes

[sLG] [PathOCLRenderThread::0] Triangle AreaLights index buffer size: 1018Kbytes

[sLG] [PathOCLRenderThread::0] Ray buffer size: 3072Kbytes

[sLG] [PathOCLRenderThread::0] RayHit buffer size: 1024Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask buffer size: 23296Kbytes

[sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes

[sLG] [PathOCLRenderThread::0] SampleData buffer size: 512Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 256Kbytes

[sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms

[sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=65536 -D PARAM_IMAGE_WIDTH=640 -D PARAM_IMAGE_HEIGHT=480 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=16 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_ENABLE_MAT_GLASS -D PARAM_HAS_PASSTHROUGH -D PARAM_DIRECT_LIGHT_SAMPLING -D PARAM_DL_LIGHT_COUNT=4 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=0

[sLG] [PathOCLRenderThread::0] Compiling kernels

[sLG] [PathOCLRenderThread::0] Kernels not cached

[sLG] [PathOCLRenderThread::0] Compiling Init Kernel

[sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel

[sLG] [PathOCLRenderThread::0] Kernels compilation time: 743ms

[sLG] Done.

bash-3.2#

 

post-333439-0-37860500-1358272568_thumb.png HD 5770

10.8.3 12D50

GTX 580

 

App comes up as black screen then locks up:

 

 

[sLG] Reading configuration file: scenes/luxball/luxball.cfg

[sLG] Configuration:

[sLG] image.width = 640

[sLG] image.height = 480

[sLG] batch.halttime = 0

[sLG] scene.file = scenes/luxball/luxball.scn

[sLG] scene.fieldofview = 45

[sLG] opencl.cpu.use = 0

[sLG] opencl.gpu.use = 1

[sLG] opencl.gpu.workgroup.size = 64

[sLG] screen.refresh.interval = 100

[sLG] film.gamma = 2.2

[sLG] path.maxdepth = 16

[sLG] path.russianroulette.depth = 5

[sLG] light.maxdepth = 16

[sLG] light.russianroulette.depth = 5

[LuxRays::SDL] Reading scene: scenes/luxball/luxball.scn

[LuxRays::SDL] Camera postion: Point[0.5, -1.4, 1.7]

[LuxRays::SDL] Camera target: Point[0, 0, 0.4]

[LuxRays::SDL] Material definition: shell

[LuxRays::SDL] Material definition: whitematte

[LuxRays::SDL] Material definition: luxtext

[LuxRays::SDL] Material definition: blacktext

[LuxRays::SDL] Material definition: whitelight

[LuxRays::SDL] The l1 object is a light sources with 2 triangles

[LuxRays::SDL] The l2 object is a light sources with 2 triangles

[LuxRays::SDL] PLY object count: 7

[LuxRays] OpenCL Platform 0: Apple

[LuxRays] Device 0 name: NativeThread-000

[LuxRays] Device 0 type: NATIVE_THREAD

[LuxRays] Device 0 compute units: 1

[LuxRays] Device 0 max allocable memory: 0MBytes

[LuxRays] Device 0 max allocable memory block size: 0MBytes

[LuxRays] Device 1 name: NativeThread-001

[LuxRays] Device 1 type: NATIVE_THREAD

[LuxRays] Device 1 compute units: 1

[LuxRays] Device 1 max allocable memory: 0MBytes

[LuxRays] Device 1 max allocable memory block size: 0MBytes

[LuxRays] Device 2 name: NativeThread-002

[LuxRays] Device 2 type: NATIVE_THREAD

[LuxRays] Device 2 compute units: 1

[LuxRays] Device 2 max allocable memory: 0MBytes

[LuxRays] Device 2 max allocable memory block size: 0MBytes

[LuxRays] Device 3 name: NativeThread-003

[LuxRays] Device 3 type: NATIVE_THREAD

[LuxRays] Device 3 compute units: 1

[LuxRays] Device 3 max allocable memory: 0MBytes

[LuxRays] Device 3 max allocable memory block size: 0MBytes

[LuxRays] Device 4 name: NativeThread-004

[LuxRays] Device 4 type: NATIVE_THREAD

[LuxRays] Device 4 compute units: 1

[LuxRays] Device 4 max allocable memory: 0MBytes

[LuxRays] Device 4 max allocable memory block size: 0MBytes

[LuxRays] Device 5 name: NativeThread-005

[LuxRays] Device 5 type: NATIVE_THREAD

[LuxRays] Device 5 compute units: 1

[LuxRays] Device 5 max allocable memory: 0MBytes

[LuxRays] Device 5 max allocable memory block size: 0MBytes

[LuxRays] Device 6 name: NativeThread-006

[LuxRays] Device 6 type: NATIVE_THREAD

[LuxRays] Device 6 compute units: 1

[LuxRays] Device 6 max allocable memory: 0MBytes

[LuxRays] Device 6 max allocable memory block size: 0MBytes

[LuxRays] Device 7 name: NativeThread-007

[LuxRays] Device 7 type: NATIVE_THREAD

[LuxRays] Device 7 compute units: 1

[LuxRays] Device 7 max allocable memory: 0MBytes

[LuxRays] Device 7 max allocable memory block size: 0MBytes

[LuxRays] Device 8 name: Intel® Core i7-3820 CPU @ 3.60GHz

[LuxRays] Device 8 type: OPENCL_CPU

[LuxRays] Device 8 compute units: 8

[LuxRays] Device 8 max allocable memory: 65536MBytes

[LuxRays] Device 8 max allocable memory block size: 16384MBytes

[LuxRays] Device 9 name: GeForce GTX 580

[LuxRays] Device 9 type: OPENCL_GPU

[LuxRays] Device 9 compute units: 16

[LuxRays] Device 9 max allocable memory: 1536MBytes

[LuxRays] Device 9 max allocable memory block size: 384MBytes

[LuxRays] Preprocessing DataSet

[LuxRays] Total vertex count: 223380

[LuxRays] Total triangle count: 260794

[LuxRays] Total vertices memory usage: 2617Kbytes

[LuxRays] Total triangles memory usage: 3056Kbytes

[LuxRays] Building QBVH, primitives: 260794, initial nodes: 21734

[LuxRays] QBVH completed with 43467/43468 nodes

[LuxRays] Total QBVH memory usage: 4754Kbytes

[LuxRays] Total QBVH QuadTriangle count: 87103

[LuxRays] Max. QBVH Depth: 25

[LuxRays] Creating 1 intersection device(s)

[LuxRays] Allocating intersection device 0: GeForce GTX 580 (Type = OPENCL_GPU)

[sLG] OpenCL Devices used:

[sLG] [GeForce GTX 580Intersect]

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Ray buffer size: 3072Kbytes

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Ray hits buffer size: 1024Kbytes

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] OpenCL image support is available

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] OpenCL max. image buffer size: 8192x8192

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH node image buffer size: 553x551

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH triangle image buffer size: 940x927

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Enabled QBVH scene storage inside image

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH max. stack size: 24

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH Image Storage kernel work group size: 512

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Suggested work group size: 512

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Forced work group size: 64

[sLG] Configuring 1 CPU render threads

[sLG] [PathOCLRenderThread] OpenCL task count: 65536

[sLG] [PathOCLRenderThread] OpenCL max. page memory size: 393216Kbytes

[sLG] [PathOCLRenderThread::CompiledScene] Compile Camera

[sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry

[sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 6ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Materials

[sLG] [PathOCLRenderThread::CompiledScene] Compile Textures

[sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights

[sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight

[sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps

[sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 0

[sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 0ms

[sLG] Starting 1 PathOCL render threads

[sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4835Kbytes

[sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes

[sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes

[sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] UVs buffer size: 1745Kbytes

[sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes

[sLG] [PathOCLRenderThread::0] Textures buffer size: 324bytes

[sLG] [PathOCLRenderThread::0] Materials buffer size: 160bytes

[sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 28bytes

[sLG] [PathOCLRenderThread::0] Triangle AreaLights buffer size: 272bytes

[sLG] [PathOCLRenderThread::0] Triangle AreaLights index buffer size: 1018Kbytes

[sLG] [PathOCLRenderThread::0] Ray buffer size: 3072Kbytes

[sLG] [PathOCLRenderThread::0] RayHit buffer size: 1024Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask buffer size: 23296Kbytes

[sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes

[sLG] [PathOCLRenderThread::0] SampleData buffer size: 512Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 256Kbytes

[sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms

[sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=65536 -D PARAM_IMAGE_WIDTH=640 -D PARAM_IMAGE_HEIGHT=480 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=16 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_ENABLE_MAT_GLASS -D PARAM_HAS_PASSTHROUGH -D PARAM_DIRECT_LIGHT_SAMPLING -D PARAM_DL_LIGHT_COUNT=4 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=0

[sLG] [PathOCLRenderThread::0] Compiling kernels

[sLG] [PathOCLRenderThread::0] Kernels not cached

[sLG] [PathOCLRenderThread::0] Compiling Init Kernel

[sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel

[sLG] [PathOCLRenderThread::0] Kernels compilation time: 240ms

Thanks! To be sure that your GPU runs well with stable(older SLG), check if Luxmark (OpenCL Benchmark) works. If not it is an general prob with your OpenCL/Fermi combi. Normally Luxmark works also on Fermi gpus (if OpenCL works)

http://www.luxrender..._OSXIntel64.zip

10.8.2 (12C60)

GTX 560 Ti

 

 

 

Last login: Wed Jan 16 03:12:52 on console
localhost:~ victoriano$ cd /Users/victoriano/Downloads/SLG4_alpha
localhost:SLG4_alpha victoriano$ /Users/victoriano/Downloads/SLG4_alpha/slg4
[sLG] Reading configuration file: scenes/luxball/luxball.cfg
[sLG] Configuration:
[sLG] image.width = 640
[sLG] image.height = 480
[sLG] batch.halttime = 0
[sLG] scene.file = scenes/luxball/luxball.scn
[sLG] scene.fieldofview = 45
[sLG] opencl.cpu.use = 0
[sLG] opencl.gpu.use = 1
[sLG] opencl.gpu.workgroup.size = 64
[sLG] screen.refresh.interval = 100
[sLG] film.gamma = 2.2
[sLG] path.maxdepth = 16
[sLG] path.russianroulette.depth = 5
[sLG] light.maxdepth = 16
[sLG] light.russianroulette.depth = 5
[LuxRays::SDL] Reading scene: scenes/luxball/luxball.scn
[LuxRays::SDL] Camera postion: Point[0.5, -1.4, 1.7]
[LuxRays::SDL] Camera target: Point[0, 0, 0.4]
[LuxRays::SDL] Material definition: shell
[LuxRays::SDL] Material definition: whitematte
[LuxRays::SDL] Material definition: luxtext
[LuxRays::SDL] Material definition: blacktext
[LuxRays::SDL] Material definition: whitelight
[LuxRays::SDL] The l1 object is a light sources with 2 triangles
[LuxRays::SDL] The l2 object is a light sources with 2 triangles
[LuxRays::SDL] PLY object count: 7
[LuxRays] OpenCL Platform 0: Apple
[LuxRays] Device 0 name: NativeThread-000
[LuxRays] Device 0 type: NATIVE_THREAD
[LuxRays] Device 0 compute units: 1
[LuxRays] Device 0 max allocable memory: 0MBytes
[LuxRays] Device 0 max allocable memory block size: 0MBytes
[LuxRays] Device 1 name: NativeThread-001
[LuxRays] Device 1 type: NATIVE_THREAD
[LuxRays] Device 1 compute units: 1
[LuxRays] Device 1 max allocable memory: 0MBytes
[LuxRays] Device 1 max allocable memory block size: 0MBytes
[LuxRays] Device 2 name: Intel(R) Core(TM)2 Duo CPU	 E8400 @ 3.00GHz
[LuxRays] Device 2 type: OPENCL_CPU
[LuxRays] Device 2 compute units: 2
[LuxRays] Device 2 max allocable memory: 4096MBytes
[LuxRays] Device 2 max allocable memory block size: 1024MBytes
[LuxRays] Device 3 name: GeForce GTX 560 Ti
[LuxRays] Device 3 type: OPENCL_GPU
[LuxRays] Device 3 compute units: 16
[LuxRays] Device 3 max allocable memory: 1024MBytes
[LuxRays] Device 3 max allocable memory block size: 256MBytes
[LuxRays] Preprocessing DataSet
[LuxRays] Total vertex count: 223380
[LuxRays] Total triangle count: 260794
[LuxRays] Total vertices memory usage: 2617Kbytes
[LuxRays] Total triangles memory usage: 3056Kbytes
[LuxRays] Building QBVH, primitives: 260794, initial nodes: 21734
[LuxRays] QBVH completed with 43467/43468 nodes
[LuxRays] Total QBVH memory usage: 4754Kbytes
[LuxRays] Total QBVH QuadTriangle count: 87103
[LuxRays] Max. QBVH Depth: 25
[LuxRays] Creating 1 intersection device(s)
[LuxRays] Allocating intersection device 0: GeForce GTX 560 Ti (Type = OPENCL_GPU)
[sLG] OpenCL Devices used:
[sLG] [GeForce GTX 560 TiIntersect]
[LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] Ray buffer size: 3072Kbytes
[LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] Ray hits buffer size: 1024Kbytes
[LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] OpenCL image support is available
[LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] OpenCL max. image buffer size: 8192x8192
[LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] QBVH node image buffer size: 553x551
[LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] QBVH triangle image buffer size: 940x927
[LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] Enabled QBVH scene storage inside image
[LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] QBVH max. stack size: 24
[LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] QBVH Image Storage kernel work group size: 512
[LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] Suggested work group size: 512
[LuxRays] [OpenCL device::GeForce GTX 560 TiIntersect] Forced work group size: 64
[sLG] Configuring 1 CPU render threads
[sLG] [PathOCLRenderThread] OpenCL task count: 65536
[sLG] [PathOCLRenderThread] OpenCL max. page memory size: 262144Kbytes
[sLG] [PathOCLRenderThread::CompiledScene] Compile Camera
[sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry
[sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 14ms
[sLG] [PathOCLRenderThread::CompiledScene] Compile Materials
[sLG] [PathOCLRenderThread::CompiledScene] Compile Textures
[sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms
[sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms
[sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights
[sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 1ms
[sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight
[sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms
[sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight
[sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight
[sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps
[sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 0
[sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 0ms
[sLG] Starting 1 PathOCL render threads
[sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4835Kbytes
[sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes
[sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes
[sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes
[sLG] [PathOCLRenderThread::0] UVs buffer size: 1745Kbytes
[sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes
[sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes
[sLG] [PathOCLRenderThread::0] Textures buffer size: 324bytes
[sLG] [PathOCLRenderThread::0] Materials buffer size: 160bytes
[sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 28bytes
[sLG] [PathOCLRenderThread::0] Triangle AreaLights buffer size: 272bytes
[sLG] [PathOCLRenderThread::0] Triangle AreaLights index buffer size: 1018Kbytes
[sLG] [PathOCLRenderThread::0] Ray buffer size: 3072Kbytes
[sLG] [PathOCLRenderThread::0] RayHit buffer size: 1024Kbytes
[sLG] [PathOCLRenderThread::0] GPUTask buffer size: 23296Kbytes
[sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes
[sLG] [PathOCLRenderThread::0] SampleData buffer size: 512Kbytes
[sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 256Kbytes
[sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms
[sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=65536 -D PARAM_IMAGE_WIDTH=640 -D PARAM_IMAGE_HEIGHT=480 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=16 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_ENABLE_MAT_GLASS -D PARAM_HAS_PASSTHROUGH -D PARAM_DIRECT_LIGHT_SAMPLING -D PARAM_DL_LIGHT_COUNT=4 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=0
[sLG] [PathOCLRenderThread::0] Compiling kernels
[sLG] [PathOCLRenderThread::0] Kernels not cached
[sLG] [PathOCLRenderThread::0] Compiling Init Kernel
[sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel
[sLG] [PathOCLRenderThread::0] Kernels compilation time: 612ms
[sLG] [PathOCLRenderThread::0] Rendering thread ERROR: clFlush(CL_INVALID_COMMAND_QUEUE)
[sLG] =========================================================
[sLG] Unhandled exception
[sLG] Obtained 5 stack frames.
[sLG] 0
[sLG] 1
[sLG] 2
[sLG] 3
[sLG] 4
Abort trap: 6
localhost:SLG4_alpha victoriano$

Thanks, at least fermi OpenCL has , like AMD OpenCL probs with new OpenCL code (SLG4 uses newer OpenCL 1.1+1.2 features if available for the gpu type).

 

1. If you test SLG4 with fermi/kelper gpu please also crosstest if LUXMARK (SLG3 usage) works http://www.luxrender..._OSXIntel64.zip

2. Nvidia gpus like GTX 260, GTX 285 (less features for OpenCL) would be interesting, also 8800GT,9800GT

 

3. please also reprot OpenCL features (Opencl version usage) with attached OpenCLinfo.

 

OpenCL_info.zip

 

Looks like:

GA_EP35:~ andreasm$ /Volumes/Programme_1/OPENCL/oclinfo ; exit;
1 OpenCL platform found!
[Platform 0]
Name: Apple
Vendor: Apple
Version: OpenCL 1.2 (Dec 4 2012 18:26:30)
Profile: FULL_PROFILE
Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event

2 OpenCL devices found!
[Device 0]
Name: Intel(R) Core(TM)2 Duo CPU	 E7300 @ 2.66GHz
Vendor: Intel
Type: CPU
Device Version: OpenCL 1.2
...
[Device 1]
Name: GeForce 9600 GT
Vendor: NVIDIA
Type: GPU
Device Version: OpenCL 1.0
Driver Version: CLH 1.0
Compute Units: 8
Work Group Size: 512
Clock: 1680 MHz
Global Memory: 512 MB
Local Memory: 16 KB
Cache Size: 0 KB
Cache Line Size: 0 Bytes
Available: Yes
Double-Precision: No
Extensions:
cl_APPLE_SetMemObjectDestructor
cl_APPLE_ContextLoggingFunctions
cl_APPLE_clut
cl_APPLE_query_kernel_names
cl_APPLE_gl_sharing
cl_khr_gl_event
cl_khr_byte_addressable_store
cl_khr_global_int32_base_atomics
cl_khr_global_int32_extended_atomics
logout

It crashes with this:

[sLG] [PathOCLRenderThread::0] Rendering thread ERROR: clFlush(CL_INVALID_COMMAND_QUEUE)
[sLG] Unhandled exception
[sLG] Obtained 5 stack frames.
[sLG] 0
[sLG] 1
[sLG] 2
[sLG] 3
[sLG] 4
Abort trap: 6

 

Luxmark works ok.

 

 

OpenCLinfo:

1 OpenCL platform found!
[Platform 0]
Name: Apple
Vendor: Apple
Version: OpenCL 1.2 (Aug 24 2012 00:53:09)
Profile: FULL_PROFILE
Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event

2 OpenCL devices found!
[Device 0]
Name: Intel(R) Core(TM) i7-3820 CPU @ 3.60GHz
Vendor: Intel
Type: CPU
Device Version: OpenCL 1.2
Driver Version: 1.1
Compute Units: 8
Work Group Size: 1024
Clock: 4600 MHz
Global Memory (Total): 8192 MB
Global Memory (Host): 8192 MB
Global Memory (PCIe): 0 MB
Local Memory: 32 KB
Cache Size: 0.0625 KB
Cache Line Size: 10485760 Bytes
Available: Yes
Double-Precision: Yes
Extensions:
cl_APPLE_SetMemObjectDestructor
cl_APPLE_ContextLoggingFunctions
cl_APPLE_clut
cl_APPLE_query_kernel_names
cl_APPLE_gl_sharing
cl_khr_gl_event
cl_khr_fp64
cl_khr_global_int32_base_atomics
cl_khr_global_int32_extended_atomics
cl_khr_local_int32_base_atomics
cl_khr_local_int32_extended_atomics
cl_khr_byte_addressable_store
cl_khr_int64_base_atomics
cl_khr_int64_extended_atomics
cl_khr_3d_image_writes
cl_APPLE_fp64_basic_ops
cl_APPLE_fixed_alpha_channel_orders
[Device 1]
Name: GeForce GTX 580
Vendor: NVIDIA
Type: GPU
Device Version: OpenCL 1.1
Driver Version: CLH 1.0
Compute Units: 64
Work Group Size: 1024
Clock: 1880 MHz
Global Memory: 1536 MB
Local Memory: 48 KB
Cache Size: 0 KB
Cache Line Size: 0 Bytes
Available: Yes
Double-Precision: No
Extensions:
cl_APPLE_SetMemObjectDestructor
cl_APPLE_ContextLoggingFunctions
cl_APPLE_clut
cl_APPLE_query_kernel_names
cl_APPLE_gl_sharing
cl_khr_gl_event
cl_khr_byte_addressable_store
cl_khr_global_int32_base_atomics
cl_khr_global_int32_extended_atomics
cl_khr_local_int32_base_atomics
cl_khr_local_int32_extended_atomics
cl_APPLE_fp64_basic_ops

Hi mitch... so... i've attached oclinfo debug and a screenshot of luxmark test (i think it works fine)

tnx and cheers!

 

p.s. since at the moment i think there's a problem with attachments i've uploaded them to my dropbox, here they are...

 

https://dl.dropbox.com/u/8697187/oclinfo.rtf

 

https://dl.dropbox.com/u/8697187/Schermata%202013-01-16%20alle%2019.43.27.png

10.8.2 (12C3006) ATI-Radeon 5650M

Just as other ATI users report, too bright. Below is the log:

 

[sLG] Reading configuration file: scenes/luxball/luxball.cfg

[sLG] Configuration:

[sLG] image.width = 640

[sLG] image.height = 480

[sLG] batch.halttime = 0

[sLG] scene.file = scenes/luxball/luxball.scn

[sLG] scene.fieldofview = 45

[sLG] opencl.cpu.use = 0

[sLG] opencl.gpu.use = 1

[sLG] opencl.gpu.workgroup.size = 64

[sLG] screen.refresh.interval = 100

[sLG] film.gamma = 2.2

[sLG] path.maxdepth = 16

[sLG] path.russianroulette.depth = 5

[sLG] light.maxdepth = 16

[sLG] light.russianroulette.depth = 5

[LuxRays::SDL] Reading scene: scenes/luxball/luxball.scn

[LuxRays::SDL] Camera postion: Point[0.5, -1.4, 1.7]

[LuxRays::SDL] Camera target: Point[0, 0, 0.4]

[LuxRays::SDL] Material definition: shell

[LuxRays::SDL] Material definition: whitematte

[LuxRays::SDL] Material definition: luxtext

[LuxRays::SDL] Material definition: blacktext

[LuxRays::SDL] Material definition: whitelight

[LuxRays::SDL] The l1 object is a light sources with 2 triangles

[LuxRays::SDL] The l2 object is a light sources with 2 triangles

[LuxRays::SDL] PLY object count: 7

[LuxRays] OpenCL Platform 0: Apple

[LuxRays] Device 0 name: NativeThread-000

[LuxRays] Device 0 type: NATIVE_THREAD

[LuxRays] Device 0 compute units: 1

[LuxRays] Device 0 max allocable memory: 0MBytes

[LuxRays] Device 0 max allocable memory block size: 0MBytes

[LuxRays] Device 1 name: NativeThread-001

[LuxRays] Device 1 type: NATIVE_THREAD

[LuxRays] Device 1 compute units: 1

[LuxRays] Device 1 max allocable memory: 0MBytes

[LuxRays] Device 1 max allocable memory block size: 0MBytes

[LuxRays] Device 2 name: NativeThread-002

[LuxRays] Device 2 type: NATIVE_THREAD

[LuxRays] Device 2 compute units: 1

[LuxRays] Device 2 max allocable memory: 0MBytes

[LuxRays] Device 2 max allocable memory block size: 0MBytes

[LuxRays] Device 3 name: NativeThread-003

[LuxRays] Device 3 type: NATIVE_THREAD

[LuxRays] Device 3 compute units: 1

[LuxRays] Device 3 max allocable memory: 0MBytes

[LuxRays] Device 3 max allocable memory block size: 0MBytes

[LuxRays] Device 4 name: Intel® Core i3 CPU M 380 @ 2.53GHz

[LuxRays] Device 4 type: OPENCL_CPU

[LuxRays] Device 4 compute units: 4

[LuxRays] Device 4 max allocable memory: 8192MBytes

[LuxRays] Device 4 max allocable memory block size: 2048MBytes

[LuxRays] Device 5 name: ATI Radeon HD 5650

[LuxRays] Device 5 type: OPENCL_GPU

[LuxRays] Device 5 compute units: 5

[LuxRays] Device 5 max allocable memory: 1024MBytes

[LuxRays] Device 5 max allocable memory block size: 256MBytes

[LuxRays] Preprocessing DataSet

[LuxRays] Total vertex count: 223380

[LuxRays] Total triangle count: 260794

[LuxRays] Total vertices memory usage: 2617Kbytes

[LuxRays] Total triangles memory usage: 3056Kbytes

[LuxRays] Building QBVH, primitives: 260794, initial nodes: 21734

[LuxRays] QBVH completed with 43467/43468 nodes

[LuxRays] Total QBVH memory usage: 4754Kbytes

[LuxRays] Total QBVH QuadTriangle count: 87103

[LuxRays] Max. QBVH Depth: 25

[LuxRays] Creating 1 intersection device(s)

[LuxRays] Allocating intersection device 0: ATI Radeon HD 5650 (Type = OPENCL_GPU)

[sLG] OpenCL Devices used:

[sLG] [ATI Radeon HD 5650Intersect]

[LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] Ray buffer size: 3072Kbytes

[LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] Ray hits buffer size: 1024Kbytes

[LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] OpenCL image support is available

[LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] OpenCL max. image buffer size: 8192x8192

[LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] QBVH node image buffer size: 553x551

[LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] QBVH triangle image buffer size: 940x927

[LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] Enabled QBVH scene storage inside image

[LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] QBVH max. stack size: 24

[LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] QBVH Image Storage kernel work group size: 256

[LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] Suggested work group size: 256

[LuxRays] [OpenCL device::ATI Radeon HD 5650Intersect] Forced work group size: 64

[sLG] Configuring 1 CPU render threads

[sLG] [PathOCLRenderThread] OpenCL task count: 65536

[sLG] [PathOCLRenderThread] OpenCL max. page memory size: 262144Kbytes

[sLG] [PathOCLRenderThread::CompiledScene] Compile Camera

[sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry

[sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 13ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Materials

[sLG] [PathOCLRenderThread::CompiledScene] Compile Textures

[sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights

[sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight

[sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps

[sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 0

[sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 0ms

[sLG] Starting 1 PathOCL render threads

[sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4835Kbytes

[sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes

[sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes

[sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] UVs buffer size: 1745Kbytes

[sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes

[sLG] [PathOCLRenderThread::0] Textures buffer size: 324bytes

[sLG] [PathOCLRenderThread::0] Materials buffer size: 160bytes

[sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 28bytes

[sLG] [PathOCLRenderThread::0] Triangle AreaLights buffer size: 272bytes

[sLG] [PathOCLRenderThread::0] Triangle AreaLights index buffer size: 1018Kbytes

[sLG] [PathOCLRenderThread::0] Ray buffer size: 3072Kbytes

[sLG] [PathOCLRenderThread::0] RayHit buffer size: 1024Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask buffer size: 23296Kbytes

[sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes

[sLG] [PathOCLRenderThread::0] SampleData buffer size: 512Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 256Kbytes

[sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms

[sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=65536 -D PARAM_IMAGE_WIDTH=640 -D PARAM_IMAGE_HEIGHT=480 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=16 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_ENABLE_MAT_GLASS -D PARAM_HAS_PASSTHROUGH -D PARAM_DIRECT_LIGHT_SAMPLING -D PARAM_DL_LIGHT_COUNT=4 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=0

[sLG] [PathOCLRenderThread::0] Compiling kernels

[sLG] [PathOCLRenderThread::0] Kernels not cached

[sLG] [PathOCLRenderThread::0] Compiling Init Kernel

[sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel

[sLG] [PathOCLRenderThread::0] Kernels compilation time: 988ms

[sLG] Done.

it crashed

Dwis-Mac-Pro:SLG4_alpha regae$ ./slg4
[sLG] Reading configuration file: scenes/luxball/luxball.cfg
[sLG] Configuration:
[sLG]   image.width = 640
[sLG]   image.height = 480
[sLG]   batch.halttime = 0
[sLG]   scene.file = scenes/luxball/luxball.scn
[sLG]   scene.fieldofview = 45
[sLG]   opencl.cpu.use = 0
[sLG]   opencl.gpu.use = 1
[sLG]   opencl.gpu.workgroup.size = 64
[sLG]   screen.refresh.interval = 100
[sLG]   film.gamma = 2.2
[sLG]   path.maxdepth = 16
[sLG]   path.russianroulette.depth = 5
[sLG]   light.maxdepth = 16
[sLG]   light.russianroulette.depth = 5
[LuxRays::SDL] Reading scene: scenes/luxball/luxball.scn
[LuxRays::SDL] Camera postion: Point[0.5, -1.4, 1.7]
[LuxRays::SDL] Camera target: Point[0, 0, 0.4]
[LuxRays::SDL] Material definition: shell
[LuxRays::SDL] Material definition: whitematte
[LuxRays::SDL] Material definition: luxtext
[LuxRays::SDL] Material definition: blacktext
[LuxRays::SDL] Material definition: whitelight
[LuxRays::SDL] The l1 object is a light sources with 2 triangles
[LuxRays::SDL] The l2 object is a light sources with 2 triangles
[LuxRays::SDL] PLY object count: 7
[LuxRays] OpenCL Platform 0: Apple
[LuxRays] Device 0 name: NativeThread-000
[LuxRays] Device 0 type: NATIVE_THREAD
[LuxRays] Device 0 compute units: 1
[LuxRays] Device 0 max allocable memory: 0MBytes
[LuxRays] Device 0 max allocable memory block size: 0MBytes
[LuxRays] Device 1 name: NativeThread-001
[LuxRays] Device 1 type: NATIVE_THREAD
[LuxRays] Device 1 compute units: 1
[LuxRays] Device 1 max allocable memory: 0MBytes
[LuxRays] Device 1 max allocable memory block size: 0MBytes
[LuxRays] Device 2 name: NativeThread-002
[LuxRays] Device 2 type: NATIVE_THREAD
[LuxRays] Device 2 compute units: 1
[LuxRays] Device 2 max allocable memory: 0MBytes
[LuxRays] Device 2 max allocable memory block size: 0MBytes
[LuxRays] Device 3 name: NativeThread-003
[LuxRays] Device 3 type: NATIVE_THREAD
[LuxRays] Device 3 compute units: 1
[LuxRays] Device 3 max allocable memory: 0MBytes
[LuxRays] Device 3 max allocable memory block size: 0MBytes
[LuxRays] Device 4 name: NativeThread-004
[LuxRays] Device 4 type: NATIVE_THREAD
[LuxRays] Device 4 compute units: 1
[LuxRays] Device 4 max allocable memory: 0MBytes
[LuxRays] Device 4 max allocable memory block size: 0MBytes
[LuxRays] Device 5 name: NativeThread-005
[LuxRays] Device 5 type: NATIVE_THREAD
[LuxRays] Device 5 compute units: 1
[LuxRays] Device 5 max allocable memory: 0MBytes
[LuxRays] Device 5 max allocable memory block size: 0MBytes
[LuxRays] Device 6 name: NativeThread-006
[LuxRays] Device 6 type: NATIVE_THREAD
[LuxRays] Device 6 compute units: 1
[LuxRays] Device 6 max allocable memory: 0MBytes
[LuxRays] Device 6 max allocable memory block size: 0MBytes
[LuxRays] Device 7 name: NativeThread-007
[LuxRays] Device 7 type: NATIVE_THREAD
[LuxRays] Device 7 compute units: 1
[LuxRays] Device 7 max allocable memory: 0MBytes
[LuxRays] Device 7 max allocable memory block size: 0MBytes
[LuxRays] Device 8 name: Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz
[LuxRays] Device 8 type: OPENCL_CPU
[LuxRays] Device 8 compute units: 8
[LuxRays] Device 8 max allocable memory: 8192MBytes
[LuxRays] Device 8 max allocable memory block size: 2048MBytes
[LuxRays] Device 9 name: GeForce GTX 650 Ti
[LuxRays] Device 9 type: OPENCL_GPU
[LuxRays] Device 9 compute units: 12
[LuxRays] Device 9 max allocable memory: 2048MBytes
[LuxRays] Device 9 max allocable memory block size: 512MBytes
[LuxRays] Preprocessing DataSet
[LuxRays] Total vertex count: 223380
[LuxRays] Total triangle count: 260794
[LuxRays] Total vertices memory usage: 2617Kbytes
[LuxRays] Total triangles memory usage: 3056Kbytes
[LuxRays] Building QBVH, primitives: 260794, initial nodes: 21734
[LuxRays] QBVH completed with 43467/43468 nodes
[LuxRays] Total QBVH memory usage: 4754Kbytes
[LuxRays] Total QBVH QuadTriangle count: 87103
[LuxRays] Max. QBVH Depth: 25
[LuxRays] Creating 1 intersection device(s)
[LuxRays] Allocating intersection device 0: GeForce GTX 650 Ti (Type = OPENCL_GPU)
[sLG] OpenCL Devices used:
[sLG] [GeForce GTX 650 TiIntersect]
[LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] Ray buffer size: 3072Kbytes
[LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] Ray hits buffer size: 1024Kbytes
[LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] OpenCL image support is available
[LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] OpenCL max. image buffer size: 8192x8192
[LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] QBVH node image buffer size: 553x551
[LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] QBVH triangle image buffer size: 940x927
[LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] Enabled QBVH scene storage inside image
[LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] QBVH max. stack size: 24
[LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] QBVH Image Storage kernel work group size: 1024
[LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] Suggested work group size: 1024
[LuxRays] [OpenCL device::GeForce GTX 650 TiIntersect] Forced work group size: 64
[sLG] Configuring 1 CPU render threads
[sLG] [PathOCLRenderThread] OpenCL task count: 65536
[sLG] [PathOCLRenderThread] OpenCL max. page memory size: 524288Kbytes
[sLG] [PathOCLRenderThread::CompiledScene] Compile Camera
[sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry
[sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 8ms
[sLG] [PathOCLRenderThread::CompiledScene] Compile Materials
[sLG] [PathOCLRenderThread::CompiledScene] Compile Textures
[sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms
[sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms
[sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights
[sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms
[sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight
[sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms
[sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight
[sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight
[sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps
[sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 0
[sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 0ms
[sLG] Starting 1 PathOCL render threads
[sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4835Kbytes
[sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes
[sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes
[sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes
[sLG] [PathOCLRenderThread::0] UVs buffer size: 1745Kbytes
[sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes
[sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes
[sLG] [PathOCLRenderThread::0] Textures buffer size: 324bytes
[sLG] [PathOCLRenderThread::0] Materials buffer size: 160bytes
[sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 28bytes
[sLG] [PathOCLRenderThread::0] Triangle AreaLights buffer size: 272bytes
[sLG] [PathOCLRenderThread::0] Triangle AreaLights index buffer size: 1018Kbytes
[sLG] [PathOCLRenderThread::0] Ray buffer size: 3072Kbytes
[sLG] [PathOCLRenderThread::0] RayHit buffer size: 1024Kbytes
[sLG] [PathOCLRenderThread::0] GPUTask buffer size: 23296Kbytes
[sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes
[sLG] [PathOCLRenderThread::0] SampleData buffer size: 512Kbytes
[sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 256Kbytes
[sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms
[sLG] [PathOCLRenderThread::0] Defined symbols:  -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=65536 -D PARAM_IMAGE_WIDTH=640 -D PARAM_IMAGE_HEIGHT=480 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=16 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_ENABLE_MAT_GLASS -D PARAM_HAS_PASSTHROUGH -D PARAM_DIRECT_LIGHT_SAMPLING -D PARAM_DL_LIGHT_COUNT=4 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=0
[sLG] [PathOCLRenderThread::0] Compiling kernels
[sLG] [PathOCLRenderThread::0] Kernels not cached
[sLG] [PathOCLRenderThread::0] Compiling Init Kernel
[sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel
[sLG] [PathOCLRenderThread::0] Kernels compilation time: 294ms
[sLG] [PathOCLRenderThread::0] Rendering thread ERROR: clFlush(CL_INVALID_COMMAND_QUEUE)
[sLG] =========================================================
[sLG] Unhandled exception
[sLG] Obtained 5 stack frames.
[sLG]   0
[sLG]   1
[sLG]   2
[sLG]   3
[sLG]   4
Abort trap: 6

 

here is the openclinfo

1 OpenCL platform found!
[Platform 0]
Name:    Apple
Vendor:   Apple
Version:   OpenCL 1.2 (Aug 24 2012 00:53:09)
Profile:   FULL_PROFILE
Extensions:    cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event

2 OpenCL devices found!
[Device 0]
Name:    Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz
Vendor:   Intel
Type:    CPU
Device Version:  OpenCL 1.2
Driver Version:  1.1
Compute Units:   8
Work Group Size:  1024
Clock:    3400 MHz
Global Memory (Total):  8192 MB
Global Memory (Host):  8192 MB
Global Memory (PCIe):  0 MB
Local Memory:   32 KB
Cache Size:   0.0625 KB
Cache Line Size:  8388608 Bytes
Available:   Yes
Double-Precision:  Yes
Extensions:
   cl_APPLE_SetMemObjectDestructor
   cl_APPLE_ContextLoggingFunctions
   cl_APPLE_clut
   cl_APPLE_query_kernel_names
   cl_APPLE_gl_sharing
   cl_khr_gl_event
   cl_khr_fp64
   cl_khr_global_int32_base_atomics
   cl_khr_global_int32_extended_atomics
   cl_khr_local_int32_base_atomics
   cl_khr_local_int32_extended_atomics
   cl_khr_byte_addressable_store
   cl_khr_int64_base_atomics
   cl_khr_int64_extended_atomics
   cl_khr_3d_image_writes
   cl_APPLE_fp64_basic_ops
   cl_APPLE_fixed_alpha_channel_orders
[Device 1]
Name:    GeForce GTX 650 Ti
Vendor:   NVIDIA
Type:    GPU
Device Version:  OpenCL 1.1
Driver Version:  CLH 1.0
Compute Units:   12
Work Group Size:  1024
Clock:    0 MHz
Global Memory:   2048 MB
Local Memory:   48 KB
Cache Size:   0 KB
Cache Line Size:  0 Bytes
Available:   Yes
Double-Precision:  No
Extensions:
   cl_APPLE_SetMemObjectDestructor
   cl_APPLE_ContextLoggingFunctions
   cl_APPLE_clut
   cl_APPLE_query_kernel_names
   cl_APPLE_gl_sharing
   cl_khr_gl_event
   cl_khr_byte_addressable_store
   cl_khr_global_int32_base_atomics
   cl_khr_global_int32_extended_atomics
   cl_khr_local_int32_base_atomics
   cl_khr_local_int32_extended_atomics
   cl_APPLE_fp64_basic_ops
logout

  • 4 weeks later...

Jens made an new special test build with test special test scene (green ball, green ground).

 

http://www.jensverwi..._slg4_proof.zip

 

unzip zip somewhere

start terminal

cd folderpath of main folder/OSX_AMD_slg4_proof

 

test the 2 builds OK and NOT_OK:

 

 

./test_mac_fix_ok/slg4 -D opencl.devices.select 01 -D sampler.type SOBOL -D opencl.task.count 262144 -w 512 -e 512 scenes/luxball/luxball-hdr.cfg

 

 

./test_mac_fix_not_ok/slg4 -D opencl.devices.select 01 -D sampler.type SOBOL -D opencl.task.count 262144 -w 512 -e 512 scenes/luxball/luxball-hdr.cfg

(you can drag & crop those command lines obove into the already opened terminal window to start the tests - dont forget to set working folder by cd before running)

 

 

Normally - if scene renders correct - it looks like:

Bildschirmfoto 2013-02-10 um 18.38.31.jpg

OK renders correctly and produces:

post-1111706-0-15548200-1360520937_thumb.png

 

[sLG] Reading configuration file: scenes/luxball/luxball-hdr.cfg

[sLG] Configuration:

[sLG] image.width = 512

[sLG] image.height = 512

[sLG] batch.halttime = 0

[sLG] scene.file = scenes/luxball/luxball-hdr.scn

[sLG] opencl.cpu.use = 0

[sLG] opencl.gpu.use = 1

[sLG] opencl.gpu.workgroup.size = 64

[sLG] screen.refresh.interval = 100

[sLG] path.maxdepth = 8

[sLG] path.russianroulette.depth = 5

[sLG] opencl.devices.select = 01

[sLG] sampler.type = SOBOL

[sLG] opencl.task.count = 262144

[LuxRays::SDL] Reading scene: scenes/luxball/luxball-hdr.scn

[LuxRays::SDL] Camera postion: Point[0.6, -1.7, 0.4]

[LuxRays::SDL] Camera target: Point[0, 0, 0.4]

[LuxRays::SDL] Material definition: shell

[LuxRays::SDL] Material definition: whitematte

[LuxRays::SDL] Material definition: luxtext

[LuxRays::SDL] Material definition: blacktext

[LuxRays::SDL] PLY object count: 5

[LuxRays::SDL] Reading texture map: scenes/simple-mat/arch.exr

[LuxRays::SDL] HDR RGB (128bit) texture map size: 1024x512 (6144Kbytes)

[LuxRays] OpenCL Platform 0: Apple

[LuxRays] Device 0 name: NativeThread-000

[LuxRays] Device 0 type: NATIVE_THREAD

[LuxRays] Device 0 compute units: 1

[LuxRays] Device 0 max allocable memory: 0MBytes

[LuxRays] Device 0 max allocable memory block size: 0MBytes

[LuxRays] Device 1 name: NativeThread-001

[LuxRays] Device 1 type: NATIVE_THREAD

[LuxRays] Device 1 compute units: 1

[LuxRays] Device 1 max allocable memory: 0MBytes

[LuxRays] Device 1 max allocable memory block size: 0MBytes

[LuxRays] Device 2 name: NativeThread-002

[LuxRays] Device 2 type: NATIVE_THREAD

[LuxRays] Device 2 compute units: 1

[LuxRays] Device 2 max allocable memory: 0MBytes

[LuxRays] Device 2 max allocable memory block size: 0MBytes

[LuxRays] Device 3 name: NativeThread-003

[LuxRays] Device 3 type: NATIVE_THREAD

[LuxRays] Device 3 compute units: 1

[LuxRays] Device 3 max allocable memory: 0MBytes

[LuxRays] Device 3 max allocable memory block size: 0MBytes

[LuxRays] Device 4 name: NativeThread-004

[LuxRays] Device 4 type: NATIVE_THREAD

[LuxRays] Device 4 compute units: 1

[LuxRays] Device 4 max allocable memory: 0MBytes

[LuxRays] Device 4 max allocable memory block size: 0MBytes

[LuxRays] Device 5 name: NativeThread-005

[LuxRays] Device 5 type: NATIVE_THREAD

[LuxRays] Device 5 compute units: 1

[LuxRays] Device 5 max allocable memory: 0MBytes

[LuxRays] Device 5 max allocable memory block size: 0MBytes

[LuxRays] Device 6 name: NativeThread-006

[LuxRays] Device 6 type: NATIVE_THREAD

[LuxRays] Device 6 compute units: 1

[LuxRays] Device 6 max allocable memory: 0MBytes

[LuxRays] Device 6 max allocable memory block size: 0MBytes

[LuxRays] Device 7 name: NativeThread-007

[LuxRays] Device 7 type: NATIVE_THREAD

[LuxRays] Device 7 compute units: 1

[LuxRays] Device 7 max allocable memory: 0MBytes

[LuxRays] Device 7 max allocable memory block size: 0MBytes

[LuxRays] Device 8 name: Intel® Core™ i7-3820 CPU @ 3.60GHz

[LuxRays] Device 8 type: OPENCL_CPU

[LuxRays] Device 8 compute units: 8

[LuxRays] Device 8 max allocable memory: 8192MBytes

[LuxRays] Device 8 max allocable memory block size: 2048MBytes

[LuxRays] Device 9 name: GeForce GTX 580

[LuxRays] Device 9 type: OPENCL_GPU

[LuxRays] Device 9 compute units: 16

[LuxRays] Device 9 max allocable memory: 1536MBytes

[LuxRays] Device 9 max allocable memory block size: 384MBytes

[LuxRays] Preprocessing DataSet

[LuxRays] Total vertex count: 223352

[LuxRays] Total triangle count: 260780

[LuxRays] Total vertices memory usage: 2617Kbytes

[LuxRays] Total triangles memory usage: 3056Kbytes

[LuxRays] Building QBVH, primitives: 260780, initial nodes: 21733

[LuxRays] QBVH completed with 43609/86932 nodes

[LuxRays] Total QBVH memory usage: 4769Kbytes

[LuxRays] Total QBVH QuadTriangle count: 87045

[LuxRays] Max. QBVH Depth: 24

[LuxRays] Creating 1 intersection device(s)

[LuxRays] Allocating intersection device 0: GeForce GTX 580 (Type = OPENCL_GPU)

[sLG] OpenCL Devices used:

[sLG] [GeForce GTX 580Intersect]

[sLG] Device OpenCL version: OpenCL 1.1

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Ray buffer size: 3072Kbytes

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Ray hits buffer size: 1024Kbytes

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] OpenCL image support is available

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] OpenCL max. image buffer size: 8192x8192

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH node image buffer size: 553x553

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH triangle image buffer size: 940x927

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Enabled QBVH scene storage inside image

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH max. stack size: 24

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH Image Storage kernel work group size: 512

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Suggested work group size: 512

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Forced work group size: 64

[sLG] Configuring 1 CPU render threads

[sLG] [PathOCLRenderThread] OpenCL task count: 262144

[sLG] [PathOCLRenderThread] OpenCL max. page memory size: 393216Kbytes

[sLG] [PathOCLRenderThread::CompiledScene] Compile Camera

[sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry

[sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 6ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Materials

[sLG] [PathOCLRenderThread::CompiledScene] Compile Textures

[sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights

[sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight

[sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps

[sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 1

[sLG] [PathOCLRenderThread::CompiledScene] RGB channel page 0 size: 6144Kbytes

[sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 1ms

[sLG] Starting 1 PathOCL render threads

[sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4128Kbytes

[sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes

[sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes

[sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] UVs buffer size: 1744Kbytes

[sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes

[sLG] [PathOCLRenderThread::0] ImageMaps description buffer size: 20bytes

[sLG] [PathOCLRenderThread::0] ImageMaps buffer size: 6144Kbytes

[sLG] [PathOCLRenderThread::0] Textures buffer size: 144bytes

[sLG] [PathOCLRenderThread::0] Materials buffer size: 192bytes

[sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 20bytes

[sLG] [PathOCLRenderThread::0] InfiniteLight buffer size: 52bytes

[sLG] [PathOCLRenderThread::0] Ray buffer size: 12288Kbytes

[sLG] [PathOCLRenderThread::0] RayHit buffer size: 4096Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask buffer size: 53248Kbytes

[sLG] [PathOCLRenderThread::0] Sample dimensions: 28

[sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes

[sLG] [PathOCLRenderThread::0] SampleData buffer size: 2048Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 1024Kbytes

[sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms

[sLG] [PathOCLRenderThread::0] Sobol table size: 896

[sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=262144 -D PARAM_IMAGE_WIDTH=512 -D PARAM_IMAGE_HEIGHT=512 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=8 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_CAMERA_HAS_DOF -D PARAM_HAS_INFINITELIGHT -D PARAM_HAS_IMAGEMAPS -D PARAM_IMAGEMAPS_PAGE_0 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=2 -D PARAM_SAMPLER_SOBOL_MAXDEPTH=8

[sLG] [PathOCLRenderThread::0] Compiling kernels

[sLG] [PathOCLRenderThread::0] Kernels not cached

[sLG] [PathOCLRenderThread::0] Compiling Init Kernel

[sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel

[sLG] [PathOCLRenderThread::0] Kernels compilation time: 111ms

 

 

This is NOT_OK:

post-1111706-0-37824000-1360520849_thumb.png

 

[sLG] Reading configuration file: scenes/luxball/luxball-hdr.cfg

[sLG] Configuration:

[sLG] image.width = 512

[sLG] image.height = 512

[sLG] batch.halttime = 0

[sLG] scene.file = scenes/luxball/luxball-hdr.scn

[sLG] opencl.cpu.use = 0

[sLG] opencl.gpu.use = 1

[sLG] opencl.gpu.workgroup.size = 64

[sLG] screen.refresh.interval = 100

[sLG] path.maxdepth = 8

[sLG] path.russianroulette.depth = 5

[sLG] opencl.devices.select = 01

[sLG] sampler.type = SOBOL

[sLG] opencl.task.count = 262144

[LuxRays::SDL] Reading scene: scenes/luxball/luxball-hdr.scn

[LuxRays::SDL] Camera postion: Point[0.6, -1.7, 0.4]

[LuxRays::SDL] Camera target: Point[0, 0, 0.4]

[LuxRays::SDL] Material definition: shell

[LuxRays::SDL] Material definition: whitematte

[LuxRays::SDL] Material definition: luxtext

[LuxRays::SDL] Material definition: blacktext

[LuxRays::SDL] PLY object count: 5

[LuxRays::SDL] Reading texture map: scenes/simple-mat/arch.exr

[LuxRays::SDL] HDR RGB (128bit) texture map size: 1024x512 (6144Kbytes)

[LuxRays] OpenCL Platform 0: Apple

[LuxRays] Device 0 name: NativeThread-000

[LuxRays] Device 0 type: NATIVE_THREAD

[LuxRays] Device 0 compute units: 1

[LuxRays] Device 0 max allocable memory: 0MBytes

[LuxRays] Device 0 max allocable memory block size: 0MBytes

[LuxRays] Device 1 name: NativeThread-001

[LuxRays] Device 1 type: NATIVE_THREAD

[LuxRays] Device 1 compute units: 1

[LuxRays] Device 1 max allocable memory: 0MBytes

[LuxRays] Device 1 max allocable memory block size: 0MBytes

[LuxRays] Device 2 name: NativeThread-002

[LuxRays] Device 2 type: NATIVE_THREAD

[LuxRays] Device 2 compute units: 1

[LuxRays] Device 2 max allocable memory: 0MBytes

[LuxRays] Device 2 max allocable memory block size: 0MBytes

[LuxRays] Device 3 name: NativeThread-003

[LuxRays] Device 3 type: NATIVE_THREAD

[LuxRays] Device 3 compute units: 1

[LuxRays] Device 3 max allocable memory: 0MBytes

[LuxRays] Device 3 max allocable memory block size: 0MBytes

[LuxRays] Device 4 name: NativeThread-004

[LuxRays] Device 4 type: NATIVE_THREAD

[LuxRays] Device 4 compute units: 1

[LuxRays] Device 4 max allocable memory: 0MBytes

[LuxRays] Device 4 max allocable memory block size: 0MBytes

[LuxRays] Device 5 name: NativeThread-005

[LuxRays] Device 5 type: NATIVE_THREAD

[LuxRays] Device 5 compute units: 1

[LuxRays] Device 5 max allocable memory: 0MBytes

[LuxRays] Device 5 max allocable memory block size: 0MBytes

[LuxRays] Device 6 name: NativeThread-006

[LuxRays] Device 6 type: NATIVE_THREAD

[LuxRays] Device 6 compute units: 1

[LuxRays] Device 6 max allocable memory: 0MBytes

[LuxRays] Device 6 max allocable memory block size: 0MBytes

[LuxRays] Device 7 name: NativeThread-007

[LuxRays] Device 7 type: NATIVE_THREAD

[LuxRays] Device 7 compute units: 1

[LuxRays] Device 7 max allocable memory: 0MBytes

[LuxRays] Device 7 max allocable memory block size: 0MBytes

[LuxRays] Device 8 name: Intel® Core™ i7-3820 CPU @ 3.60GHz

[LuxRays] Device 8 type: OPENCL_CPU

[LuxRays] Device 8 compute units: 8

[LuxRays] Device 8 max allocable memory: 8192MBytes

[LuxRays] Device 8 max allocable memory block size: 2048MBytes

[LuxRays] Device 9 name: GeForce GTX 580

[LuxRays] Device 9 type: OPENCL_GPU

[LuxRays] Device 9 compute units: 16

[LuxRays] Device 9 max allocable memory: 1536MBytes

[LuxRays] Device 9 max allocable memory block size: 384MBytes

[LuxRays] Preprocessing DataSet

[LuxRays] Total vertex count: 223352

[LuxRays] Total triangle count: 260780

[LuxRays] Total vertices memory usage: 2617Kbytes

[LuxRays] Total triangles memory usage: 3056Kbytes

[LuxRays] Building QBVH, primitives: 260780, initial nodes: 21733

[LuxRays] QBVH completed with 43609/86932 nodes

[LuxRays] Total QBVH memory usage: 4769Kbytes

[LuxRays] Total QBVH QuadTriangle count: 87045

[LuxRays] Max. QBVH Depth: 24

[LuxRays] Creating 1 intersection device(s)

[LuxRays] Allocating intersection device 0: GeForce GTX 580 (Type = OPENCL_GPU)

[sLG] OpenCL Devices used:

[sLG] [GeForce GTX 580Intersect]

[sLG] Device OpenCL version: OpenCL 1.1

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Ray buffer size: 3072Kbytes

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Ray hits buffer size: 1024Kbytes

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] OpenCL image support is available

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] OpenCL max. image buffer size: 8192x8192

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH node image buffer size: 553x553

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH triangle image buffer size: 940x927

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Enabled QBVH scene storage inside image

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH max. stack size: 24

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH Image Storage kernel work group size: 512

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Suggested work group size: 512

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Forced work group size: 64

[sLG] Configuring 1 CPU render threads

[sLG] [PathOCLRenderThread] OpenCL task count: 262144

[sLG] [PathOCLRenderThread] OpenCL max. page memory size: 393216Kbytes

[sLG] [PathOCLRenderThread::CompiledScene] Compile Camera

[sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry

[sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 6ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Materials

[sLG] [PathOCLRenderThread::CompiledScene] Compile Textures

[sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights

[sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight

[sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps

[sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 1

[sLG] [PathOCLRenderThread::CompiledScene] RGB channel page 0 size: 6144Kbytes

[sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 1ms

[sLG] Starting 1 PathOCL render threads

[sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4128Kbytes

[sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes

[sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes

[sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] UVs buffer size: 1744Kbytes

[sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes

[sLG] [PathOCLRenderThread::0] ImageMaps description buffer size: 20bytes

[sLG] [PathOCLRenderThread::0] ImageMaps buffer size: 6144Kbytes

[sLG] [PathOCLRenderThread::0] Textures buffer size: 144bytes

[sLG] [PathOCLRenderThread::0] Materials buffer size: 192bytes

[sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 20bytes

[sLG] [PathOCLRenderThread::0] InfiniteLight buffer size: 52bytes

[sLG] [PathOCLRenderThread::0] Ray buffer size: 12288Kbytes

[sLG] [PathOCLRenderThread::0] RayHit buffer size: 4096Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask buffer size: 53248Kbytes

[sLG] [PathOCLRenderThread::0] Sample dimensions: 28

[sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes

[sLG] [PathOCLRenderThread::0] SampleData buffer size: 2048Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 1024Kbytes

[sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms

[sLG] [PathOCLRenderThread::0] Sobol table size: 896

[sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=262144 -D PARAM_IMAGE_WIDTH=512 -D PARAM_IMAGE_HEIGHT=512 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=8 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_CAMERA_HAS_DOF -D PARAM_HAS_INFINITELIGHT -D PARAM_HAS_IMAGEMAPS -D PARAM_IMAGEMAPS_PAGE_0 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=2 -D PARAM_SAMPLER_SOBOL_MAXDEPTH=8

[sLG] [PathOCLRenderThread::0] Compiling kernels

[sLG] [PathOCLRenderThread::0] Kernels not cached

[sLG] [PathOCLRenderThread::0] Compiling Init Kernel

[sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel

[sLG] [PathOCLRenderThread::0] Kernels compilation time: 111ms

 

1) test_mac_fix_ok: OK

2) test_mac_fix_not_ok: NOT OK :P

post-129936-0-56036300-1360521042_thumb.jpg

 

 

 

> alex@box [~/Downloads/OSX_AMD_slg4_proof] ./test_mac_fix_not_ok/slg4 -D opencl.devices.select 01 -D sampler.type SOBOL -D opencl.task.count 262144 -w 512 -e 512 scenes/luxball/luxball-hdr.cfg
[sLG] Reading configuration file: scenes/luxball/luxball-hdr.cfg
[sLG] Configuration: 
[sLG]   image.width = 512
[sLG]   image.height = 512
[sLG]   batch.halttime = 0
[sLG]   scene.file = scenes/luxball/luxball-hdr.scn
[sLG]   opencl.cpu.use = 0
[sLG]   opencl.gpu.use = 1
[sLG]   opencl.gpu.workgroup.size = 64
[sLG]   screen.refresh.interval = 100
[sLG]   path.maxdepth = 8
[sLG]   path.russianroulette.depth = 5
[sLG]   opencl.devices.select = 01
[sLG]   sampler.type = SOBOL
[sLG]   opencl.task.count = 262144
[LuxRays::SDL] Reading scene: scenes/luxball/luxball-hdr.scn
[LuxRays::SDL] Camera postion: Point[0.6, -1.7, 0.4]
[LuxRays::SDL] Camera target: Point[0, 0, 0.4]
[LuxRays::SDL] Material definition: shell
[LuxRays::SDL] Material definition: whitematte
[LuxRays::SDL] Material definition: luxtext
[LuxRays::SDL] Material definition: blacktext
[LuxRays::SDL] PLY object count: 5
[LuxRays::SDL] Reading texture map: scenes/simple-mat/arch.exr
[LuxRays::SDL] HDR RGB (128bit) texture map size: 1024x512 (6144Kbytes)
[LuxRays] OpenCL Platform 0: Apple
[LuxRays] Device 0 name: NativeThread-000
[LuxRays] Device 0 type: NATIVE_THREAD
[LuxRays] Device 0 compute units: 1
[LuxRays] Device 0 max allocable memory: 0MBytes
[LuxRays] Device 0 max allocable memory block size: 0MBytes
[LuxRays] Device 1 name: NativeThread-001
[LuxRays] Device 1 type: NATIVE_THREAD
[LuxRays] Device 1 compute units: 1
[LuxRays] Device 1 max allocable memory: 0MBytes
[LuxRays] Device 1 max allocable memory block size: 0MBytes
[LuxRays] Device 2 name: NativeThread-002
[LuxRays] Device 2 type: NATIVE_THREAD
[LuxRays] Device 2 compute units: 1
[LuxRays] Device 2 max allocable memory: 0MBytes
[LuxRays] Device 2 max allocable memory block size: 0MBytes
[LuxRays] Device 3 name: NativeThread-003
[LuxRays] Device 3 type: NATIVE_THREAD
[LuxRays] Device 3 compute units: 1
[LuxRays] Device 3 max allocable memory: 0MBytes
[LuxRays] Device 3 max allocable memory block size: 0MBytes
[LuxRays] Device 4 name: Intel(R) Core(TM) i5-3570K CPU @ 3.40GHz
[LuxRays] Device 4 type: OPENCL_CPU
[LuxRays] Device 4 compute units: 4
[LuxRays] Device 4 max allocable memory: 8192MBytes
[LuxRays] Device 4 max allocable memory block size: 2048MBytes
[LuxRays] Device 5 name: AMD Radeon HD Pitcairn XT Prototype Compute Engine
[LuxRays] Device 5 type: OPENCL_GPU
[LuxRays] Device 5 compute units: 20
[LuxRays] Device 5 max allocable memory: 1024MBytes
[LuxRays] Device 5 max allocable memory block size: 256MBytes
[LuxRays] Preprocessing DataSet
[LuxRays] Total vertex count: 223352
[LuxRays] Total triangle count: 260780
[LuxRays] Total vertices memory usage: 2617Kbytes
[LuxRays] Total triangles memory usage: 3056Kbytes
[LuxRays] Building QBVH, primitives: 260780, initial nodes: 21733
[LuxRays] QBVH completed with 43609/86932 nodes
[LuxRays] Total QBVH memory usage: 4769Kbytes
[LuxRays] Total QBVH QuadTriangle count: 87045
[LuxRays] Max. QBVH Depth: 24
[LuxRays] Creating 1 intersection device(s)
[LuxRays] Allocating intersection device 0: AMD Radeon HD Pitcairn XT Prototype Compute Engine (Type = OPENCL_GPU)
[sLG] OpenCL Devices used:
[sLG] [AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect]
[sLG]   Device OpenCL version: OpenCL 1.1 
[LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] Ray buffer size: 3072Kbytes
[LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] Ray hits buffer size: 1024Kbytes
[LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] OpenCL image support is available
[LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] OpenCL max. image buffer size: 8192x8192
[LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] QBVH node image buffer size: 553x553
[LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] QBVH triangle image buffer size: 940x927
[LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] Enabled QBVH scene storage inside image
[LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] QBVH max. stack size: 24
[LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] QBVH Image Storage kernel work group size: 256
[LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] Suggested work group size: 256
[LuxRays] [OpenCL device::AMD Radeon HD Pitcairn XT Prototype Compute EngineIntersect] Forced work group size: 64
[sLG] Configuring 1 CPU render threads
[sLG] [PathOCLRenderThread] OpenCL task count: 262144
[sLG] [PathOCLRenderThread] OpenCL max. page memory size: 262144Kbytes
[sLG] [PathOCLRenderThread::CompiledScene] Compile Camera
[sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry
[sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 7ms
[sLG] [PathOCLRenderThread::CompiledScene] Compile Materials
[sLG] [PathOCLRenderThread::CompiledScene] Compile Textures
[sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms
[sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms
[sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights
[sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms
[sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight
[sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms
[sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight
[sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight
[sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps
[sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 1
[sLG] [PathOCLRenderThread::CompiledScene]  RGB channel page 0 size: 6144Kbytes
[sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 1ms
[sLG] Starting 1 PathOCL render threads
[sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4128Kbytes
[sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes
[sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes
[sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes
[sLG] [PathOCLRenderThread::0] UVs buffer size: 1744Kbytes
[sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes
[sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes
[sLG] [PathOCLRenderThread::0] ImageMaps description buffer size: 20bytes
[sLG] [PathOCLRenderThread::0] ImageMaps buffer size: 6144Kbytes
[sLG] [PathOCLRenderThread::0] Textures buffer size: 144bytes
[sLG] [PathOCLRenderThread::0] Materials buffer size: 192bytes
[sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 20bytes
[sLG] [PathOCLRenderThread::0] InfiniteLight buffer size: 52bytes
[sLG] [PathOCLRenderThread::0] Ray buffer size: 12288Kbytes
[sLG] [PathOCLRenderThread::0] RayHit buffer size: 4096Kbytes
[sLG] [PathOCLRenderThread::0] GPUTask buffer size: 53248Kbytes
[sLG] [PathOCLRenderThread::0] Sample dimensions: 28
[sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes
[sLG] [PathOCLRenderThread::0] SampleData buffer size: 2048Kbytes
[sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 1024Kbytes
[sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms
[sLG] [PathOCLRenderThread::0] Sobol table size: 896
[sLG] [PathOCLRenderThread::0] Defined symbols:  -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=262144 -D PARAM_IMAGE_WIDTH=512 -D PARAM_IMAGE_HEIGHT=512 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=8 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_CAMERA_HAS_DOF -D PARAM_HAS_INFINITELIGHT -D PARAM_HAS_IMAGEMAPS -D PARAM_IMAGEMAPS_PAGE_0 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=2 -D PARAM_SAMPLER_SOBOL_MAXDEPTH=8
[sLG] [PathOCLRenderThread::0] Compiling kernels 
[sLG] [PathOCLRenderThread::0] Kernels not cached
[sLG] [PathOCLRenderThread::0] Compiling Init Kernel
[sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel
[sLG] [PathOCLRenderThread::0] Kernels compilation time: 2ms

 

 

AMD ATHLON X2 7850BE

NVIDIA GT440

 

compu:SLG4_alpha cc$ ./slg4

[sLG] Reading configuration file: scenes/luxball/luxball.cfg

[sLG] Configuration:

[sLG] image.width = 640

[sLG] image.height = 480

[sLG] batch.halttime = 0

[sLG] scene.file = scenes/luxball/luxball.scn

[sLG] scene.fieldofview = 45

[sLG] opencl.cpu.use = 0

[sLG] opencl.gpu.use = 1

[sLG] opencl.gpu.workgroup.size = 64

[sLG] screen.refresh.interval = 100

[sLG] film.gamma = 2.2

[sLG] path.maxdepth = 16

[sLG] path.russianroulette.depth = 5

[sLG] light.maxdepth = 16

[sLG] light.russianroulette.depth = 5

[LuxRays::SDL] Reading scene: scenes/luxball/luxball.scn

[LuxRays::SDL] Camera postion: Point[0.5, -1.4, 1.7]

[LuxRays::SDL] Camera target: Point[0, 0, 0.4]

[LuxRays::SDL] Material definition: shell

[LuxRays::SDL] Material definition: whitematte

[LuxRays::SDL] Material definition: luxtext

[LuxRays::SDL] Material definition: blacktext

[LuxRays::SDL] Material definition: whitelight

[LuxRays::SDL] The l1 object is a light sources with 2 triangles

[LuxRays::SDL] The l2 object is a light sources with 2 triangles

[LuxRays::SDL] PLY object count: 7

[LuxRays] OpenCL Platform 0: Apple

[LuxRays] Device 0 name: NativeThread-000

[LuxRays] Device 0 type: NATIVE_THREAD

[LuxRays] Device 0 compute units: 1

[LuxRays] Device 0 max allocable memory: 0MBytes

[LuxRays] Device 0 max allocable memory block size: 0MBytes

[LuxRays] Device 1 name: NativeThread-001

[LuxRays] Device 1 type: NATIVE_THREAD

[LuxRays] Device 1 compute units: 1

[LuxRays] Device 1 max allocable memory: 0MBytes

[LuxRays] Device 1 max allocable memory block size: 0MBytes

[LuxRays] Device 2 name: GeForce GT 440

[LuxRays] Device 2 type: OPENCL_GPU

[LuxRays] Device 2 compute units: 2

[LuxRays] Device 2 max allocable memory: 1024MBytes

[LuxRays] Device 2 max allocable memory block size: 256MBytes

[LuxRays] Device 3 name: AMD Athlon 7850 Dual-Core Processor

[LuxRays] Device 3 type: OPENCL_CPU

[LuxRays] Device 3 compute units: 2

[LuxRays] Device 3 max allocable memory: 3072MBytes

[LuxRays] Device 3 max allocable memory block size: 1024MBytes

[LuxRays] Preprocessing DataSet

[LuxRays] Total vertex count: 223380

[LuxRays] Total triangle count: 260794

[LuxRays] Total vertices memory usage: 2617Kbytes

[LuxRays] Total triangles memory usage: 3056Kbytes

[LuxRays] Building QBVH, primitives: 260794, initial nodes: 21734

[LuxRays] QBVH completed with 43467/43468 nodes

[LuxRays] Total QBVH memory usage: 4754Kbytes

[LuxRays] Total QBVH QuadTriangle count: 87103

[LuxRays] Max. QBVH Depth: 25

[LuxRays] Creating 1 intersection device(s)

[LuxRays] Allocating intersection device 0: GeForce GT 440 (Type = OPENCL_GPU)

[sLG] OpenCL Devices used:

[sLG] [GeForce GT 440Intersect]

[sLG] RUNTIME ERROR: OpenCL version 1.1 or better is required for device: GeForce GT 440Intersect

compu:SLG4_alpha cc$ uname -a

Darwin compu.local 10.8.0 Darwin Kernel Version 10.8.0: ma 22 aug 2011 22:12:25 CEST; legacy kernel v8 :xnu-1504.15.3/BUILD/obj/RELEASE_I386 i386

compu:SLG4_alpha cc$ /Users/cc/Desktop/oclinfo

1 OpenCL platform found!

 

[Platform 0]

Name: Apple

Vendor: Apple

Version: OpenCL 1.0 (Dec 23 2010 17:30:26)

Profile: FULL_PROFILE

 

 

2 OpenCL devices found!

 

[Device 0]

Name: GeForce GT 440

Vendor: NVIDIA

Type: GPU

Device Version: OpenCL 1.0

Driver Version: CLH 1.0

Compute Units: 2

Work Group Size: 1024

Clock: 1620 MHz

Global Memory: 1024 MB

Local Memory: 48 KB

Cache Size: 0 KB

Cache Line Size: 0 Bytes

Available: Yes

Double-Precision: No

Extensions:

cl_khr_byte_addressable_store

cl_khr_global_int32_base_atomics

cl_khr_global_int32_extended_atomics

cl_APPLE_gl_sharing

cl_APPLE_SetMemObjectDestructor

cl_APPLE_ContextLoggingFunctions

 

[Device 1]

Name: AMD Athlon 7850 Dual-Core Processor

Vendor: Intel

Type: CPU

Device Version: OpenCL 1.0

Driver Version: 1.0

Compute Units: 2

Work Group Size: 1

Clock: 3000 MHz

Global Memory (Total): 5120 MB

Global Memory (Host): 3072 MB

Global Memory (PCIe): 2048 MB

Local Memory: 16 KB

Cache Size: 4096 KB

Cache Line Size: 64 Bytes

Available: Yes

Double-Precision: Yes

Extensions:

cl_khr_fp64

cl_khr_global_int32_base_atomics

cl_khr_global_int32_extended_atomics

cl_khr_local_int32_base_atomics

cl_khr_local_int32_extended_atomics

cl_khr_byte_addressable_store

cl_APPLE_gl_sharing

cl_APPLE_SetMemObjectDestructor

cl_APPLE_ContextLoggingFunctions

 

Thanks - AMD GPU developers found the bug in the MAD OpenCL part (which made overbrigthen in first test / green test wrong). They will fix it and submit changes to Apple.

 

In the past i see that non other Nvidia gpu could run slg4 alpha without chrashing - at least the submitted Fermi based gpus crashed.

Anyone else with nvidia (Fermi +non fermi ) may try the newer test (green ball) again?

DL + Info look post #15

tested on my GT 430

results:

post-11772-0-32866500-1360844383_thumb.png

 

Last login: Wed Feb 13 20:18:45 on ttys000

Leslies-Mac-Pro:~ leslie$ cd /Users/leslie/Downloads/OSX_AMD_slg4_proof

Leslies-Mac-Pro:OSX_AMD_slg4_proof leslie$ ./test_mac_fix_ok/slg4 -D opencl.devices.select 01 -D sampler.type SOBOL -D opencl.task.count 262144 -w 512 -e 512 scenes/luxball/luxball-hdr.cfg

[sLG] Reading configuration file: scenes/luxball/luxball-hdr.cfg

[sLG] Configuration:

[sLG] image.width = 512

[sLG] image.height = 512

[sLG] batch.halttime = 0

[sLG] scene.file = scenes/luxball/luxball-hdr.scn

[sLG] opencl.cpu.use = 0

[sLG] opencl.gpu.use = 1

[sLG] opencl.gpu.workgroup.size = 64

[sLG] screen.refresh.interval = 100

[sLG] path.maxdepth = 8

[sLG] path.russianroulette.depth = 5

[sLG] opencl.devices.select = 01

[sLG] sampler.type = SOBOL

[sLG] opencl.task.count = 262144

[LuxRays::SDL] Reading scene: scenes/luxball/luxball-hdr.scn

[LuxRays::SDL] Camera postion: Point[0.6, -1.7, 0.4]

[LuxRays::SDL] Camera target: Point[0, 0, 0.4]

[LuxRays::SDL] Material definition: shell

[LuxRays::SDL] Material definition: whitematte

[LuxRays::SDL] Material definition: luxtext

[LuxRays::SDL] Material definition: blacktext

[LuxRays::SDL] PLY object count: 5

[LuxRays::SDL] Reading texture map: scenes/simple-mat/arch.exr

[LuxRays::SDL] HDR RGB (128bit) texture map size: 1024x512 (6144Kbytes)

[LuxRays] OpenCL Platform 0: Apple

[LuxRays] Device 0 name: NativeThread-000

[LuxRays] Device 0 type: NATIVE_THREAD

[LuxRays] Device 0 compute units: 1

[LuxRays] Device 0 max allocable memory: 0MBytes

[LuxRays] Device 0 max allocable memory block size: 0MBytes

[LuxRays] Device 1 name: NativeThread-001

[LuxRays] Device 1 type: NATIVE_THREAD

[LuxRays] Device 1 compute units: 1

[LuxRays] Device 1 max allocable memory: 0MBytes

[LuxRays] Device 1 max allocable memory block size: 0MBytes

[LuxRays] Device 2 name: Intel® Core™2 Duo CPU E6550 @ 2.33GHz

[LuxRays] Device 2 type: OPENCL_CPU

[LuxRays] Device 2 compute units: 2

[LuxRays] Device 2 max allocable memory: 8192MBytes

[LuxRays] Device 2 max allocable memory block size: 2048MBytes

[LuxRays] Device 3 name: GeForce GT 430

[LuxRays] Device 3 type: OPENCL_GPU

[LuxRays] Device 3 compute units: 2

[LuxRays] Device 3 max allocable memory: 1024MBytes

[LuxRays] Device 3 max allocable memory block size: 256MBytes

[LuxRays] Preprocessing DataSet

[LuxRays] Total vertex count: 223352

[LuxRays] Total triangle count: 260780

[LuxRays] Total vertices memory usage: 2617Kbytes

[LuxRays] Total triangles memory usage: 3056Kbytes

[LuxRays] Building QBVH, primitives: 260780, initial nodes: 21733

[LuxRays] QBVH completed with 43609/86932 nodes

[LuxRays] Total QBVH memory usage: 4769Kbytes

[LuxRays] Total QBVH QuadTriangle count: 87045

[LuxRays] Max. QBVH Depth: 24

[LuxRays] Creating 1 intersection device(s)

[LuxRays] Allocating intersection device 0: GeForce GT 430 (Type = OPENCL_GPU)

[sLG] OpenCL Devices used:

[sLG] [GeForce GT 430Intersect]

[sLG] Device OpenCL version: OpenCL 1.1

[LuxRays] [OpenCL device::GeForce GT 430Intersect] Ray buffer size: 3072Kbytes

[LuxRays] [OpenCL device::GeForce GT 430Intersect] Ray hits buffer size: 1024Kbytes

[LuxRays] [OpenCL device::GeForce GT 430Intersect] OpenCL image support is available

[LuxRays] [OpenCL device::GeForce GT 430Intersect] OpenCL max. image buffer size: 8192x8192

[LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH node image buffer size: 553x553

[LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH triangle image buffer size: 940x927

[LuxRays] [OpenCL device::GeForce GT 430Intersect] Enabled QBVH scene storage inside image

[LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH max. stack size: 24

[LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH Image Storage kernel work group size: 512

[LuxRays] [OpenCL device::GeForce GT 430Intersect] Suggested work group size: 512

[LuxRays] [OpenCL device::GeForce GT 430Intersect] Forced work group size: 64

[sLG] Configuring 1 CPU render threads

[sLG] [PathOCLRenderThread] OpenCL task count: 262144

[sLG] [PathOCLRenderThread] OpenCL max. page memory size: 262144Kbytes

[sLG] [PathOCLRenderThread::CompiledScene] Compile Camera

[sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry

[sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 13ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Materials

[sLG] [PathOCLRenderThread::CompiledScene] Compile Textures

[sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights

[sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight

[sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps

[sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 1

[sLG] [PathOCLRenderThread::CompiledScene] RGB channel page 0 size: 6144Kbytes

[sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 2ms

[sLG] Starting 1 PathOCL render threads

[sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4128Kbytes

[sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes

[sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes

[sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] UVs buffer size: 1744Kbytes

[sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes

[sLG] [PathOCLRenderThread::0] ImageMaps description buffer size: 20bytes

[sLG] [PathOCLRenderThread::0] ImageMaps buffer size: 6144Kbytes

[sLG] [PathOCLRenderThread::0] Textures buffer size: 144bytes

[sLG] [PathOCLRenderThread::0] Materials buffer size: 192bytes

[sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 20bytes

[sLG] [PathOCLRenderThread::0] InfiniteLight buffer size: 52bytes

[sLG] [PathOCLRenderThread::0] Ray buffer size: 12288Kbytes

[sLG] [PathOCLRenderThread::0] RayHit buffer size: 4096Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask buffer size: 53248Kbytes

[sLG] [PathOCLRenderThread::0] Sample dimensions: 28

[sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes

[sLG] [PathOCLRenderThread::0] SampleData buffer size: 2048Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 1024Kbytes

[sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms

[sLG] [PathOCLRenderThread::0] Sobol table size: 896

[sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=262144 -D PARAM_IMAGE_WIDTH=512 -D PARAM_IMAGE_HEIGHT=512 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=8 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_CAMERA_HAS_DOF -D PARAM_HAS_INFINITELIGHT -D PARAM_HAS_IMAGEMAPS -D PARAM_IMAGEMAPS_PAGE_0 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=2 -D PARAM_SAMPLER_SOBOL_MAXDEPTH=8

[sLG] [PathOCLRenderThread::0] Compiling kernels

[sLG] [PathOCLRenderThread::0] Kernels not cached

[sLG] [PathOCLRenderThread::0] Compiling Init Kernel

[sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel

[sLG] [PathOCLRenderThread::0] Kernels compilation time: 258ms

 

 

post-11772-0-27244100-1360844394_thumb.png

 

Leslies-Mac-Pro:OSX_AMD_slg4_proof leslie$ ./test_mac_fix_ok/slg4 -D opencl.devices.select 01 -D sampler.type SOBOL -D opencl.task.count 262144 -w 512 -e 512 scenes/luxball/luxball-hdr.cfg

[sLG] Reading configuration file: scenes/luxball/luxball-hdr.cfg

[sLG] Configuration:

[sLG] image.width = 512

[sLG] image.height = 512

[sLG] batch.halttime = 0

[sLG] scene.file = scenes/luxball/luxball-hdr.scn

[sLG] opencl.cpu.use = 0

[sLG] opencl.gpu.use = 1

[sLG] opencl.gpu.workgroup.size = 64

[sLG] screen.refresh.interval = 100

[sLG] path.maxdepth = 8

[sLG] path.russianroulette.depth = 5

[sLG] opencl.devices.select = 01

[sLG] sampler.type = SOBOL

[sLG] opencl.task.count = 262144

[LuxRays::SDL] Reading scene: scenes/luxball/luxball-hdr.scn

[LuxRays::SDL] Camera postion: Point[0.6, -1.7, 0.4]

[LuxRays::SDL] Camera target: Point[0, 0, 0.4]

[LuxRays::SDL] Material definition: shell

[LuxRays::SDL] Material definition: whitematte

[LuxRays::SDL] Material definition: luxtext

[LuxRays::SDL] Material definition: blacktext

[LuxRays::SDL] PLY object count: 5

[LuxRays::SDL] Reading texture map: scenes/simple-mat/arch.exr

[LuxRays::SDL] HDR RGB (128bit) texture map size: 1024x512 (6144Kbytes)

[LuxRays] OpenCL Platform 0: Apple

[LuxRays] Device 0 name: NativeThread-000

[LuxRays] Device 0 type: NATIVE_THREAD

[LuxRays] Device 0 compute units: 1

[LuxRays] Device 0 max allocable memory: 0MBytes

[LuxRays] Device 0 max allocable memory block size: 0MBytes

[LuxRays] Device 1 name: NativeThread-001

[LuxRays] Device 1 type: NATIVE_THREAD

[LuxRays] Device 1 compute units: 1

[LuxRays] Device 1 max allocable memory: 0MBytes

[LuxRays] Device 1 max allocable memory block size: 0MBytes

[LuxRays] Device 2 name: Intel® Core™2 Duo CPU E6550 @ 2.33GHz

[LuxRays] Device 2 type: OPENCL_CPU

[LuxRays] Device 2 compute units: 2

[LuxRays] Device 2 max allocable memory: 8192MBytes

[LuxRays] Device 2 max allocable memory block size: 2048MBytes

[LuxRays] Device 3 name: GeForce GT 430

[LuxRays] Device 3 type: OPENCL_GPU

[LuxRays] Device 3 compute units: 2

[LuxRays] Device 3 max allocable memory: 1024MBytes

[LuxRays] Device 3 max allocable memory block size: 256MBytes

[LuxRays] Preprocessing DataSet

[LuxRays] Total vertex count: 223352

[LuxRays] Total triangle count: 260780

[LuxRays] Total vertices memory usage: 2617Kbytes

[LuxRays] Total triangles memory usage: 3056Kbytes

[LuxRays] Building QBVH, primitives: 260780, initial nodes: 21733

[LuxRays] QBVH completed with 43609/86932 nodes

[LuxRays] Total QBVH memory usage: 4769Kbytes

[LuxRays] Total QBVH QuadTriangle count: 87045

[LuxRays] Max. QBVH Depth: 24

[LuxRays] Creating 1 intersection device(s)

[LuxRays] Allocating intersection device 0: GeForce GT 430 (Type = OPENCL_GPU)

[sLG] OpenCL Devices used:

[sLG] [GeForce GT 430Intersect]

[sLG] Device OpenCL version: OpenCL 1.1

[LuxRays] [OpenCL device::GeForce GT 430Intersect] Ray buffer size: 3072Kbytes

[LuxRays] [OpenCL device::GeForce GT 430Intersect] Ray hits buffer size: 1024Kbytes

[LuxRays] [OpenCL device::GeForce GT 430Intersect] OpenCL image support is available

[LuxRays] [OpenCL device::GeForce GT 430Intersect] OpenCL max. image buffer size: 8192x8192

[LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH node image buffer size: 553x553

[LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH triangle image buffer size: 940x927

[LuxRays] [OpenCL device::GeForce GT 430Intersect] Enabled QBVH scene storage inside image

[LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH max. stack size: 24

[LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH Image Storage kernel work group size: 512

[LuxRays] [OpenCL device::GeForce GT 430Intersect] Suggested work group size: 512

[LuxRays] [OpenCL device::GeForce GT 430Intersect] Forced work group size: 64

[sLG] Configuring 1 CPU render threads

[sLG] [PathOCLRenderThread] OpenCL task count: 262144

[sLG] [PathOCLRenderThread] OpenCL max. page memory size: 262144Kbytes

[sLG] [PathOCLRenderThread::CompiledScene] Compile Camera

[sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry

[sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 13ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Materials

[sLG] [PathOCLRenderThread::CompiledScene] Compile Textures

[sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights

[sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight

[sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps

[sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 1

[sLG] [PathOCLRenderThread::CompiledScene] RGB channel page 0 size: 6144Kbytes

[sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 2ms

[sLG] Starting 1 PathOCL render threads

[sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4128Kbytes

[sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes

[sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes

[sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] UVs buffer size: 1744Kbytes

[sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes

[sLG] [PathOCLRenderThread::0] ImageMaps description buffer size: 20bytes

[sLG] [PathOCLRenderThread::0] ImageMaps buffer size: 6144Kbytes

[sLG] [PathOCLRenderThread::0] Textures buffer size: 144bytes

[sLG] [PathOCLRenderThread::0] Materials buffer size: 192bytes

[sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 20bytes

[sLG] [PathOCLRenderThread::0] InfiniteLight buffer size: 52bytes

[sLG] [PathOCLRenderThread::0] Ray buffer size: 12288Kbytes

[sLG] [PathOCLRenderThread::0] RayHit buffer size: 4096Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask buffer size: 53248Kbytes

[sLG] [PathOCLRenderThread::0] Sample dimensions: 28

[sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes

[sLG] [PathOCLRenderThread::0] SampleData buffer size: 2048Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 1024Kbytes

[sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms

[sLG] [PathOCLRenderThread::0] Sobol table size: 896

[sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=262144 -D PARAM_IMAGE_WIDTH=512 -D PARAM_IMAGE_HEIGHT=512 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=8 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_CAMERA_HAS_DOF -D PARAM_HAS_INFINITELIGHT -D PARAM_HAS_IMAGEMAPS -D PARAM_IMAGEMAPS_PAGE_0 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=2 -D PARAM_SAMPLER_SOBOL_MAXDEPTH=8

[sLG] [PathOCLRenderThread::0] Compiling kernels

[sLG] [PathOCLRenderThread::0] Kernels not cached

[sLG] [PathOCLRenderThread::0] Compiling Init Kernel

[sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel

[sLG] [PathOCLRenderThread::0] Kernels compilation time: 6ms

Leslies-Mac-Pro:OSX_AMD_slg4_proof leslie$

 

 

:smoke:

oclinfo:

 

 

 

Last login: Thu Feb 14 07:03:03 on ttys000

/Applications/Utilities/oclinfo-0.2/oclinfo ; exit;

Leslies-Mac-Pro:~ leslie$ /Applications/Utilities/oclinfo-0.2/oclinfo ; exit;

1 OpenCL platform found!

 

[Platform 0]

Name: Apple

Vendor: Apple

Version: OpenCL 1.2 (Aug 24 2012 00:53:09)

Profile: FULL_PROFILE

Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event

 

 

[OpenCL-only Context]

2 OpenCL devices found!

 

[Device 0]

Name: Intel® Core2 Duo CPU E6550 @ 2.33GHz

Vendor: Intel

Type: CPU

Device Version: OpenCL 1.2

Driver Version: 1.1

Compute Units: 2

Work Group Size: 1024

Clock: 2327 MHz

Global Memory (Total): 8192 MB

Global Memory (Host): 8192 MB

Global Memory (PCIe): 0 MB

Local Memory: 32 KB

Cache Size: 0.0625 KB

Cache Line Size: 4194304 Bytes

Available: Yes

Double-Precision: Yes

Extensions:

cl_APPLE_SetMemObjectDestructor

cl_APPLE_ContextLoggingFunctions

cl_APPLE_clut

cl_APPLE_query_kernel_names

cl_APPLE_gl_sharing

cl_khr_gl_event

cl_khr_fp64

cl_khr_global_int32_base_atomics

cl_khr_global_int32_extended_atomics

cl_khr_local_int32_base_atomics

cl_khr_local_int32_extended_atomics

cl_khr_byte_addressable_store

cl_khr_int64_base_atomics

cl_khr_int64_extended_atomics

cl_khr_3d_image_writes

cl_APPLE_fp64_basic_ops

cl_APPLE_fixed_alpha_channel_orders

cl_APPLE_biased_fixed_point_image_formats

 

[Device 1]

Name: GeForce GT 430

Vendor: NVIDIA

Type: GPU

Device Version: OpenCL 1.1

Driver Version: CLH 1.0

Compute Units: 2

Work Group Size: 1024

Clock: 1400 MHz

Global Memory: 1024 MB

Local Memory: 48 KB

Cache Size: 0 KB

Cache Line Size: 0 Bytes

Available: Yes

Double-Precision: No

Extensions:

cl_APPLE_SetMemObjectDestructor

cl_APPLE_ContextLoggingFunctions

cl_APPLE_clut

cl_APPLE_query_kernel_names

cl_APPLE_gl_sharing

cl_khr_gl_event

cl_khr_byte_addressable_store

cl_khr_global_int32_base_atomics

cl_khr_global_int32_extended_atomics

cl_khr_local_int32_base_atomics

cl_khr_local_int32_extended_atomics

cl_APPLE_fp64_basic_ops

 

[shared OpenCL+OpenGL Context]

2 OpenCL devices found!

 

[Device 0]

Name: GeForce GT 430

Vendor: NVIDIA

Type: GPU

Device Version: OpenCL 1.1

Driver Version: CLH 1.0

Compute Units: 2

Work Group Size: 1024

Clock: 1400 MHz

Global Memory: 1024 MB

Local Memory: 48 KB

Cache Size: 0 KB

Cache Line Size: 0 Bytes

Available: Yes

Double-Precision: No

Extensions:

cl_APPLE_SetMemObjectDestructor

cl_APPLE_ContextLoggingFunctions

cl_APPLE_clut

cl_APPLE_query_kernel_names

cl_APPLE_gl_sharing

cl_khr_gl_event

cl_khr_byte_addressable_store

cl_khr_global_int32_base_atomics

cl_khr_global_int32_extended_atomics

cl_khr_local_int32_base_atomics

cl_khr_local_int32_extended_atomics

cl_APPLE_fp64_basic_ops

 

[Device 1]

Name: Intel® Core2 Duo CPU E6550 @ 2.33GHz

Vendor: Intel

Type: CPU

Device Version: OpenCL 1.2

Driver Version: 1.1

Compute Units: 2

Work Group Size: 1024

Clock: 2327 MHz

Global Memory (Total): 8192 MB

Global Memory (Host): 8192 MB

Global Memory (PCIe): 0 MB

Local Memory: 32 KB

Cache Size: 0.0625 KB

Cache Line Size: 4194304 Bytes

Available: Yes

Double-Precision: Yes

Extensions:

cl_APPLE_SetMemObjectDestructor

cl_APPLE_ContextLoggingFunctions

cl_APPLE_clut

cl_APPLE_query_kernel_names

cl_APPLE_gl_sharing

cl_khr_gl_event

cl_khr_fp64

cl_khr_global_int32_base_atomics

cl_khr_global_int32_extended_atomics

cl_khr_local_int32_base_atomics

cl_khr_local_int32_extended_atomics

cl_khr_byte_addressable_store

cl_khr_int64_base_atomics

cl_khr_int64_extended_atomics

cl_khr_3d_image_writes

cl_APPLE_fp64_basic_ops

cl_APPLE_fixed_alpha_channel_orders

cl_APPLE_biased_fixed_point_image_formats

 

logout

 

[Process completed]

 

 

 

...hi Mitch, the file from post #1 crashes here...

 

 

Leslies-Mac-Pro:SLG4_alpha leslie$ /Users/leslie/Downloads/SLG4_alpha/slg4

[sLG] Reading configuration file: scenes/luxball/luxball.cfg

[sLG] Configuration:

[sLG] image.width = 640

[sLG] image.height = 480

[sLG] batch.halttime = 0

[sLG] scene.file = scenes/luxball/luxball.scn

[sLG] scene.fieldofview = 45

[sLG] opencl.cpu.use = 0

[sLG] opencl.gpu.use = 1

[sLG] opencl.gpu.workgroup.size = 64

[sLG] screen.refresh.interval = 100

[sLG] film.gamma = 2.2

[sLG] path.maxdepth = 16

[sLG] path.russianroulette.depth = 5

[sLG] light.maxdepth = 16

[sLG] light.russianroulette.depth = 5

[LuxRays::SDL] Reading scene: scenes/luxball/luxball.scn

[LuxRays::SDL] Camera postion: Point[0.5, -1.4, 1.7]

[LuxRays::SDL] Camera target: Point[0, 0, 0.4]

[LuxRays::SDL] Material definition: shell

[LuxRays::SDL] Material definition: whitematte

[LuxRays::SDL] Material definition: luxtext

[LuxRays::SDL] Material definition: blacktext

[LuxRays::SDL] Material definition: whitelight

[LuxRays::SDL] The l1 object is a light sources with 2 triangles

[LuxRays::SDL] The l2 object is a light sources with 2 triangles

[LuxRays::SDL] PLY object count: 7

[LuxRays] OpenCL Platform 0: Apple

[LuxRays] Device 0 name: NativeThread-000

[LuxRays] Device 0 type: NATIVE_THREAD

[LuxRays] Device 0 compute units: 1

[LuxRays] Device 0 max allocable memory: 0MBytes

[LuxRays] Device 0 max allocable memory block size: 0MBytes

[LuxRays] Device 1 name: NativeThread-001

[LuxRays] Device 1 type: NATIVE_THREAD

[LuxRays] Device 1 compute units: 1

[LuxRays] Device 1 max allocable memory: 0MBytes

[LuxRays] Device 1 max allocable memory block size: 0MBytes

[LuxRays] Device 2 name: Intel® Core™2 Duo CPU E6550 @ 2.33GHz

[LuxRays] Device 2 type: OPENCL_CPU

[LuxRays] Device 2 compute units: 2

[LuxRays] Device 2 max allocable memory: 8192MBytes

[LuxRays] Device 2 max allocable memory block size: 2048MBytes

[LuxRays] Device 3 name: GeForce GT 430

[LuxRays] Device 3 type: OPENCL_GPU

[LuxRays] Device 3 compute units: 2

[LuxRays] Device 3 max allocable memory: 1024MBytes

[LuxRays] Device 3 max allocable memory block size: 256MBytes

[LuxRays] Preprocessing DataSet

[LuxRays] Total vertex count: 223380

[LuxRays] Total triangle count: 260794

[LuxRays] Total vertices memory usage: 2617Kbytes

[LuxRays] Total triangles memory usage: 3056Kbytes

[LuxRays] Building QBVH, primitives: 260794, initial nodes: 21734

[LuxRays] QBVH completed with 43467/43468 nodes

[LuxRays] Total QBVH memory usage: 4754Kbytes

[LuxRays] Total QBVH QuadTriangle count: 87103

[LuxRays] Max. QBVH Depth: 25

[LuxRays] Creating 1 intersection device(s)

[LuxRays] Allocating intersection device 0: GeForce GT 430 (Type = OPENCL_GPU)

[sLG] OpenCL Devices used:

[sLG] [GeForce GT 430Intersect]

[LuxRays] [OpenCL device::GeForce GT 430Intersect] Ray buffer size: 3072Kbytes

[LuxRays] [OpenCL device::GeForce GT 430Intersect] Ray hits buffer size: 1024Kbytes

[LuxRays] [OpenCL device::GeForce GT 430Intersect] OpenCL image support is available

[LuxRays] [OpenCL device::GeForce GT 430Intersect] OpenCL max. image buffer size: 8192x8192

[LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH node image buffer size: 553x551

[LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH triangle image buffer size: 940x927

[LuxRays] [OpenCL device::GeForce GT 430Intersect] Enabled QBVH scene storage inside image

[LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH max. stack size: 24

[LuxRays] [OpenCL device::GeForce GT 430Intersect] QBVH Image Storage kernel work group size: 512

[LuxRays] [OpenCL device::GeForce GT 430Intersect] Suggested work group size: 512

[LuxRays] [OpenCL device::GeForce GT 430Intersect] Forced work group size: 64

[sLG] Configuring 1 CPU render threads

[sLG] [PathOCLRenderThread] OpenCL task count: 65536

[sLG] [PathOCLRenderThread] OpenCL max. page memory size: 262144Kbytes

[sLG] [PathOCLRenderThread::CompiledScene] Compile Camera

[sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry

[sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 12ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Materials

[sLG] [PathOCLRenderThread::CompiledScene] Compile Textures

[sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights

[sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 1ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight

[sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps

[sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 0

[sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 0ms

[sLG] Starting 1 PathOCL render threads

[sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4835Kbytes

[sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes

[sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes

[sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] UVs buffer size: 1745Kbytes

[sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes

[sLG] [PathOCLRenderThread::0] Textures buffer size: 324bytes

[sLG] [PathOCLRenderThread::0] Materials buffer size: 160bytes

[sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 28bytes

[sLG] [PathOCLRenderThread::0] Triangle AreaLights buffer size: 272bytes

[sLG] [PathOCLRenderThread::0] Triangle AreaLights index buffer size: 1018Kbytes

[sLG] [PathOCLRenderThread::0] Ray buffer size: 3072Kbytes

[sLG] [PathOCLRenderThread::0] RayHit buffer size: 1024Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask buffer size: 23296Kbytes

[sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes

[sLG] [PathOCLRenderThread::0] SampleData buffer size: 512Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 256Kbytes

[sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms

[sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=65536 -D PARAM_IMAGE_WIDTH=640 -D PARAM_IMAGE_HEIGHT=480 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=16 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_ENABLE_MAT_GLASS -D PARAM_HAS_PASSTHROUGH -D PARAM_DIRECT_LIGHT_SAMPLING -D PARAM_DL_LIGHT_COUNT=4 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=0

[sLG] [PathOCLRenderThread::0] Compiling kernels

[sLG] [PathOCLRenderThread::0] Kernels not cached

[sLG] [PathOCLRenderThread::0] Compiling Init Kernel

[sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel

[sLG] [PathOCLRenderThread::0] Kernels compilation time: 5ms

[sLG] [PathOCLRenderThread::0] Rendering thread ERROR: clFlush(CL_INVALID_COMMAND_QUEUE)

[sLG] =========================================================

[sLG] Unhandled exception

[sLG] Obtained 5 stack frames.

[sLG] 0

[sLG] 1

[sLG] 2

[sLG] 3

[sLG] 4

Abort trap: 6

Leslies-Mac-Pro:SLG4_alpha leslie$

 

:smoke:

Fermi based gpus reported always crashing in OpenCL part.

Yes, it crashes :( Fermi GTX580:

 

 

SLG4_alpha np$ ./slg4

[sLG] Reading configuration file: scenes/luxball/luxball.cfg

[sLG] Configuration:

[sLG] image.width = 640

[sLG] image.height = 480

[sLG] batch.halttime = 0

[sLG] scene.file = scenes/luxball/luxball.scn

[sLG] scene.fieldofview = 45

[sLG] opencl.cpu.use = 0

[sLG] opencl.gpu.use = 1

[sLG] opencl.gpu.workgroup.size = 64

[sLG] screen.refresh.interval = 100

[sLG] film.gamma = 2.2

[sLG] path.maxdepth = 16

[sLG] path.russianroulette.depth = 5

[sLG] light.maxdepth = 16

[sLG] light.russianroulette.depth = 5

[LuxRays::SDL] Reading scene: scenes/luxball/luxball.scn

[LuxRays::SDL] Camera postion: Point[0.5, -1.4, 1.7]

[LuxRays::SDL] Camera target: Point[0, 0, 0.4]

[LuxRays::SDL] Material definition: shell

[LuxRays::SDL] Material definition: whitematte

[LuxRays::SDL] Material definition: luxtext

[LuxRays::SDL] Material definition: blacktext

[LuxRays::SDL] Material definition: whitelight

[LuxRays::SDL] The l1 object is a light sources with 2 triangles

[LuxRays::SDL] The l2 object is a light sources with 2 triangles

[LuxRays::SDL] PLY object count: 7

[LuxRays] OpenCL Platform 0: Apple

[LuxRays] Device 0 name: NativeThread-000

[LuxRays] Device 0 type: NATIVE_THREAD

[LuxRays] Device 0 compute units: 1

[LuxRays] Device 0 max allocable memory: 0MBytes

[LuxRays] Device 0 max allocable memory block size: 0MBytes

[LuxRays] Device 1 name: NativeThread-001

[LuxRays] Device 1 type: NATIVE_THREAD

[LuxRays] Device 1 compute units: 1

[LuxRays] Device 1 max allocable memory: 0MBytes

[LuxRays] Device 1 max allocable memory block size: 0MBytes

[LuxRays] Device 2 name: NativeThread-002

[LuxRays] Device 2 type: NATIVE_THREAD

[LuxRays] Device 2 compute units: 1

[LuxRays] Device 2 max allocable memory: 0MBytes

[LuxRays] Device 2 max allocable memory block size: 0MBytes

[LuxRays] Device 3 name: NativeThread-003

[LuxRays] Device 3 type: NATIVE_THREAD

[LuxRays] Device 3 compute units: 1

[LuxRays] Device 3 max allocable memory: 0MBytes

[LuxRays] Device 3 max allocable memory block size: 0MBytes

[LuxRays] Device 4 name: NativeThread-004

[LuxRays] Device 4 type: NATIVE_THREAD

[LuxRays] Device 4 compute units: 1

[LuxRays] Device 4 max allocable memory: 0MBytes

[LuxRays] Device 4 max allocable memory block size: 0MBytes

[LuxRays] Device 5 name: NativeThread-005

[LuxRays] Device 5 type: NATIVE_THREAD

[LuxRays] Device 5 compute units: 1

[LuxRays] Device 5 max allocable memory: 0MBytes

[LuxRays] Device 5 max allocable memory block size: 0MBytes

[LuxRays] Device 6 name: NativeThread-006

[LuxRays] Device 6 type: NATIVE_THREAD

[LuxRays] Device 6 compute units: 1

[LuxRays] Device 6 max allocable memory: 0MBytes

[LuxRays] Device 6 max allocable memory block size: 0MBytes

[LuxRays] Device 7 name: NativeThread-007

[LuxRays] Device 7 type: NATIVE_THREAD

[LuxRays] Device 7 compute units: 1

[LuxRays] Device 7 max allocable memory: 0MBytes

[LuxRays] Device 7 max allocable memory block size: 0MBytes

[LuxRays] Device 8 name: Intel® Core i7-2700K CPU @ 3.50GHz

[LuxRays] Device 8 type: OPENCL_CPU

[LuxRays] Device 8 compute units: 8

[LuxRays] Device 8 max allocable memory: 8192MBytes

[LuxRays] Device 8 max allocable memory block size: 2048MBytes

[LuxRays] Device 9 name: GeForce GTX 580

[LuxRays] Device 9 type: OPENCL_GPU

[LuxRays] Device 9 compute units: 64

[LuxRays] Device 9 max allocable memory: 1536MBytes

[LuxRays] Device 9 max allocable memory block size: 384MBytes

[LuxRays] Preprocessing DataSet

[LuxRays] Total vertex count: 223380

[LuxRays] Total triangle count: 260794

[LuxRays] Total vertices memory usage: 2617Kbytes

[LuxRays] Total triangles memory usage: 3056Kbytes

[LuxRays] Building QBVH, primitives: 260794, initial nodes: 21734

[LuxRays] QBVH completed with 43467/43468 nodes

[LuxRays] Total QBVH memory usage: 4754Kbytes

[LuxRays] Total QBVH QuadTriangle count: 87103

[LuxRays] Max. QBVH Depth: 25

[LuxRays] Creating 1 intersection device(s)

[LuxRays] Allocating intersection device 0: GeForce GTX 580 (Type = OPENCL_GPU)

[sLG] OpenCL Devices used:

[sLG] [GeForce GTX 580Intersect]

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Ray buffer size: 3072Kbytes

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Ray hits buffer size: 1024Kbytes

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] OpenCL image support is available

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] OpenCL max. image buffer size: 8192x8192

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH node image buffer size: 553x551

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH triangle image buffer size: 940x927

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Enabled QBVH scene storage inside image

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH max. stack size: 24

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] QBVH Image Storage kernel work group size: 512

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Suggested work group size: 512

[LuxRays] [OpenCL device::GeForce GTX 580Intersect] Forced work group size: 64

[sLG] Configuring 1 CPU render threads

[sLG] [PathOCLRenderThread] OpenCL task count: 65536

[sLG] [PathOCLRenderThread] OpenCL max. page memory size: 393216Kbytes

[sLG] [PathOCLRenderThread::CompiledScene] Compile Camera

[sLG] [PathOCLRenderThread::CompiledScene] Compile Geometry

[sLG] [PathOCLRenderThread::CompiledScene] Scene geometry compilation time: 6ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Materials

[sLG] [PathOCLRenderThread::CompiledScene] Compile Textures

[sLG] [PathOCLRenderThread::CompiledScene] Textures compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Material compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile Triangle AreaLights

[sLG] [PathOCLRenderThread::CompiledScene] Triangle area lights compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile InfiniteLight

[sLG] [PathOCLRenderThread::CompiledScene] Infinitelight compilation time: 0ms

[sLG] [PathOCLRenderThread::CompiledScene] Compile SunLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile SkyLight

[sLG] [PathOCLRenderThread::CompiledScene] Compile ImageMaps

[sLG] [PathOCLRenderThread::CompiledScene] Image maps page count: 0

[sLG] [PathOCLRenderThread::CompiledScene] Texture maps compilation time: 0ms

[sLG] Starting 1 PathOCL render threads

[sLG] [PathOCLRenderThread::0] FrameBuffer buffer size: 4835Kbytes

[sLG] [PathOCLRenderThread::0] Camera buffer size: 272bytes

[sLG] [PathOCLRenderThread::0] MeshIDs buffer size: 1018Kbytes

[sLG] [PathOCLRenderThread::0] Normals buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] UVs buffer size: 1745Kbytes

[sLG] [PathOCLRenderThread::0] Vertices buffer size: 2617Kbytes

[sLG] [PathOCLRenderThread::0] Triangles buffer size: 3056Kbytes

[sLG] [PathOCLRenderThread::0] Textures buffer size: 324bytes

[sLG] [PathOCLRenderThread::0] Materials buffer size: 160bytes

[sLG] [PathOCLRenderThread::0] Mesh material index buffer size: 28bytes

[sLG] [PathOCLRenderThread::0] Triangle AreaLights buffer size: 272bytes

[sLG] [PathOCLRenderThread::0] Triangle AreaLights index buffer size: 1018Kbytes

[sLG] [PathOCLRenderThread::0] Ray buffer size: 3072Kbytes

[sLG] [PathOCLRenderThread::0] RayHit buffer size: 1024Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask buffer size: 23296Kbytes

[sLG] [PathOCLRenderThread::0] Size of a SampleData: 8bytes

[sLG] [PathOCLRenderThread::0] SampleData buffer size: 512Kbytes

[sLG] [PathOCLRenderThread::0] GPUTask Stats buffer size: 256Kbytes

[sLG] [PathOCLRenderThread::0] OpenCL buffer creation time: 0ms

[sLG] [PathOCLRenderThread::0] Defined symbols: -D LUXRAYS_OPENCL_KERNEL -D SLG_OPENCL_KERNEL -D PARAM_TASK_COUNT=65536 -D PARAM_IMAGE_WIDTH=640 -D PARAM_IMAGE_HEIGHT=480 -D PARAM_RAY_EPSILON_MIN=1.000000e-09f -D PARAM_RAY_EPSILON_MAX=1.000000e-01f -D PARAM_MAX_PATH_DEPTH=16 -D PARAM_RR_DEPTH=5 -D PARAM_RR_CAP=5.000000e-01f -D PARAM_ACCEL_QBVH -D PARAM_ENABLE_TEX_CONST_FLOAT -D PARAM_ENABLE_TEX_CONST_FLOAT3 -D PARAM_ENABLE_MAT_MATTE -D PARAM_ENABLE_MAT_GLASS -D PARAM_HAS_PASSTHROUGH -D PARAM_DIRECT_LIGHT_SAMPLING -D PARAM_DL_LIGHT_COUNT=4 -D PARAM_IMAGE_FILTER_TYPE=0 -D PARAM_SAMPLER_TYPE=0

[sLG] [PathOCLRenderThread::0] Compiling kernels

[sLG] [PathOCLRenderThread::0] Kernels not cached

[sLG] [PathOCLRenderThread::0] Compiling Init Kernel

[sLG] [PathOCLRenderThread::0] Compiling AdvancePaths Kernel

[sLG] [PathOCLRenderThread::0] Kernels compilation time: 220ms

[sLG] [PathOCLRenderThread::0] Rendering thread ERROR: clFlush(CL_INVALID_COMMAND_QUEUE)

[sLG] =========================================================

[sLG] Unhandled exception

[sLG] Obtained 5 stack frames.

[sLG] 0

[sLG] 1

[sLG] 2

[sLG] 3

[sLG] 4

Abort trap: 6

 

New version and tests. One test red ball and two with textures.

Test run on CPU or GPU and should look same - and of course should not crash ;)

 

http://www.jensverwi...full_check2.zip

 

Please read + open also my little changed instructions for testing.

Instructions_for_testing.txt

 

First start time of texture tests may take up to 30 sec compile time (of OpenCL part). Second runs takes 1 sec.

 

PS: Results for textured and checkerboard test looks on GPU (compared to CPU) more dark on Nvidia.

Thats GPU checkboard test (darker than running on CPU).

Bildschirmfoto 2013-02-17 um 20.34.53.jpg

 

Happy testing & hope not always crashing on Nvidia fermi.

You can also submit your M samples/sec for those texture / chckerbaord GPU test. My 9600 GT is slow with 0.13 M samles/sec. My C2D CPU is a bit faster.

×
×
  • Create New...