First of all, thank you so much for letting me work on your machine to track down the issue. It appears that the compiled OpenCL kernel cannot be cached to disk, so the gap you are seeing is caused by the kernel being recompiled each time. I think I have a possible fix for this. I will contact you again once I have rewritten one of the functions so we can test whether it works.
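
For context, here is a rough sketch of the kind of disk caching I mean. It is not the actual code from the project: `compile_fn` is a hypothetical stand-in for the real OpenCL build step, and `cache_key`, `load_or_compile` and the `.bin` file layout are names I made up for illustration. The point is only to show why a cache that cannot be written leads to a compile on every call.

```python
import hashlib
import os
import tempfile


def cache_key(source, build_options, device_name):
    """Hash everything that affects the compiled binary (illustrative choice)."""
    h = hashlib.sha256()
    for part in (source, build_options, device_name):
        h.update(part.encode("utf-8"))
        h.update(b"\0")  # separator so ("ab", "c") and ("a", "bc") differ
    return h.hexdigest()


def load_or_compile(source, build_options, device_name, compile_fn, cache_dir):
    """Return a compiled binary, compiling only on a cache miss.

    compile_fn is a hypothetical stand-in for the real build step:
    it takes (source, build_options) and returns bytes.
    """
    key = cache_key(source, build_options, device_name)
    path = os.path.join(cache_dir, key + ".bin")
    try:
        with open(path, "rb") as f:
            return f.read()  # cache hit: no compilation needed
    except OSError:
        pass  # missing or unreadable file: fall through and compile

    binary = compile_fn(source, build_options)
    try:
        os.makedirs(cache_dir, exist_ok=True)
        # Write to a temporary file, then rename, so a reader never
        # sees a half-written cache entry.
        fd, tmp = tempfile.mkstemp(dir=cache_dir)
        with os.fdopen(fd, "wb") as f:
            f.write(binary)
        os.replace(tmp, path)
    except OSError:
        # If the cache cannot be written, the next call misses again
        # and pays the full compile cost, which matches the symptom here.
        pass
    return binary
```

If the write step fails silently, as in the last `except` branch, everything still works but every call is a cache miss, which would look exactly like the gap you reported.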