compute

Author	SHA1	Message	Date
Thomas Trummer	68d373c38a	Add missing include for std::cerr (on macOS)	2016-07-10 18:38:23 +02:00
Jakub Szuppe	e199304780	Turn off inplace_reduce and radix_sort tests on Apple Tests for inplace_reduce, radix_sort and radix_sort_by_key are disabled for CPU devices on Apple platform because for CPU devices on Apple platform when local memory is used in a kernel, local work group size must be [1;1;1]. Those algorithms are designed for GPU devices, therefore we do no lose any functionality.	2016-05-05 14:23:28 +02:00
Jakub Szuppe	f29bbda7f8	Tests for radix sort in desc order	2016-04-23 18:24:24 +02:00
Kyle Lutz	fda67a22d0	Update GitHub links	2015-05-17 20:32:09 -07:00
Kyle Lutz	5c1aac441c	Fix bug when calling radix_sort() with partial ranges	2014-11-09 10:53:23 -08:00
Kyle Lutz	ccd6f21d98	Change vector constructors to take queue argument This changes the vector<T> constructors which copy or initialize data to take a queue argument used for performing the operations. Previously they just took a context argument used to initialize the buffer and then created a new command queue to use. This improves performance by not requiring a new command queue and also fixes issues when performing operations on a different command queue while the vector was still being initialized.	2014-01-27 23:39:19 -08:00
Kyle Lutz	d16309f57e	Add program_cache This adds a program cache which can be used by algorithms and other functions to store programs which may be re-used. This improves performance by reducing the need for costly recompilation of commonly used programs. Program caches are context specific and multiple copies of the same context will use the same program cache. They are created and accessed by the global get_program_cache() function. For now, only a few algorithms and functions (radix sort, mersenne twister, fixed size sorts) make use of the program cache.	2013-09-07 22:58:34 -04:00
Kyle Lutz	1b9e904cc7	Add CHECK_RANGE_EQUAL() test macro This adds a new macro for the unit-tests which checks a range of values on the device against an array of values on the host. This simplifies writing tests and removes the need to explicitly copy values back to the host for verification.	2013-05-13 23:06:40 -04:00
Denis Demidov	5d77bbebee	Global setup for OpenCL context in tests refs kylelutz/compute#9 device, context, and queue are initialized statically in `context_setup.hpp`. With this change all tests are able to complete when an NVIDIA GPU is in exclusive compute mode. Side effect of the change: Time for all tests to complete reduced from 15.71 to 13.03 sec Tesla C2075.	2013-04-19 14:53:59 +04:00
Kyle Lutz	30e5f6a836	Fix test module name for TestRadixSort This fixes the module name for the radix sort test.	2013-03-10 20:20:32 -04:00
Kyle Lutz	d34cdaac59	Initial commit	2013-03-02 15:14:17 -05:00

11 Commits