compute

Author	SHA1	Message	Date
Kyle Lutz	1be19a6305	Add multiplies<T> specialization for std::complex<T> This adds a specialization of multiplies<T> for std::complex<T> which implements complex number multiplication. Also adds a simple test using transform() to verify the complex multiplication works correctly.	2013-04-10 22:04:04 -04:00
Kyle Lutz	8d13920dc4	Move swizzle_iterator to detail namespace This moves the swizzle_iterator class to the detail namespace.	2013-04-10 21:51:24 -04:00
Kyle Lutz	bcc3aed40f	Move pixel_input_iterator to detail namespace This moves the pixel_input_iterator class to the detail namespace.	2013-04-10 21:38:05 -04:00
Kyle Lutz	5cce555d8c	Move binary_transform_iterator to detail namespace This moves the binary_transform_iterator class to the detail namespace.	2013-04-10 21:33:29 -04:00
Kyle Lutz	e30ec9f26c	Move adjacent_transform_iterator to detail namespace This moves the adjacent_transform_iterator class to the detail namespace.	2013-04-10 21:24:15 -04:00
Kyle Lutz	6dd6e11c7d	Fix unused variable warning in get_base_iterator_buffer() This fixes an unused variable warning which occurs in the get_base_iterator_buffer() function when the base iterator is not a buffer iterator and thus the iter argument is not used.	2013-04-10 21:09:17 -04:00
Kyle Lutz	6fdffd8a2b	Replace usages of result_of() with tr1_result_of() This fixes a bug in which boost::result_of() would return the wrong result type for a function due to the new implementation using decltype instead of the result_of protocol on compilers that sufficently support C++11 (such as clang >= 3.2). Now, boost::tr1_result_of() is used to explicitly request that the result_of protocol be used even when decltype is supported by the compiler.	2013-04-10 20:17:34 -04:00
Kyle Lutz	652f99e449	Fix bug in get_buffer() for iterator adaptors This fixes a bug in which the get_buffer() method was not properly disabled for iterator adaptors with a non-buffer base iterator.	2013-04-09 21:56:24 -04:00
Kyle Lutz	5164ab4bd0	Cleanup constructors for wrapper classes This cleans up the constructor methods for the OpenCL wrapper classes and unifies the API used for creating a wrapper class object from the underlying OpenCL objects. Now, every wrapper class has a constructor taking the OpenCL object and an optional boolean retain parameter which indicates whether the constructor should increment the reference count.	2013-04-07 15:03:24 -04:00
Kyle Lutz	25a084deda	Fix indentation in kernel::get_arg_info() This fixes the indentation in the kernel::get_arg_info() method.	2013-04-07 12:57:26 -04:00
Kyle Lutz	48e1bb4da0	Update image2d/3d constuctors for OpenCL 1.2 This updates the constructors for the image2d and image3d classes to use the new clCreateImage() function instead of the deprecated clCreateImage2D/3D() functions.	2013-03-31 15:01:30 -04:00
Kyle Lutz	d56e58b48e	Add OpenCL 1.2 error codes to runtime_exception This adds support for the OpenCL 1.2 error codes to the runtime_exception class.	2013-03-31 14:58:00 -04:00
Kyle Lutz	0aa3d024dc	Fix command_queue::enqueue_marker() for OpenCL 1.2 This changes the enqueue_marker() method in the command_queue class to use clEnqueueMarkerWithWaitList() instead of the deprecated clEnqueueMarker() function when compiling with OpenCL 1.2.	2013-03-31 12:08:23 -04:00
Kyle Lutz	7e7e09b704	Fix command_queue::enqueue_barrier() for OpenCL 1.2 This changes the enqueue_barrier() method in the command_queue class to use clEnqueueBarrierWithWaitList() instead of the deprecated clEnqueueBarrier() function when compiling with OpenCL 1.2.	2013-03-31 12:03:38 -04:00
Kyle Lutz	52fef4de6b	Remove command_queue::enqueue_wait_for_event() method This remove the enqueue_wait_for_event() method from the command_queue class as the clEnqueueWaitForEvents() function has been deprecated in OpenCL 1.2.	2013-03-31 11:59:14 -04:00
Kyle Lutz	c7a3bc8af6	Move unload_compiler() method to platform This moves the unload_compiler() method from the system class to the platform class. Also changes the method to use the clUnloadPlatformCompiler() function instead of the deprecated clUnloadCompiler() when compiling with OpenCL 1.2.	2013-03-31 11:29:40 -04:00
Kyle Lutz	d28354184c	Move get_extension_function_address() method to platform This moves the get_extension_function_address() method from the system class to the platform class. Also changes the method to use the clGetExtensionFunctionAddressForPlatform() function instead of the deprecated clGetExtensionFunctionAddress() when compiling with OpenCL 1.2.	2013-03-31 11:26:18 -04:00
Kyle Lutz	00fcb737cc	Fix bug in move-constuctor for vector<T> This fixes a bug in the move-constuctor for the vector<T> class. Previously, the moved-from object was also deallocating the memory buffer leading to an error when the moved-to object attempted to use it. Now, the constructor checks if the buffer is non-empty before deallocating it.	2013-03-30 19:55:51 -04:00
Kyle Lutz	1161f89031	Make get_object_info() inline This marks the get_object_info() method as inline.	2013-03-30 19:38:40 -04:00
Kyle Lutz	d585fbebad	Make stream operator for vector types inline This marks the stream operator for vector types as inline.	2013-03-30 19:35:56 -04:00
Kyle Lutz	da1d7794b5	Remove support for cl_half This removes support for cl_half (typedef'd to half_). The issue is that the cl_half type is indistinguishable from the cl_ushort type (both are typedefs for uint16_t) which caused the cl_khr_fp16 pragma to be injected into kernels using cl_ushort which causes errors on platforms that do not support the cl_khr_fp16 extension.	2013-03-27 00:09:51 -04:00
Kyle Lutz	4338b311f7	Add device::partition() method This adds a new set of methods to the device class allowing device objects to be partitioned into multiple sub-devices using the clCreateSubDevices() function. For now, device partitioning is only supported on systems with OpenCL version 1.2 (or later).	2013-03-26 23:36:54 -04:00
Kyle Lutz	4752fb2404	Support returning std::vector<T> from get_info<T>() This adds support for returning a std::vector<T> from the various get_info<T>() methods. This provides a simpler interface to get the values in an array returned from one of the clGet*Info() functions. This also adds a test using the new API to get the maximum work item sizes in each dimension for a device.	2013-03-26 22:44:56 -04:00
Denis Demidov	2df059738a	Option to not retain program on creation	2013-03-25 20:59:50 -04:00
Kyle Lutz	ab0575dd0c	Fix bug in remove_if() algorithm This fixes a bug in which the remove_if() function would overwrite parts of the input before they were properly copied to the output range. This is fixed by first copying the input values to a temporary vector and then passing that as the input range to copy_if().	2013-03-21 23:23:39 -04:00
Kyle Lutz	147baa05fe	Fix bug in count_if() and find_if() with Intel CPUs This fixes a bug in which the Intel OpenCL compiler would fail to compile the count_if() and find_if() kernels for vector types with the following error: error: no matching function for call to 'all' note: candidate function not viable: 1st argument ('__global int4') is in address space 16776960, but parameter must be in address space 0 This is caused when the predicate compares a value from the input buffer (in the global memory space) to a literal value (in the private memory space). This is fixed by first reading the value into a local variable in the private memory space and then calling the predicate function.	2013-03-21 22:46:01 -04:00
Kyle Lutz	25b71579a5	Fix bug when calling fill() with vector types This fixes a bug in which the fill() algorithm was called by scan_impl() with an integer zero rather than zero of the value type which caused issues when using scan() with vector values.	2013-03-20 18:10:48 -04:00
Denis Demidov	6766b07fad	Allow buffer and queue initialization from lowlevel types	2013-03-19 17:18:43 +04:00
Kyle Lutz	2d81f561c4	Add zip_iterator class This adds a zip_iterator class which allows for one or more iterators to be combined into a single iterator object.	2013-03-17 23:42:56 -04:00
Kyle Lutz	ada2351812	Add support for boost::tuple<> This adds support for using boost::tuple<> types with the Boost.Compute containers and algorithms.	2013-03-17 23:28:07 -04:00
Kyle Lutz	ad9309740e	Add meta_kernel::inject_type<Type>() method This adds a new method which allows for type definitions and type pragmas to be added to a meta_kernel. This provides a more generic and general interface and replaces the previously used add_pair_type() method along with the special case handling of half and double types.	2013-03-17 23:20:19 -04:00
Kyle Lutz	2f81872403	Pass binary_status argument to clCreateProgramWithBinary() This fixes a bug in which certain platforms would return CL_INVALID_VALUE from clCreateProgramWithBinary() if the binary_status argument was not provided.	2013-03-17 22:20:32 -04:00
Kyle Lutz	233df55978	Fix bug in program::binary() This fixes a bug in the program::binary() method in which the return size was not being passed to clGetProgramInfo().	2013-03-17 22:17:28 -04:00
Kyle Lutz	32a1926f6b	Fix local array size in serial_insertion_sort() This fixes the local array allocation size for the serial_insertion_sort() function.	2013-03-14 21:27:00 -04:00
Kyle Lutz	77e75bd4cc	Remove unused variable in serial_insertion_sort() This removes the unused 'op' variable from the serial_insertion_sort() function.	2013-03-14 21:22:40 -04:00
Kyle Lutz	ff204e1b61	Add asserts for null host pointers to command_queue This adds assert()'s to the read and write methods in the command_queue class to check for null host_ptr's.	2013-03-12 22:36:36 -04:00
Kyle Lutz	35984ae412	Remove default type_name_trait::value() implementation This removes the default type_name_trait::value() function implementation. Previously, the default implementation would return a null pointer leading to run-time errors if a type name was not provided. Now, a compile-time error will occur if type_name() is called for an unknown type.	2013-03-12 20:29:44 -04:00
Kyle Lutz	71df0d5fa6	Add type_name() specialization for char This adds a type_name() specialization for char which is different than the built-in cl_char type and thus was not covered before.	2013-03-12 20:23:33 -04:00
Kyle Lutz	418468cc4b	Add lambda wrapper for length() function This adds a lambda wrapper for the length() function which allows it to be used in lambda expressions.	2013-03-10 20:16:14 -04:00
Kyle Lutz	69aef15cab	Add merge() algorithm This implements the merge() algorithm which merges two ranges of sorted values into a single sorted range. The current implementation uses a simple serial merge algorithm. A GPU optimized version is coming soon.	2013-03-10 20:10:58 -04:00
Kyle Lutz	d34cdaac59	Initial commit	2013-03-02 15:14:17 -05:00

... 12 13 14 15 16

791 Commits