compute

mirror of https://github.com/boostorg/compute.git synced 2026-01-31 20:12:23 +00:00

Author	SHA1	Message	Date
Kyle Lutz	701bc8a5f3	Add nth_element() algorithm This adds an implementation of the nth_element() algorithm. For now the algorithm is trivially implemented by calling sort().	2013-11-15 20:51:13 -08:00
Kyle Lutz	0daa62e41f	Add experimental copy_index_if() algorithm This adds an experimental algorithm like copy_if() which copies the index of the values for which predicate returns true instead of the values themselves.	2013-11-15 20:30:30 -08:00
Kyle Lutz	adde232fc8	Add context error handler This adds an error handler function which is invoked when an OpenCL context encounters an error condition. The context error is converted to a C++ exception containing the error information and thrown.	2013-11-15 20:26:01 -08:00
Kyle Lutz	953ebb4e26	Add variadic tuple support This adds support for variadic tuples on C++11 compilers.	2013-11-15 20:07:39 -08:00
Kyle Lutz	b5ff4743bb	Add field() function This adds a new function which will return the named field from a value. For example, this can be used to return one of the components of a pair object or to swizzle a vector value.	2013-11-10 15:44:45 -08:00
Kyle Lutz	8213697307	Add BOOST_COMPUTE_FUNCTION() macro This adds a new macro to ease the definition of custom user functions. The BOOST_COMPUTE_FUNCTION() macro creates a new boost::compute::function<> object with the provided return type, argument types, function name and OpenCL source code.	2013-11-10 15:32:15 -08:00
Kyle Lutz	8608e60116	Refactor invoked_function<> This refactors the invoked_function<> classes. Previously each function arity (e.g. unary, binary) had a separate invoked_function<> template class. Now they all use the same class which simplifies the logic in function<> and meta_kernel.	2013-11-10 15:31:56 -08:00
Kyle Lutz	43678410be	Fix bugs with type definitions in meta_kernel This fixes a bug in which type definitions were being inserted into meta_kernel's multiple times. Also forces zip_iterator to insert its type definitions when used in a kernel.	2013-11-10 15:13:46 -08:00
Kyle Lutz	a0b635e201	Add type_name<void>() specialization This adds a type_name<>() specialization for void types.	2013-11-10 14:35:04 -08:00
Kyle Lutz	85812f4e93	Add BOOST_COMPUTE_TYPE_NAME() macro This adds a macro for registering custom type names for C++ types to be used in OpenCL kernel code. Internally the macro specializes the type_name<T>() function.	2013-10-02 21:40:22 -04:00
Kyle Lutz	a2b7595f36	Make type_name<T>() inline This adds the inline specifier to the type_name<T>() function.	2013-10-02 21:23:09 -04:00
Kyle Lutz	feb510a019	Add unpack() function adaptor This adds a new unpack() function adaptor which converts a function with N arguments to a function which takes a single tuple argument with N components. This is useful for calling built-in functions with the tuples values returned from zip_iterator. This also removes the now un-needed binary_transform_iterator.	2013-09-24 23:05:08 -04:00
Kyle Lutz	736f3a17a6	Add min_and_max reduce() test This adds a test for computing the minimum and maximum values of a vector simultaneously using reduce() with a custom reduction function. Also fixes a bug in reduce() in which inplace_reduce() was being used even if the input type and result type differed.	2013-09-24 22:47:16 -04:00
Kyle Lutz	a1155bc343	Store source strings for binary and ternary functions This fixes an issue in which the source strings for binary and ternary functions were not being stored and thus not being inserted into kernels when they were invoked.	2013-09-24 22:42:50 -04:00
Kyle Lutz	dc6b3228eb	Add as() and convert() type-conversion functions This adds the as() and convert() functions for converting between OpenCL types.	2013-09-24 22:27:50 -04:00
Kyle Lutz	3412d0935d	Add not1() and not2() function adaptors This adds the not1() and not2() function adaptors which negate unary and binary functions respectively.	2013-09-24 22:22:52 -04:00
Kyle Lutz	07e4a6b3aa	Remove BLAS functions This removes the incomplete BLAS API functions.	2013-09-24 22:19:56 -04:00
Kyle Lutz	d16309f57e	Add program_cache This adds a program cache which can be used by algorithms and other functions to store programs which may be re-used. This improves performance by reducing the need for costly recompilation of commonly used programs. Program caches are context specific and multiple copies of the same context will use the same program cache. They are created and accessed by the global get_program_cache() function. For now, only a few algorithms and functions (radix sort, mersenne twister, fixed size sorts) make use of the program cache.	2013-09-07 22:58:34 -04:00
Kyle Lutz	d04e628367	Add experimental sort_by_transform() algorithm This adds a sort_by_transform() algorithm which sorts a sets of values based on the value of a transform function. For example, this can be used to sort a set of vectors by their length (when used with the length<T>() function) or by a single component (when used with the get<N>() function).	2013-09-07 17:10:15 -04:00
Kyle Lutz	3389a5c741	Add sort_by_key() algorithm This adds a new sort_by_key() algorithm which sorts a range of values by a range of keys with a comparison operator. For now this is only implemented by the serial insertion sort algorithm. In the future it will be ported to the other sorting algorithms (e.g. radix sort).	2013-09-07 17:02:08 -04:00
Kyle Lutz	f9d887e30d	Add experimental tabulate() algorithm This adds a tabulate() algorithm which fills a range with values calculated from a function given each elements index.	2013-09-07 16:53:08 -04:00
Kyle Lutz	a96c9c0182	Add result argument to reduce() algorithm This adds an output iterator result argument to the reduce() algorithm. Now, instead of returning the reduced result, the result is written to an output iterator. This allows the value to stay on the device and avoids a device-to-host copy in cases where the result is not needed on the host (e.g. it is part of a larger computation). This is an API breaking change to users of reduce(). Affected code should now declare a result variable and then pass a pointer to it as the new result argument.	2013-09-07 15:36:49 -04:00
Kyle Lutz	a8f4421739	Add copy() specialization for host-to-host transfers This adds a copy() specialization for host-to-host transfers which simply forwards the call to std::copy(). This is useful in templated algorithms which may in certain circumstances copy() between data ranges on the host.	2013-09-07 15:29:48 -04:00
Kyle Lutz	78a561eff1	Add scan_on_cpu() algorithm This adds a new scan_on_cpu() algorithm which implements the scan() algorithm for CPU devices. Also renames the existing scan() algorithm to scan_on_gpu(). This fixes some tests failures on POCL which were caused by the prior GPU scan() algorithm not functioning properly with POCL.	2013-09-07 15:03:42 -04:00
Kyle Lutz	518d39fc2b	Use bitwise-and to check device::type() This changes the checks for the device type to use the bitwise-and operator instead of the equaility operator. The returned type is a bitset and this would cause errors when multiple bits were set. This fixes a bug on POCL which returns the device type as a combination of CL_DEVICE_TYPE_DEFAULT and CL_DEVICE_TYPE_CPU. Now the correct device type (device::cpu) is detected for POCL.	2013-09-07 14:16:20 -04:00
Kyle Lutz	3a7b90ff06	Fix issue with comparison operators in lambda expressions This fixes an issue in which comparison operators (e.g. <, ==) in lambda expressions would return the wrong result type causing compilation errors. Also adds a few test cases to ensure the correct result type and that lambda expressions can be properly used with count_if().	2013-08-15 22:10:03 -04:00
Kyle Lutz	bacec5b8fe	Add uniform_real_distribution This adds a random number distribution which generates random numbers in a uniform distribution. Also adds a convenience algorithm which fills a range with uniformly distributed random numbers between two values.	2013-08-13 20:40:42 -04:00
Kyle Lutz	767589fe0d	Rearrange type headers This rearranges the type headers to live under the <boost/compute/types/...> directory instead of the top-level <boost/compute/...> directory.	2013-08-13 20:37:56 -04:00
Kyle Lutz	b539e8413c	Add Doxygen documentation This replaces the BoostBook/XML based reference documentation with Doxygen auto-generated documentation.	2013-07-16 21:48:16 -04:00
Kyle Lutz	b3d2fbb7eb	Add fill_async() algorithm This adds a fill_async() which fills a range with a given value asynchronously.	2013-07-02 21:57:19 -04:00
Kyle Lutz	5203506c16	Add support for on-device copy_async() This adds support for copy_async() when copying between memory objects on a compute device.	2013-07-02 21:57:19 -04:00
Kyle Lutz	8459fdeb0e	Change meta_kernel::exec*() methods to return events This changes the exec() and exec_1d() methods in the meta_kernel class to return event objects.	2013-07-02 21:57:19 -04:00
Kyle Lutz	d8f5a5b503	Change enqueue_*_buffer() methods to return events This changes the enqueue_copy_buffer() and enqueue_fill_buffer() methods in the command_queue class to return event objects.	2013-07-02 21:57:19 -04:00
Kyle Lutz	c1bf707b41	Add event::get_command_type() method This adds a get_command_type() method to the event class which returns the OpenCL type for an event object.	2013-07-02 21:57:19 -04:00
Kyle Lutz	ee5f581094	Add command_queue::enqueue_migrate_memory_objects() method This adds an enqueue_migrate_memory_objects() method to the command_queue class which allows memory objects to be migrated between compute devices and to the host.	2013-07-02 21:57:19 -04:00
Kyle Lutz	2ca028c37b	Improve reduce() performance This makes a few tweaks to the reduce() algorithm in order to improve performance. An unnecessary barrier() has been removed and now multiple values are reduced on the initial read.	2013-07-02 21:57:15 -04:00
Denis Demidov	84394de119	Get rid of type convesion warnings inside VS2010	2013-06-24 09:57:22 +02:00
Denis Demidov	b28d8697bc	Silence MSVC security warning C4996 in system.hpp	2013-06-24 09:55:40 +02:00
Denis Demidov	f5c86057a1	Get rid of clang v3.3 warning -Wconstexpr-not-const	2013-06-21 15:27:00 +04:00
Kyle Lutz	f2b812019c	Fix bugs with char/uchar/bool literals in meta_kernel This fixes a few issues that occurred when using char, uchar and bool literals with meta_kernel.	2013-06-19 23:55:22 -04:00
Kyle Lutz	e01569049b	Add type_name<bool>() specialization This adds a type_name() specialization for bool.	2013-06-19 23:48:49 -04:00
Kyle Lutz	0d285d8a30	Change meta_kernel::add_arg(name, value) to add_set_arg() This changes the meta_kernel::add_arg() overload with a name and a value to a separate method. This fixes conflict when using add_arg() with string values.	2013-06-11 21:19:47 -04:00
Kyle Lutz	7fb77ef9c5	Add test for any/all/none_if() with NaN and inf This adds a test for the any_of(), all_of() and none_of() functions with NaN and Inf values.	2013-06-11 21:16:15 -04:00
Kyle Lutz	8e51a0a162	Refactor lambda expression framework to use meta_kernel This refactors the lambda expression framework to use meta_kernel to construct kernel source code instead of using plain strings.	2013-06-11 21:14:28 -04:00
Kyle Lutz	64e94549b3	Add specialization for get<N>() with zip_iterator This adds a specialization for the get<N>() function when used with zip_iterator's. Now, only the N'th iterator for the expression will be dereferenced instead of dereferencing all of the iterators into a tuple and then extracting the N'th component.	2013-06-11 20:37:23 -04:00
Kyle Lutz	15bc98b94f	Remove cv-qualifiers from get<N>()'s value-type This removes the cv-qualifiers for the value-type returned from get<N>() expressions. This fixes issues when specializing based on the type (e.g. pair, tuple).	2013-06-11 20:29:06 -04:00
Kyle Lutz	98b593b937	Fix meta_kernel streaming operators with float This fixes a bug in the meta_kernel streaming operators with float values. Now, float scalar and vector literals are inserted into the kernel source with the proper 'f' suffix.	2013-06-11 20:23:47 -04:00
Kyle Lutz	36dd3f1306	Improve the system::find_default_device() method This makes some improvements to the system::find_default_device() method. Now, the devices on the system will only be queried once when searching for the default device. This reduces the number of calls to clGetPlatformIDs() and clGetDeviceIDs(). Also, in the case that no GPU or CPU devices are found, the first device on the system will be selected as the default device. This fixes issues when using Boost.Compute with pocl.	2013-05-24 20:07:38 -04:00
Kyle Lutz	aa7fd2f6fa	Add asserts for clRelease() functions in destructors This adds assert()'s verifying that the clRelease() functions in the destructors for the OpenCL wrapper classes return CL_SUCCESS.	2013-05-23 23:15:43 -04:00
Kyle Lutz	b5068b2027	Fix minor version macro This fixes the minor version macro.	2013-05-23 22:46:52 -04:00

1 2 3

128 Commits