mirror of https://github.com/boostorg/compute.git synced 2026-02-17 13:42:17 +00:00
Commit Graph

1166 Commits

Author SHA1 Message Date
Jakub Szuppe
790dcab387 Fix benchmark for std::partial_sum()
In the partial_sum benchmark, STL std::partial_sum() should not
be run in-place, since the benchmarks for other libs are not run
in-place.
2016-07-25 22:37:17 +02:00
Jakub Szuppe
32dc57bbf9 Add scan for multicore CPUs 2016-07-25 22:37:17 +02:00
Jakub Szuppe
cd5bb606ec Rename scan_on_cpu to serial_scan 2016-07-25 15:28:19 +02:00
Kyle Lutz
1d2f8cfbd9 Merge pull request #633 from haahh/pr_find_extrema_cpu
find_extrema for CPUs
2016-07-20 22:23:51 -07:00
Kyle Lutz
813a63c4ca Merge pull request #632 from haahh/pr_various_test_fixes
Various fixes
2016-07-20 22:16:03 -07:00
Jakub Szuppe
06124180dc Fix find_extrema_with_reduce()
Fixes the wrong type of the variable representing the index.
2016-07-17 13:24:07 +02:00
Jakub Szuppe
934257b97d Add more tests for find_extrema algorithms 2016-07-17 13:24:07 +02:00
Jakub Szuppe
4c21633bf9 Fix vector copy ctor when using non-default context
Now the correct queue (from the correct context) is used in the copy
constructor of boost::compute::vector.
2016-07-17 00:01:47 +02:00
Jakub Szuppe
2af23fa7e9 Change various tests to work with non-default context
Now all tests work even if Context::context, Context::device, and
Context::queue are not the default context, device, and queue. This
is required for developing better tests in the future.

Note: Some tests may work only for the default context/queue/device,
since the classes they test work only for the default context/queue/device.
There are two solutions to this problem: either those tests run
on the default queue (no matter what), or they do not run when
Context::context is not the default context. See test_string.cpp.
2016-07-16 20:30:28 +02:00
Jakub Szuppe
fb641de7cc Add operator= for vectors with different allocator
Now code: `vector<T> a(context); vector<T, OtherAlloc> b(context);
a = b;` works.
2016-07-16 20:14:14 +02:00
Jakub Szuppe
8ae5cc9117 Array should always use queue from its context
Until now boost::compute::array worked correctly only
for default context (global) since it was using default
queue (global) for all its operations. Now it uses queue
from its context.
2016-07-16 19:30:25 +02:00
Jakub Szuppe
b810d12aea Making sure serial_reduce_threshold is in the right range
The serial_reduce_threshold parameter for reducing on CPU
should always be greater than or equal to the number of compute units
(cores) in the CPU.
2016-07-16 13:02:34 +02:00
Jakub Szuppe
04b6e8fba3 find_extrema() optimized for multicore CPUs 2016-07-16 13:02:34 +02:00
Kyle Lutz
d303097000 Merge pull request #626 from haahh/pr_transform_cpu
Transform for multicore CPUs
2016-07-11 20:16:53 -07:00
Kyle Lutz
6e3213bfcf Merge pull request #628 from haahh/pr_reduce_cpu
Reduce for CPU
2016-07-11 20:16:19 -07:00
Kyle Lutz
3bcc318609 Merge pull request #631 from thtrummer/fix-uint-typo
Fix build error due to possible typo (on macOS)
2016-07-11 20:01:43 -07:00
Kyle Lutz
b1d3e0af83 Merge pull request #629 from thtrummer/develop
Add missing include for std::cerr (on macOS)
2016-07-11 20:01:13 -07:00
Thomas Trummer
a800dd33a1 Fix build error due to possible typo (on macOS) 2016-07-10 19:54:45 +02:00
Thomas Trummer
68d373c38a Add missing include for std::cerr (on macOS) 2016-07-10 18:38:23 +02:00
Jakub Szuppe
a10e7d31f7 Disable CPU-optimized transform/copy_on_device() on Apple
Yet another bug on Apple OpenCL Platform.
2016-07-10 16:17:48 +02:00
Jakub Szuppe
4093481920 Remove unnecessary check 2016-07-10 13:56:39 +02:00
Jakub Szuppe
97f0225072 Fix ‘local_vals_arg’ may be uninitialized warning 2016-07-09 14:39:47 +02:00
Jakub Szuppe
585eb747a0 More tests for adjacent difference 2016-07-09 14:11:01 +02:00
Jakub Szuppe
9e2fc74826 Use serial_reduce for small inputs
For inputs smaller than <serial_reduce_threshold>, the
serial_reduce algorithm is used.
2016-07-09 14:06:06 +02:00
Jakub Szuppe
a085e6b3e1 Reduce for multicore CPUs 2016-07-09 14:06:06 +02:00
Jakub Szuppe
34c476c87a Transform/copy on device optimized for CPUs 2016-07-06 19:23:30 +02:00
Kyle Lutz
a3f72e6191 Merge pull request #622 from haahh/pr_adjacent_diff_fix
Fix adjacent_difference when first == result
2016-07-05 16:52:40 -07:00
Jakub Szuppe
c6123c403c Remove unused function 2016-07-05 13:40:42 +02:00
Jakub Szuppe
92a16e459c Fix adjacent_difference when first == result
The situation where first == result was only covered
when the binary function was not specified.
2016-07-01 21:25:46 +02:00
Kyle Lutz
e5a9017d69 Merge pull request #620 from haahh/pr_pocl_fix_llvm_apt
Temp. fix for llvm apt being down
2016-06-25 09:15:49 -07:00
Kyle Lutz
b3633eb54b Merge pull request #619 from haahh/pr_comp_sort
Comparison sort (for GPUs)
2016-06-23 20:24:38 -07:00
Jakub Szuppe
a721db2f72 Temp. fix for llvm apt being down
See https://github.com/travis-ci/travis-ci/issues/6120
2016-06-23 15:56:36 +02:00
Jakub Szuppe
5d31bd0f07 Add tests for stable comp. sort and sort-by-key 2016-06-22 17:06:12 +02:00
Jakub Szuppe
c47753c4fa Stable comparison sort, sort-by-key 2016-06-22 17:06:12 +02:00
Jakub Szuppe
7996382178 Use index instead of copying vals in bitonic sort-by-key
Instead of copying values in local memory in bitonic block
sort (when sorting by key), we save and copy local index
and copy value once at the end. This saves local memory.
2016-06-22 15:59:55 +02:00
Jakub Szuppe
1311de15a3 Add benchmark for comparison sort 2016-06-22 15:59:55 +02:00
Jakub Szuppe
cc1cc0e345 Add tests for comparison sort 2016-06-21 17:41:36 +02:00
Jakub Szuppe
8f033e72ee Add comparison sort and sort by key 2016-06-21 17:41:36 +02:00
Kyle Lutz
04e7dde220 Merge pull request #617 from haahh/pr_travis_ci_boost161
Add Travis CI build with latest Boost libs
2016-06-03 20:09:32 -07:00
Jakub Szuppe
c046a5f976 Remove unnecessary source from Travis CI build 2016-06-02 20:00:23 +02:00
Jakub Szuppe
e213006ba2 Cache built Boost libraries 2016-06-02 20:00:23 +02:00
Jakub Szuppe
18146f2fb7 Add Travis CI build with the latest Boost (1.61) 2016-06-02 20:00:05 +02:00
Jakub Szuppe
f669581f02 Travis CI: move downloading cmake to the install step 2016-06-01 15:49:46 +02:00
Kyle Lutz
4c31d07130 Merge pull request #616 from haahh/pr_typesafe_copy
Type-safe copying
2016-05-31 21:26:41 -07:00
Jakub Szuppe
8f4db3d7b7 Reuse dispatch_copy_async() in dispatch_copy() 2016-05-28 22:54:51 +02:00
Jakub Szuppe
bffc9f74fb Fix async copying when first == last (input is empty) 2016-05-28 22:46:35 +02:00
Jakub Szuppe
c9a0aba172 Remove debug macro, fix names of event variables 2016-05-28 22:05:50 +02:00
Jakub Szuppe
69e09f27d8 Minor fixes in test_copy.cpp 2016-05-28 13:15:36 +02:00
Jakub Szuppe
ba0da3f3f4 Tests for copying SVM memory when types mismatch
Tests for copying SVM memory to/from/on device when
value_types of InputIterator and OutputIterator mismatch.
2016-05-28 13:15:36 +02:00
Jakub Szuppe
5c29b50823 Copying SVM to/from host by mapping SVM memory 2016-05-28 13:15:36 +02:00