mirror of https://github.com/boostorg/compute.git synced 2026-01-29 07:22:19 +00:00

Go to file

Denis Demidov 0c8a59ef84 Fix core.system test

The find_device check in core.system is invalid. It could fail when same
device is supported by several platforms. In my case this happens for
Intel CPU when both AMD and Intel platforms are installed. The CPU
returned by boost::compute::system::default_device() is served by the
AMD platform, and the CPU returned by
boost::compute::system::find_device(name) is served by Intel SDK. The
only thing that could be safely asserted here is that both devices have
the same name.

2013-12-27 11:26:25 +04:00

cmake

Using FindOpenCL module from VexCL

2013-04-23 20:03:38 -04:00

doc

Add field() function

2013-11-10 15:44:45 -08:00

example

Use bitwise-and to check device::type()

2013-09-07 14:16:20 -04:00

include/boost

Fix issues with gather() algorithm

2013-12-21 15:34:29 -08:00

perf

Add perf_cart_to_polar benchmark

2013-11-23 12:26:10 -08:00

test

Fix core.system test

2013-12-27 11:26:25 +04:00

.gitignore

Add Doxygen documentation

2013-07-16 21:48:16 -04:00

.travis.yml

Travis-CI integration

2013-06-21 15:49:32 +04:00

CMakeLists.txt

Disable MSVC secure warnings

2013-12-06 23:10:29 -08:00

Jamroot

Initial commit

2013-03-02 15:14:17 -05:00

LICENSE_1_0.txt

Initial commit

2013-03-02 15:14:17 -05:00

README.md

Add Help Wanted section to README

2013-11-23 12:06:32 -08:00

README.md

Boost.Compute

Boost.Compute is a GPU/parallel-computing library for C++ based on OpenCL.

The core library is a thin C++ wrapper over the OpenCL C API and provides access to compute devices, contexts, command queues and memory buffers.

On top of the core library is a generic, STL-like interface providing common algorithms (e.g. transform(), accumulate(), sort()) along with common containers (e.g. vector<T>, flat_set<T>). It also features a number of extensions including parallel-computing algorithms (e.g. exclusive_scan(), scatter(), reduce()) and a number of fancy iterators (e.g. transform_iterator<>, permutation_iterator<>, zip_iterator<>).

The full documentation is available at http://kylelutz.github.io/compute/.

Example

The following example shows how to sort a vector of floats on the GPU:

#include <vector>
#include <algorithm>
#include <boost/compute.hpp>

namespace compute = boost::compute;

int main()
{
    // get the default compute device
    compute::device gpu = compute::system::default_device();

    // create a compute context and command queue
    compute::context ctx(gpu);
    compute::command_queue queue(ctx, gpu);

    // generate random numbers on the host
    std::vector<float> host_vector(1000000);
    std::generate(host_vector.begin(), host_vector.end(), rand);

    // create vector on the device
    compute::vector<float> device_vector(1000000, ctx);

    // copy data to the device
    compute::copy(
        host_vector.begin(),
        host_vector.end(),
        device_vector.begin(),
        queue
    );

    // sort data on the device
    compute::sort(
        device_vector.begin(),
        device_vector.end(),
        queue
    );

    // copy data back to the host
    compute::copy(
        device_vector.begin(),
        device_vector.end(),
        host_vector.begin(),
        queue
    );

    return 0;
}

Boost.Compute is a header-only library, so no linking is required. The example above can be compiled with:

g++ -I/path/to/compute/include sort.cpp -lOpenCL

More examples can be found in the tutorial.

Support

Bugs and feature requests can be reported through the issue tracker.

Also feel free to send me an email with any problems, questions, or feedback.

Help Wanted

The Boost.Compute project is currently looking for additional developers with interest in parallel computing.

Please send an email to Kyle Lutz (kyle.r.lutz@gmail.com) for more information.