mirror of https://github.com/boostorg/compute.git synced 2026-01-27 18:52:15 +00:00

Go to file

Kyle Lutz a2bda0610d Fix memory issues with device_ptr and allocator

This fixes a few memory handling issues between device_ptr,
buffer_iterator, buffer_value, allocator, and malloc/free.

Previously, memory buffers that were allocated by allocator and
malloc were being retained (via clRetainMemObject() in buffer's
constructor) by device_ptr, buffer_iterator and buffer_value.

Now, false is passed for the retain parameter to buffer's
constructor so that the buffer's reference count is not
incremented. Furthermore, the classes now set the buffer to
null before being destructed so that they will not decrement its
reference count (which normally occurs buffer's destructor).

The main effect of this change is that objects which refer to a
memory buffer but do not own it (e.g. device_ptr, buffer_iterator)
will not modify the reference count for the buffer. This fixes a
number of memory leaks which occured in longer running programs.

2013-05-13 22:27:02 -04:00

cmake

Using FindOpenCL module from VexCL

2013-04-23 20:03:38 -04:00

doc

Rename kernel::num_args() method to arity()

2013-05-11 20:15:00 -04:00

example

Fix indexing bug in list_devices example

2013-05-11 20:11:58 -04:00

include/boost

Fix memory issues with device_ptr and allocator

2013-05-13 22:27:02 -04:00

perf

Using FindOpenCL module from VexCL

2013-04-23 20:03:38 -04:00

test

Remove simple_histogram from flat_map test

2013-05-11 20:17:24 -04:00

.gitignore

Initial commit

2013-03-02 15:14:17 -05:00

CMakeLists.txt

Using FindOpenCL module from VexCL

2013-04-23 20:03:38 -04:00

Jamroot

Initial commit

2013-03-02 15:14:17 -05:00

LICENSE_1_0.txt

Initial commit

2013-03-02 15:14:17 -05:00

README.md

Update README file

2013-04-20 22:29:47 -04:00

README.md

Boost.Compute

Boost.Compute is a GPU/parallel-computing library for C++ based on OpenCL.

The core library is a thin C++ wrapper over the OpenCL C API and provides access to compute devices, contexts, command queues and memory buffers.

On top of the core library is a generic, STL-like interface providing common algorithms (e.g. transform(), accumulate(), sort()) along with common containers (e.g. vector<T>, flat_set<T>). It also features a number of extensions including parallel-computing algorithms (e.g. exclusive_scan(), scatter(), reduce()) and a number of fancy iterators (e.g. transform_iterator<>, permutation_iterator<>, zip_iterator<>).

The full documentation is available at http://kylelutz.github.io/compute/.

Example

The following example shows how to sort a vector of floats on the GPU:

#include <vector>
#include <algorithm>
#include <boost/compute.hpp>

namespace compute = boost::compute;

int main()
{
    // get the default compute device
    compute::device gpu = compute::system::default_device();

    // create a compute context and command queue
    compute::context ctx(gpu);
    compute::command_queue queue(ctx, gpu);

    // generate random numbers on the host
    std::vector<float> host_vector(1000000);
    std::generate(host_vector.begin(), host_vector.end(), rand);

    // create vector on the device
    compute::vector<float> device_vector(1000000, ctx);

    // copy data to the device
    compute::copy(
        host_vector.begin(),
        host_vector.end(),
        device_vector.begin(),
        queue
    );

    // sort data on the device
    compute::sort(
        device_vector.begin(),
        device_vector.end(),
        queue
    );

    // copy data back to the host
    compute::copy(
        device_vector.begin(),
        device_vector.end(),
        host_vector.begin(),
        queue
    );

    return 0;
}

Boost.Compute is a header-only library, so no linking is required. The example above can be compiled with:

g++ -I/path/to/compute/include sort.cpp -lOpenCL

More examples can be found in the tutorial.

Support

Bugs and feature requests can be reported through the issue tracker.

Also feel free to send me an email with any problems, questions, or feedback.