2
0
mirror of https://github.com/boostorg/atomic.git synced 2026-02-02 20:32:09 +00:00
Commit Graph

142 Commits

Author SHA1 Message Date
Andrey Semashev
b7dba02b73 In addition to compiler checks for may_alias support, use __has_attribute as well. 2017-05-25 12:54:34 +03:00
Andrey Semashev
7c6917948d Use BOOST_INTEL_CXX_VERSION instead of __INTEL_COMPILER to use workarounds for compiler bugs present in Boost.Config. 2017-05-21 21:44:20 +03:00
Andrey Semashev
11967cc8e2 Mark storage type capable of aliasing other types and simplify CAS when possible
This is an attempt to improve generated code in the calling application that
involves CAS in a tight loop. The neccessity to cast between the value type and
the storage type for the `expected` argument results in inefficient code
that involves copying of the expected value and also saving the CAS result on
the stack. This has been observed at least with gcc 6.3 with a tight loop
on the user's side.

When we can ensure that the storage type can safely alias other types, and the
value type has the same size as the storage type, we can simplify CAS by
performing type punning on the `expected` reference instead of copying it back
and forth.
2017-05-21 17:50:16 +03:00
Andrey Semashev
cf3c4a2d5a Fixed a typo. 2017-04-04 15:54:53 +03:00
Andrey Semashev
f1dc715e41 Added __ARM_ARCH_8A__ to the preprocessor check for ARM. 2017-04-04 15:53:18 +03:00
Andrey Semashev
6d40529a58 Switch back to Boost.TypeTraits for integral traits because libstdc++ doesn't consider __int128 an integral type. 2017-04-03 15:11:30 +03:00
Andrey Semashev
11c785768c Updated to reflect changes from P0558R1 accepted into C++17.
1. Expose value_type and difference_type (where present) to user's code.

2. Prohibit arithmetic operations on pointers to non-object types. In
   particular, arithmetic operations such as fetch_add/fetch_sub will no longer
   compile for pointers to cv void, pointers to functions and pointers to
   non-static class members.

Also, use C++11 <type_traits> when possible instead of Boost.TypeTraits to
reduce dependencies. Cleaned up value_arg_type internal type usage for more
efficient argument passing.
2017-04-01 18:29:26 +03:00
Andrey Semashev
a67cc1b055 Corrected register usage in x86 DCAS asm blocks.
In some of the asm blocks eax was modified as a result of cmpxchg8b but that
was not reflected in the register constraints. This could cause incorrect code
being generated.
2017-01-08 18:09:12 +03:00
Andrey Semashev
2b6ef06b57 Added preliminary support for MSVC for ARM64 target. 2017-01-07 21:27:09 +03:00
Andrey Semashev
e6e6a04114 Changed taking address of the values involved in bitwise_cast to avoid gcc warnings. 2017-01-06 16:35:57 +03:00
Andrey Semashev
07be21c7e8 Silenced some MSVC warnings. 2016-11-06 21:29:33 +03:00
Andrey Semashev
836381dcd1 Added workarounds for Oracle Studio 12.5 on x86. 2016-10-09 19:15:42 +03:00
Andrey Semashev
336a349b9f Added is_always_lock_free static constant from C++17. 2016-10-09 17:53:55 +03:00
Andrey Semashev
e450653451 Extracted x86 mfence detection to caps headers.
Also, for non-gcc compilers which do not allow to auto-detect mfence availability (e.g. Oracle Studio) the instruction is assumed to be supported (since SSE2 is supported by virtually every x86 CPU now). This can be changed by defining BOOST_ATOMIC_NO_MFENCE.
2016-09-15 01:05:51 +03:00
Andrey Semashev
dce87d3c0a Changes for better compatibility with Oracle Studio 12.5.
1. Although the compiler is supposed to support __atomic and __sync intrinsics like gcc, it does not define any macros allowing to detect that. In particular, it is not possible to deduce what sizes of atomic operands are supported in hardware except to check the arch macros.
2. The compiler does not define any macros that allow to deduce the target x86 CPU, which makes it impossible to know whether the CPU supports cmpxchg8b/cmpxchg16b/mfence.

Because of that this commit changes handling of this compiler the following way:

1. On SPARC the compiler will use the gcc_sparc backend. Also, enabled the backend for the SPARCv8+ architecture as it appears to be almost a 32-bit equivalent to SPARCv9 and does support cas/casx instructions.
2. On x86 the compiler will use the gcc_x86 backend. By default cmpxchg8b/cmpxchg16b are assumed to be supported unless BOOST_ATOMIC_NO_CMPXCHG8B/BOOST_ATOMIC_NO_CMPXCHG16B is defined. The mfence instruction requires SSE2 and although ubiquitous these days, it will still be detected as not supported for now.
2016-09-15 00:46:26 +03:00
Andrey Semashev
4274a78f46 Corrected typos that prevented compilation. 2016-09-14 20:36:12 +03:00
Jeremy W. Murphy
0abd9bfd73 Fix some unused parameter warnings. 2015-12-15 20:56:58 +11:00
Andrey Semashev
430169a53c Removed union_cast in favor of memcpy-based implementation and renamed the cast to bitwise_cast. Use __builtin_mem* instead of regular function calls when possible. This saves inclusion of <cstring> and possibly generates better code when -fno-builtin is used. The net result of these changes is no UB that union_cast was introducing while hopefully retaining the same performance. 2015-12-02 20:04:10 +03:00
Andrey Semashev
c989d936c5 Renamed casts.hpp to bitwise_cast.hpp. 2015-12-02 18:36:49 +03:00
Andrey Semashev
8bbf5e19db Corrected compiler check for BOOST_ATOMIC_DETAIL_NO_ASM_IMPLIED_ZERO_DISPLACEMENTS. 2015-11-08 17:24:30 +03:00
Andrey Semashev
6e2d7b28de Restored old versions of DCAS operations that use an extra register for compatibility with old binutils, which apparently are used on OS X with gcc 4.2. 2015-11-04 15:25:12 +03:00
Andrey Semashev
ec4738a8eb Workaround for MSVC-12 ICE. 2015-08-26 23:27:13 +03:00
Andrey Semashev
e3aac4bc65 Simplified memory references in assembler blocks which should reduce the typical number of used registers by one. 2015-08-23 03:27:37 +03:00
Andrey Semashev
4462c1cdf4 Fixed compilation with gcc 4.4. 2015-08-23 02:05:35 +03:00
Andrey Semashev
18fdcb1747 Fixed compilation errors with MSVC caused by using types with large alignment values as function arguments. Alignment is now explicitly specified for all storage types to avoid surprises. Optimized CAS-based exchange implementation on x86. Relaxed initial loads in CAS-based operations as this load is not required to be accurate. 2015-08-23 01:04:08 +03:00
Tim Blechmann
f0f1ef228c atomic: fix compilation on msvc 8 / Windows CE / arm 2015-08-22 10:17:02 +02:00
Andrey Semashev
ff91811c1a Fixes #10994. Fixed compilation with gcc 4.4 when cmpxchg16b is used. 2015-08-17 23:27:37 +03:00
Andrey Semashev
a8661d4cc2 Corrected memory barriers in store, load and thread_fence. Added indentation to the generated assembler code. 2015-08-15 00:29:43 +03:00
Andrey Semashev
e6a188d165 Experimental support for IBM XL C++ compilers for PowerPC. 2015-08-13 23:37:33 +03:00
Andrey Semashev
fcff86225e Added workarounds for nvcc - don't use cc clobber and constraint alternatives in asm statements. Hopefully, should fix #10625. 2014-12-20 16:35:03 +03:00
Andrey Semashev
fe8f1dc4e0 Use #pragma intrinsic only with MSVC. 2014-10-01 13:30:37 +04:00
Andrey Semashev
5f6e9689a3 Silenced MSVC warnings about unused parameters. 2014-10-01 13:02:15 +04:00
Andrey Semashev
415db70547 Added support for extending operations to GCC atomic backend.
Fix for #10446. Some platforms (e.g. Raspberry Pi) only support atomic ops of some particular size but not less. Use extending arithmetic operations for these platforms. Also, make sure bools are always treated as 8-bit values, even if the actual type is larger. This makes its use in atomic<>, atomic_flag and lock pool more consistent.
2014-09-27 20:40:09 +04:00
Andrey Semashev
8f10f20359 Use #pragma intrinsic only with MSVC. 2014-08-30 14:09:18 +04:00
Andrey Semashev
7f17e3b9a3 Use #pragma intrinsic only with MSVC.
Intel compiler doesn't support and doesn't need #pragma intrinsic and
issues warnings about it. Based on changes to boost/detail/
2014-08-24 00:02:16 +04:00
Andrey Semashev
4dee330229 Added support for types with non-trivial default constructors. 2014-07-07 22:40:41 +04:00
Andrey Semashev
6a184d95dd Renamed union_cast.hpp -> casts.hpp. 2014-07-07 21:51:06 +04:00
Andrey Semashev
00c2879066 Made sure that memory_order_consume is equivalent to memory_order_acquire on SPARC. Also make signal fences more strict in case of memory_order_consume since the standard requires them to be equivalent to thread fences except for thread synchronization instructions. 2014-07-07 21:48:53 +04:00
Andrey Semashev
0c9b1b3806 Reworked platform selection, fixed Windows backend.
Platform selection now works in two stages. First compiler is tested for
the supported configuration. If that fails, OS is tested. Lastly, if
nothing succeeded, emulation backend is selected.
2014-06-09 01:07:26 +04:00
Andrey Semashev
b44027a58e Removed dependency on Boost.Detail. Fixed compilation issues with MSVC
7.1
2014-06-09 00:01:12 +04:00
Andrey Semashev
a6ec366f02 Fixed compilation failure due to a missing argument name. 2014-06-04 21:41:06 +04:00
Andrey Semashev
e0f038e92a Fixed compilation failure due to a missing argument name. 2014-06-04 21:38:15 +04:00
Andrey Semashev
21e497c01b Corrected hw memory barriers for memory_order_consume. 2014-06-03 23:57:52 +04:00
Andrey Semashev
d4ee3a3047 Added a comment about the reason why we promote memory_order_consume to memory_order_acquire. 2014-06-03 23:30:21 +04:00
Andrey Semashev
e55bf17309 Changed memory_order_consume handling on ARM and PPC.
memory_order_consume is promoted to memory_order_acquire on these
architectures as they have a weaker memory model than other
architectures. GCC seems to behave the same way. Added nonessential
checks to compiler barriers so that the behavior is closer to thread
fences.
2014-06-01 20:03:53 +04:00
Andrey Semashev
abdce8f1bb Changed memory_order constant values to simplify checks for success/failure orders in compare_exchange methods. Fixed incorrect cas_failure_order_must_not_be_stronger_than_success_order check. 2014-06-01 02:10:44 +04:00
Andrey Semashev
a721aa468f Removed scratch argument for asm block in non-PIC mode. 2014-05-26 23:21:01 +04:00
Andrey Semashev
97ae84a474 Adjusted the workaround for ldrexd/strexd instruction problem with register pairs allocation. The problem only affects ARMv6k in Thumb mode, so enable these instructions for ARM mode. Also don't base the decision on the compiler version, it is probably not relevant. Updated lockfree test to reflect the recent changes related to ARM backends. 2014-05-23 21:02:07 +04:00
Andrey Semashev
deddca6918 Fixed a few bugs with ARM assembler: incorrect comparisons in compare_exchange, compilation failures due to insufficient registers in some optimization modes. Eventually I had to disable 64-bit atomic ops for gcc 4.7 because it is not able to allocate register pairs suitable for ldrexd/strexd instructions. 2014-05-22 01:01:46 +04:00
Andrey Semashev
a71a23694b Fixed typo in a comment. 2014-05-21 00:36:43 +04:00