2
0
mirror of https://github.com/boostorg/atomic.git synced 2026-02-02 08:22:08 +00:00
Commit Graph

164 Commits

Author SHA1 Message Date
Andrey Semashev
bbd32145ef Use BOOST_MAY_ALIAS from Boost.Config. 2017-07-29 16:54:52 +03:00
Andrey Semashev
6a5928759a Added extra ops for PPC. Added 8 and 16-bit ops for PPC8+.
The new implementation of 8 and 16-bit ops uses the lbarx/stbcx and
lharx/sthcx instructions available in Power8 and later architectures.
This allows to use smaller storage types, similar to those used by
compiler intrinsics.

Also added detection of 128-bit instructions lqarx/stqcx, which can
later be used to implement 128-bit ops.
2017-07-17 00:39:20 +03:00
Andrey Semashev
c4f757d6bb Added extra ops implementation for ARM on gcc. 2017-07-16 18:15:18 +03:00
Andrey Semashev
a41cdf093e Extracted common hardware capabilities macros to separate headers.
Also refactored ARM hwcaps detection a bit and fixed compilation for
generic ARMv7, which does not provide 64-bit ldrexd/strexd.
2017-07-14 22:43:19 +03:00
Andrey Semashev
b2c6d37a1a Added byte/word-wide implementations of atomic ops on ARM.
Use ldrexb/w and strexb/w on ARMv7 and later to implement byte/word-wide
atomic ops. On the older ARM versions we still have to use 32-bit
widening implementation.

Also allowed immediate constants in some of the operations to improve
generated code.

Common ARM code extracted to a separate header to reuse with extra ops.
2017-07-14 19:25:01 +03:00
Andrey Semashev
ffb529c4c3 Silence MSVC warnings. 2017-07-11 23:00:52 +03:00
Andrey Semashev
4f38bb4f85 Added opaque variants of negate and complement ops for MSVC. 2017-07-11 23:00:52 +03:00
Andrey Semashev
538b411c4a Added MSVC backend for extra operations. 2017-07-11 23:00:52 +03:00
Andrey Semashev
d375cf0dff Modified 8 and 16-bit CAS loops to employ register renaming. 2017-07-11 23:00:52 +03:00
Andrey Semashev
e30c54754b Replaced fixed temporary register in CAS loops with a temporary variable
This allows for more flexibility in register allocation and potentially
more efficient code. Also, the temporary register was not exactly
customizable in the previous code, so it should have been cleaned up
anyway.
2017-07-11 23:00:52 +03:00
Andrey Semashev
537656d67d Reworked extra operations definition.
In order to support more flexible definition of the extra operations for
different platforms, define extra_operations as an addon to the existing
operations template. The extra_operations template will be used only by
the non-standard operations added by Boost.Atomic.
2017-07-11 23:00:52 +03:00
Andrey Semashev
6d7c0ec2ee Added generic implementation of extended ops. Reorganized platform macros. 2017-07-11 23:00:52 +03:00
Andrey Semashev
17ebc37de8 Fixed incorrect asm constraints. Also optimized 64-bit asm to allow 32-bit immediates. 2017-07-11 23:00:52 +03:00
Andrey Semashev
ce42659b6d Fixed compilation issues and incorrect use of cmpxchg. Added extended ops to the interface. 2017-07-11 23:00:52 +03:00
Andrey Semashev
42024329d7 Corrected constant constraints for bit test ops. 2017-07-11 23:00:52 +03:00
Andrey Semashev
5c554ee284 Preliminary version of extended atomic ops for x86. 2017-07-11 23:00:52 +03:00
Andrey Semashev
a6b01a8429 Disable cast-based implementation for MSVC because it generates broken code sometimes. 2017-07-11 23:00:02 +03:00
Andrey Semashev
b23c406f94 Converted Unicode characters in comments to ASCII. 2017-06-25 16:22:44 +03:00
Andrey Semashev
4ee227c6f2 Simplified use of __has_attribute. Also use __may_alias__ instead of may_alias. 2017-05-29 12:02:04 +03:00
Andrey Semashev
0786398120 Silence bogus gcc warnings about missing struct member initializers. 2017-05-29 11:49:27 +03:00
Andrey Semashev
be5ed8a1c5 Fixed compilation with gcc <= 4.9. Renamed BOOST_ATOMIC_DETAIL_STORAGE_MAY_ALIAS to BOOST_ATOMIC_DETAIL_STORAGE_TYPE_MAY_ALIAS as it is a more precise naming. 2017-05-27 18:11:06 +03:00
Andrey Semashev
7919698b2a Changed is_lock_free() implementation to always use is_always_lock_free.
This is to follow C++17, which says is_lock_free(), for any object of a given
atomic<> type, must return values consistent with is_always_lock_free.
2017-05-25 13:38:39 +03:00
Andrey Semashev
b7dba02b73 In addition to compiler checks for may_alias support, use __has_attribute as well. 2017-05-25 12:54:34 +03:00
Andrey Semashev
7c6917948d Use BOOST_INTEL_CXX_VERSION instead of __INTEL_COMPILER to use workarounds for compiler bugs present in Boost.Config. 2017-05-21 21:44:20 +03:00
Andrey Semashev
11967cc8e2 Mark storage type capable of aliasing other types and simplify CAS when possible
This is an attempt to improve generated code in the calling application that
involves CAS in a tight loop. The neccessity to cast between the value type and
the storage type for the `expected` argument results in inefficient code
that involves copying of the expected value and also saving the CAS result on
the stack. This has been observed at least with gcc 6.3 with a tight loop
on the user's side.

When we can ensure that the storage type can safely alias other types, and the
value type has the same size as the storage type, we can simplify CAS by
performing type punning on the `expected` reference instead of copying it back
and forth.
2017-05-21 17:50:16 +03:00
Andrey Semashev
cf3c4a2d5a Fixed a typo. 2017-04-04 15:54:53 +03:00
Andrey Semashev
f1dc715e41 Added __ARM_ARCH_8A__ to the preprocessor check for ARM. 2017-04-04 15:53:18 +03:00
Andrey Semashev
6d40529a58 Switch back to Boost.TypeTraits for integral traits because libstdc++ doesn't consider __int128 an integral type. 2017-04-03 15:11:30 +03:00
Andrey Semashev
11c785768c Updated to reflect changes from P0558R1 accepted into C++17.
1. Expose value_type and difference_type (where present) to user's code.

2. Prohibit arithmetic operations on pointers to non-object types. In
   particular, arithmetic operations such as fetch_add/fetch_sub will no longer
   compile for pointers to cv void, pointers to functions and pointers to
   non-static class members.

Also, use C++11 <type_traits> when possible instead of Boost.TypeTraits to
reduce dependencies. Cleaned up value_arg_type internal type usage for more
efficient argument passing.
2017-04-01 18:29:26 +03:00
Andrey Semashev
a67cc1b055 Corrected register usage in x86 DCAS asm blocks.
In some of the asm blocks eax was modified as a result of cmpxchg8b but that
was not reflected in the register constraints. This could cause incorrect code
being generated.
2017-01-08 18:09:12 +03:00
Andrey Semashev
2b6ef06b57 Added preliminary support for MSVC for ARM64 target. 2017-01-07 21:27:09 +03:00
Andrey Semashev
e6e6a04114 Changed taking address of the values involved in bitwise_cast to avoid gcc warnings. 2017-01-06 16:35:57 +03:00
Andrey Semashev
07be21c7e8 Silenced some MSVC warnings. 2016-11-06 21:29:33 +03:00
Andrey Semashev
836381dcd1 Added workarounds for Oracle Studio 12.5 on x86. 2016-10-09 19:15:42 +03:00
Andrey Semashev
336a349b9f Added is_always_lock_free static constant from C++17. 2016-10-09 17:53:55 +03:00
Andrey Semashev
e450653451 Extracted x86 mfence detection to caps headers.
Also, for non-gcc compilers which do not allow to auto-detect mfence availability (e.g. Oracle Studio) the instruction is assumed to be supported (since SSE2 is supported by virtually every x86 CPU now). This can be changed by defining BOOST_ATOMIC_NO_MFENCE.
2016-09-15 01:05:51 +03:00
Andrey Semashev
dce87d3c0a Changes for better compatibility with Oracle Studio 12.5.
1. Although the compiler is supposed to support __atomic and __sync intrinsics like gcc, it does not define any macros allowing to detect that. In particular, it is not possible to deduce what sizes of atomic operands are supported in hardware except to check the arch macros.
2. The compiler does not define any macros that allow to deduce the target x86 CPU, which makes it impossible to know whether the CPU supports cmpxchg8b/cmpxchg16b/mfence.

Because of that this commit changes handling of this compiler the following way:

1. On SPARC the compiler will use the gcc_sparc backend. Also, enabled the backend for the SPARCv8+ architecture as it appears to be almost a 32-bit equivalent to SPARCv9 and does support cas/casx instructions.
2. On x86 the compiler will use the gcc_x86 backend. By default cmpxchg8b/cmpxchg16b are assumed to be supported unless BOOST_ATOMIC_NO_CMPXCHG8B/BOOST_ATOMIC_NO_CMPXCHG16B is defined. The mfence instruction requires SSE2 and although ubiquitous these days, it will still be detected as not supported for now.
2016-09-15 00:46:26 +03:00
Andrey Semashev
4274a78f46 Corrected typos that prevented compilation. 2016-09-14 20:36:12 +03:00
Jeremy W. Murphy
0abd9bfd73 Fix some unused parameter warnings. 2015-12-15 20:56:58 +11:00
Andrey Semashev
430169a53c Removed union_cast in favor of memcpy-based implementation and renamed the cast to bitwise_cast. Use __builtin_mem* instead of regular function calls when possible. This saves inclusion of <cstring> and possibly generates better code when -fno-builtin is used. The net result of these changes is no UB that union_cast was introducing while hopefully retaining the same performance. 2015-12-02 20:04:10 +03:00
Andrey Semashev
c989d936c5 Renamed casts.hpp to bitwise_cast.hpp. 2015-12-02 18:36:49 +03:00
Andrey Semashev
8bbf5e19db Corrected compiler check for BOOST_ATOMIC_DETAIL_NO_ASM_IMPLIED_ZERO_DISPLACEMENTS. 2015-11-08 17:24:30 +03:00
Andrey Semashev
6e2d7b28de Restored old versions of DCAS operations that use an extra register for compatibility with old binutils, which apparently are used on OS X with gcc 4.2. 2015-11-04 15:25:12 +03:00
Andrey Semashev
ec4738a8eb Workaround for MSVC-12 ICE. 2015-08-26 23:27:13 +03:00
Andrey Semashev
e3aac4bc65 Simplified memory references in assembler blocks which should reduce the typical number of used registers by one. 2015-08-23 03:27:37 +03:00
Andrey Semashev
4462c1cdf4 Fixed compilation with gcc 4.4. 2015-08-23 02:05:35 +03:00
Andrey Semashev
18fdcb1747 Fixed compilation errors with MSVC caused by using types with large alignment values as function arguments. Alignment is now explicitly specified for all storage types to avoid surprises. Optimized CAS-based exchange implementation on x86. Relaxed initial loads in CAS-based operations as this load is not required to be accurate. 2015-08-23 01:04:08 +03:00
Tim Blechmann
f0f1ef228c atomic: fix compilation on msvc 8 / Windows CE / arm 2015-08-22 10:17:02 +02:00
Andrey Semashev
ff91811c1a Fixes #10994. Fixed compilation with gcc 4.4 when cmpxchg16b is used. 2015-08-17 23:27:37 +03:00
Andrey Semashev
a8661d4cc2 Corrected memory barriers in store, load and thread_fence. Added indentation to the generated assembler code. 2015-08-15 00:29:43 +03:00