Simplify `CAMLalign` and use C11 `max_align_t` #13139

MisterDA · 2024-04-30T20:13:10Z

Some cleanups removing checks and workarounds for older compilers, assuming that the compiler supports C11 or C++11 out of the box. We may use _Alignas (since C11) or alignas (since C23) directly, and use the max_align_t type. Unfortunately, support for max_align_t is missing from the Windows C standard library.

In our public headers, we're using either: - C23 where `alignas` is a keyword; - C++11 or later where `alignas` is also available; - C11/C17 where `_Alignas` is available.

Unfortunately, MSVC C11 suppport is incomplete and doesn't define it. - https://developercommunity.visualstudio.com/t/max_align_t-is-not-provided-by-stddefh/10299797 - https://developercommunity.visualstudio.com/t/stdc11-should-add-max-align-t-to-stddefh/1386891

For C++, MSVC defines `using max_align_t = double`. Take inspiration from GCC and LLVM for the fallback implementation. It's unlikely that we need to carry a fallback implementation for other compilers. If so, the following could be used: #if defined(__STDC_VERSION__) && __STDC_VERSION__ >= 202311L || \ defined(__cplusplus) #define CAMLalignof(n) alignof(n) #else #define CAMLalignof(n) _Alignof(n) #endif typedef struct { long long ll CAMLalign(CAMLalignof(long long)); long double ld CAMLalign(CAMLalignof(long double)); } max_align_t; https://gcc.gnu.org/git/?p=gcc.git;a=blob;f=gcc/ginclude/stddef.h https://github.com/llvm/llvm-project/blob/main/clang/lib/Headers/__stddef_max_align_t.h https://en.cppreference.com/w/c/types/max_align_t

xavierleroy

Looks good to me. +1 for using _Alignas and alignas in preference to attributes. One suggestion below concerning the MSVC fallback.

xavierleroy · 2024-05-31T13:17:31Z

runtime/memory.c


 struct pool_block {
 #ifdef DEBUG
 intnat magic;
 #endif
 struct pool_block *next;
 struct pool_block *prev;
- union max_align data[];  /* not allocated, used for alignment purposes */
+ max_align_t data[]; /* not allocated, used for alignment purposes */
 };


One downside with this use of double as fallback max_align_t type is that double has rather low alignment: 8 on x86_64, 4 on x86_32, while most SSE vector instructions require 16-alignment. (In Linux x86_64 and macOS x86_64, max_align_t has 16 alignment.) What about using an explicit 16 alignment as the fallback case?

struct pool_block { #ifdef DEBUG intnat magic; #endif struct pool_block *next; struct pool_block *prev; #ifdef HAVE_MAX_ALIGN_T max_align_t data[]; /* not allocated, used for alignment purposes */ #else CAMLalign(16) char data[]; /* 16 is a reasonable alignment default */ #endif };

I'd tend to align (ha, ha) with your suggestion, but: MSVC C++ cstddef.h header uses double alignment for max_align_t, and clang-cl uses double too, which I think makes it a reasonable default for Windows.
As for a general fallback, I hope that other compilers+libc aren't as buggy, and I wonder if we do need to provide a definition, or rather catch a missing definition as a compilation failure.

MSVC C++ cstddef.h header uses double alignment for max_align_t

It is in error, then. The program below, compiled with MSVC, prints 16, showing that there are types with alignment greater than 8.

#include <iostream> #include <xmmintrin.h> int main() { std::cout << alignof(__m128) << '\n'; return 0; }

In this PR, you're not trying to emulate whatever strange choices MSVC does, but to make sure OCaml's pool allocator works as intended. If the intent is to align for the maximal alignment constraint of the target platform, the alignment must be >= 16 on x86, because SSE instructions. If the intent is to align for the biggest datatype OCaml stores in heap blocks, word-alignment is enough and you don't need to add anything to struct pool_block to guarantee it, as it already contains two word-sized pointers.

What about defaulting to long double rather than double then?

It is in error, then. The program below, compiled with MSVC, prints 16, showing that there are types with alignment greater than 8.

At least for C++, I don't think there's an error.

The type max_align_t is a POD type whose alignment requirement is at least as great as that of every scalar type, and whose alignment requirement is supported in every context. [C++11]

#include <iostream> #include <xmmintrin.h> #include <cstddef> #include <type_traits> int main() { std::cout << alignof(__m128) << std::endl << alignof(std::max_align_t) << std::endl << std::is_scalar<__m128>() << std::endl; return 0; }

shows 16, 8, 0, indicating that __m128 isn't considered a scalar type, and thus the definition of std::max_align_t is consistent (with respect to the __m128 type).

The definition for C is more vague and I guess it could be argued that 16 would be a correct value.

max_align_t which is an object type whose alignment is as great as is supported by the implementation in all contexts; [C11]

What about defaulting to long double rather than double then?

long double is identical to double under MSVC ¹.

Thanks for the thorough review. I think the real question is indeed whether the intent is to align for the maximal alignment constraint of the target platform, or to align for the biggest datatype OCaml stores in heap blocks. None of the fields in the former union max_align had a 16 bytes alignment, and all worked well, hasn't it? Now it's not clear to me that this field is actually needed.

Footnotes

https://learn.microsoft.com/en-us/cpp/c-language/type-long-double?view=msvc-170 ↩

dustanddreams approved these changes May 2, 2024

View reviewed changes

MisterDA force-pushed the alignas-simplify-CAMLalign branch from 0d98742 to a49b31b Compare May 14, 2024 08:20

gasche assigned xavierleroy May 15, 2024

MisterDA added 3 commits May 21, 2024 08:47

Simplify CAMLalign

c50e4cc

In our public headers, we're using either: - C23 where `alignas` is a keyword; - C++11 or later where `alignas` is also available; - C11/C17 where `_Alignas` is available.

MisterDA force-pushed the alignas-simplify-CAMLalign branch from a49b31b to d226d67 Compare May 21, 2024 06:48

xavierleroy approved these changes May 31, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simplify `CAMLalign` and use C11 `max_align_t` #13139

Simplify `CAMLalign` and use C11 `max_align_t` #13139

MisterDA commented Apr 30, 2024 •

edited

xavierleroy left a comment

xavierleroy May 31, 2024 •

edited

MisterDA Jun 1, 2024

xavierleroy Jun 1, 2024

dustanddreams Jun 3, 2024

MisterDA Jun 3, 2024

Simplify CAMLalign and use C11 max_align_t #13139

Are you sure you want to change the base?

Simplify CAMLalign and use C11 max_align_t #13139

Conversation

MisterDA commented Apr 30, 2024 • edited

xavierleroy left a comment

Choose a reason for hiding this comment

xavierleroy May 31, 2024 • edited

Choose a reason for hiding this comment

MisterDA Jun 1, 2024

Choose a reason for hiding this comment

xavierleroy Jun 1, 2024

Choose a reason for hiding this comment

dustanddreams Jun 3, 2024

Choose a reason for hiding this comment

MisterDA Jun 3, 2024

Choose a reason for hiding this comment

Footnotes

Simplify `CAMLalign` and use C11 `max_align_t` #13139

Simplify `CAMLalign` and use C11 `max_align_t` #13139

MisterDA commented Apr 30, 2024 •

edited

xavierleroy May 31, 2024 •

edited