Avoid unaligned access to Shape_Type->width #982

th-otto · 2024-03-24T09:08:17Z

No description provided.

xezon · 2024-03-24T09:13:16Z

common/getshape.cpp

@@ -242,7 +241,9 @@ int Get_Shape_Width(void const* shape)
 {
 Shape_Type* shp = (Shape_Type*)shape;

- return le16toh(shp->Width);
+ uint16_t width;
+ memcpy(&width, &shp->Width, sizeof(width));


Perhaps create a template function ReadUnaligned<uint16_t> to help communicate the intent.

giulianobelinassi · 2024-05-20T23:48:44Z

is this really necessary? DSi port has the same issue (unaligned access) but I did not need to patch this function in order to get things to work. What exactly does this fix?

xezon · 2024-05-21T05:20:10Z

Unaligned access is undefined behaviour and it will crash on some hardware.

th-otto · 2024-05-21T06:29:16Z

You might be right, Shape_Type is declared as packed(1), and the compiler should already access the Width member with byte accesses on architectures that need this. OTOH, it should not hurt, as the memcpy of 2 bytes is optimized anyway.

Simple test:
both

uint16_t test(Shape_Type *t)
{
	return t->Width;
}

and

uint16_t test(Shape_Type *t)
{
	uint16_t w;
	
	memcpy(&w, &t->Width, sizeof(w));
	return w;
}

produce the same code on x86.

giulianobelinassi · 2024-05-21T11:50:04Z

You might be right, Shape_Type is declared as packed(1), and the compiler should already access the Width member with byte accesses on architectures that need this. OTOH, it should not hurt, as the memcpy of 2 bytes is optimized anyway.

Simple test: both
uint16_t test(Shape_Type *t)
{
	return t->Width;
}
and
uint16_t test(Shape_Type *t)
{
	uint16_t w;
	
	memcpy(&w, &t->Width, sizeof(w));
	return w;
}
produce the same code on x86.

I Will check this on ARM9 when I have the opportunity. Sorry but the compiler always remove those memcpys on x86_64, so this arch is not ideal for this kind of test

xezon · 2024-05-21T11:56:05Z

On ARM this may error as highlighted by this article:

https://www.alfonsobeato.net/arm/how-to-access-safely-unaligned-data/

giulianobelinassi · 2024-05-26T17:36:52Z

This indeed inserts a memcpy call:
https://godbolt.org/z/77W3o5M3P

Although this won't error out since memcpy is guaranteed to read the bytes in the correct order if the pointers do not alias, it still keeps the memcpy call.

The solution presented by you using templates seems to be the best approach, tho. Perhaps we should refactor the code to use this approach instead.

Avoid unaligned access to Shape_Type->width

984febc

xezon reviewed Mar 24, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid unaligned access to Shape_Type->width #982

Avoid unaligned access to Shape_Type->width #982

th-otto commented Mar 24, 2024

xezon Mar 24, 2024

giulianobelinassi commented May 20, 2024

xezon commented May 21, 2024

th-otto commented May 21, 2024 •

edited

giulianobelinassi commented May 21, 2024

xezon commented May 21, 2024

giulianobelinassi commented May 26, 2024

Avoid unaligned access to Shape_Type->width #982

Are you sure you want to change the base?

Avoid unaligned access to Shape_Type->width #982

Conversation

th-otto commented Mar 24, 2024

xezon Mar 24, 2024

Choose a reason for hiding this comment

giulianobelinassi commented May 20, 2024

xezon commented May 21, 2024

th-otto commented May 21, 2024 • edited

giulianobelinassi commented May 21, 2024

xezon commented May 21, 2024

giulianobelinassi commented May 26, 2024

th-otto commented May 21, 2024 •

edited