Fix endianness and alignment assumptions in `pnNpCommon` IO code by dgelessus · Pull Request #1850 · H-uru/Plasma

dgelessus · 2026-03-07T01:15:10Z

My goal was just to remove the StrCopy calls, but while I was here, I also fixed the endianness and alignment issues in this code. Perhaps it will help @dpogue on PowerPC :)

This adds some helper functions for converting little-endian UTF-16 directly from/to ST::string, which could also be useful for other code that reads/writes UTF-16 strings manually, like file server manifests and the GameMgr.

dpogue

This is definitely nicer than the hacky string copy stuff I had to do in plNglFile

Hoikas · 2026-03-08T19:11:54Z

Sources/Plasma/CoreLib/hsEndian.h


 #include "HeadSpin.h"

+#include <string_theory/string>


Could we use forward declarations instead?

Probably. I was too lazy to type out the template declaration needed for ST::char16_buffer :D

Changing the return type of hsSTStringToUTF16LE as suggested above would also resolve this ;)

Sources/Plasma/NucleusLib/pnNetProtocol/pnNpCommon.cpp

zrax · 2026-03-09T15:47:40Z

Sources/Plasma/CoreLib/hsEndian.cpp

+    // Can't pre-allocate anything, because we don't know the string length yet...
+    std::vector<char16_t> utf16Buffer;


You can, however, reserve bufferSize/sizeof(char16_t) to flatten the number of actual memory allocations to 1.

hmm, I don't think that's a good idea in the general case. For something like the file server manifests, where we have potentially hundreds of zero-terminated strings concatenated, we don't want to reserve the size of the entire manifest for each string.

Ah, good point, I wasn't considering the case where the buffer is significantly larger than the string to extract from it...

Would something like 128 work as an initial size that's big enough to handle most of the strings we'd expect over the network without incurring reallocations but also not incurring a big chunk of memory?

I rewrote the code so it counts the string length beforehand. This allows reusing hsSTStringFromUTF16LE, which allocates a buffer with the right size.

zrax · 2026-03-09T15:49:56Z

Sources/Plasma/CoreLib/hsEndian.cpp

+    return ST::string::from_utf16(utf16Buffer.data(), utf16Buffer.size());
+}
+
+ST::utf16_buffer hsSTStringToUTF16LE(const ST::string& string)


Swapping the endianness of the buffer implies that this is no longer a legal utf16_buffer... It may make more sense logically to return as a vector<uint16_t>

I made it a std::vector<uint8_t> now, so we don't have any incorrect endianness values at all. This actually also works better for the calling code.

zrax · 2026-03-09T15:50:50Z

Sources/Plasma/CoreLib/hsEndian.h


 #include "HeadSpin.h"

+#include <string_theory/string>


Changing the return type of hsSTStringToUTF16LE as suggested above would also resolve this ;)

Sources/Plasma/NucleusLib/pnNetProtocol/pnNpCommon.cpp

This removes one dependency on pnUtils/StrCopy.

This makes the most sense for a caller that wants to skip past the string that was just read.

This way, it can also share the core code with hsSTStringFromUTF16LE.

dpogue approved these changes Mar 8, 2026

View reviewed changes

Hoikas reviewed Mar 8, 2026

View reviewed changes

Hoikas requested a review from zrax March 8, 2026 19:18

zrax reviewed Mar 9, 2026

View reviewed changes

dgelessus force-pushed the pnNpCommon_endianness branch from cd6cbe1 to 2c39b17 Compare March 12, 2026 00:50

dgelessus added 11 commits March 22, 2026 01:25

Port NetGameRank to ST::string

18f7763

This removes one dependency on pnUtils/StrCopy.

Remove CopyFrom methods that now do the same as simple assignment

03fa7e4

Add hsEndian functions for converting between ST::string and UTF-16 LE

6e5e415

Fix endianness and alignment assumptions in pnNpCommon IO code

8968242

Use vector instead of primitive array for score ranks callback

4d1432c

Verify string terminator value in IRead<ST::string>

0470f40

Include terminator in hsSTStringFromTerminatedUTF16LE consumedSize

7752520

This makes the most sense for a caller that wants to skip past the string that was just read.

Add tests for hsEndian UTF-16 functions

8ce6a03

Fix edge cases with consumedSize for truncated buffers

0188212

Avoid buffer reallocations in hsSTStringFromTerminatedUTF16LE

301c15a

This way, it can also share the core code with hsSTStringFromUTF16LE.

Rename endianSwap tests to hsEndian for consistency with header

2bec282

dgelessus force-pushed the pnNpCommon_endianness branch from 65326ec to 2bec282 Compare March 22, 2026 00:48

		// Can't pre-allocate anything, because we don't know the string length yet...
		std::vector<char16_t> utf16Buffer;

Conversation

dgelessus commented Mar 7, 2026

Uh oh!

dpogue left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants