To increase portability, eliminating inline assembler seemed like the obvious place to start. In the root/include/libbb.h file, I replaced the following code (note Line 4):
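PThreads (POSIX threads) are a fairly complex threading API for C. What they offer, though, is portability: POSIX specifies that mutex operations such as pthread_mutex_lock() and pthread_mutex_unlock() synchronize memory between threads, so they can stand in for memory barriers that would otherwise have to be written in architecture-specific inline assembler.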
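To make the idea concrete, here is a minimal sketch of a barrier built only from pthread calls. It is not the test file from this post and not BusyBox code: the names portable_barrier, barrier_lock, producer, and run_barrier_demo are hypothetical, and it assumes the replaced inline assembler was acting as a memory barrier. Locking and unlocking a mutex is heavier than a one-line asm barrier, so treat this as an illustration of the trade-off rather than a drop-in replacement.

```c
#include <pthread.h>

/* Hypothetical helper names; none of these come from libbb.h. */
static pthread_mutex_t barrier_lock = PTHREAD_MUTEX_INITIALIZER;

/*
 * POSIX lists pthread_mutex_lock() and pthread_mutex_unlock() among the
 * functions that synchronize memory, and the compiler cannot move memory
 * accesses across these opaque calls. Locking and immediately unlocking
 * therefore acts as a portable (if slow) barrier with no inline assembler.
 */
static void portable_barrier(void)
{
    pthread_mutex_lock(&barrier_lock);
    pthread_mutex_unlock(&barrier_lock);
}

static int shared_value;
static int shared_ready;

/* Writes the payload first, then the flag, with the barrier in between. */
static void *producer(void *arg)
{
    shared_value = *(int *)arg;
    portable_barrier();
    shared_ready = 1;
    return NULL;
}

/*
 * Runs the producer in its own thread and returns the value it stored,
 * or -1 on failure. pthread_join() also synchronizes memory, so reading
 * the shared variables after the join is safe.
 */
int run_barrier_demo(int value)
{
    pthread_t tid;

    shared_ready = 0;
    if (pthread_create(&tid, NULL, producer, &value) != 0)
        return -1;
    if (pthread_join(tid, NULL) != 0)
        return -1;
    return shared_ready ? shared_value : -1;
}
```

Calling run_barrier_demo(42) should hand back the value the producer thread stored. In a real reader/writer setup, the reading thread would also need to go through portable_barrier() (or the same mutex) before looking at the flag, since the guarantee only holds between threads that both use the synchronizing calls.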
After creating this test file, I tried several ways to integrate this code into the original file. Because of the complexity of that integration, as well as of the later byteswapping code, I decided to leave this as is and focus on the byteswapping code.
If anyone would like to use my work above, feel free to do so.