This is a follow up to https://github.com/dotnet/runtime/pull/39506, https://github.com/dotnet/runtime/pull/39507 and https://github.com/dotnet/runtime/pull/39050 to investigate Tamar's suggestions for improving ARM64 perf in ASCIIUtility and Utf16Utility. The suggestions are https://github.com/dotnet/runtime/pull/39506#discussion_r469516647, https://github.com/dotnet/runtime/pull/39507#discussion_r468013622 and https://github.com/dotnet/runtime/pull/39507#discussion_r468097953 cc @carlossanlop @jeffhandley @kunalspathak @echesakovMSFT @tannergooding