Make it possible to align at 32 byte boundaries by MichalStrehovsky · Pull Request #6675 · dotnet/corert

MichalStrehovsky · 2018-12-11T16:02:59Z

This will be necessary to support computing layout for types that embed Vector256.

This is foundational work needed to support the new HW intrinsics. We may or may not end up implementing function multiversioning for ready to run for .NET Core 3.0 CPAOT, but being able to compute the layout will let us at least pregenerate method bodies that pass through vector types without actually calling the intrinsics.

See individual commits. I'm flexible on whether we should include 47bacbd.

tannergooding · 2018-12-11T16:25:31Z

Is this actually fixing the alignment or just fixing up the struct packing for these fields?

tannergooding · 2018-12-11T16:27:55Z

src/Common/src/TypeSystem/Common/TargetDetails.cs

+                if (Abi == TargetAbi.ProjectN)
+                {
+                    // ProjectN doesn't support hardware intrinsics
+                    return 8;


Why special-case this for ProjectN? The layout of the struct should be consistent (IMO), even if you don't support codegen for any of the types. This allows the Vector64<T>, Vector128<T>, and Vector256<T> types to still be used as the corresponding ABI type for interchange/interop scenarios.

Bumping this value up has negative effects on universal shared code (RyuJIT doesn't concern itself with it, but it's a thing on Project N).

This constant is used in cases like this:

class Foo<T> { T myField; }

When T is __UniversalCanon (could be a ref type, could be a struct, we have no clue): with maximum supported alignment of 8, we can still assign an offset to myField because the field is already aligned at the maximum possible alignment (the previous type ends at offset 8), no matter what T ends up being at runtime.

Once this is bumped to 32, T could be something that requires alignment of 16 or 32 and we can no longer give it an offset (so accessing the field from generated code becomes: go call a helper to find offset of myField). This is a net perf loss. No point in paying it when the backend is UTC.

Will ProjectN never support hardware intrinsics? Because it will be a breaking change to modify this in the future if users have a T that is a Vector128<T> otherwise (the layout of the struct will change, which will impact unsafe code, offsets, etc).

This constant is used in cases like this:

class Foo<T> { T myField; }

What happens in CoreCLR in cases like these today?

For reference types, Vector64/128/256 alignment requirements should be irrelevant since the GC will allocate the whole type with an arbitrary offsets anyway.

For struct types, where does this alignment matter? Is it just that the offsets are aligned; or does the JIT actually allocate these types on aligned addresses on the stack?

What happens in CoreCLR in cases like these today?

CoreCLR appropriately packs T according to the packing for T. We have tests covering this here: https://github.com/dotnet/coreclr/blob/master/tests/src/Interop/StructPacking/StructPacking.cs#L27

For struct types, where does this alignment matter? Is it just that the offsets are aligned; or does the JIT actually allocate these types on aligned addresses on the stack?

The alignment technically matters for all cases. I believe that, for Vector128<T> we generally work (since stack alignment defaults to 16), but I don't think we have anything special that guarantees this for stack allocations today (we probably should for both this and RVA statics, since those should be possible to make "pay to play").

MichalStrehovsky · 2018-12-11T18:05:15Z

Is this actually fixing the alignment or just fixing up the struct packing for these fields?

This just bumps the number for maximum supported alignment from 8 to a higher number.

tannergooding · 2018-12-11T18:13:16Z

This just bumps the number for maximum supported alignment from 8 to a higher number.

Sorry, I'm still not certain what this means. I'm trying to determine if this just matches the CoreCLR behavior (structures will be packed appropriately and fields will be at the correct/expected offsets; but structures may not be allocated at the "correct" address), or if it also fixes the alignment of these structs when allocated (for a Vector128<T>: (address % 16) == 0)

MichalStrehovsky · 2018-12-11T18:21:18Z

src/Common/src/TypeSystem/Common/TargetDetails.cs

        /// Gets the maximum alignment to which something can be aligned
        /// </summary>
-        public static int MaximumAlignment
+        public int MaximumAlignment


Looking at this now, maybe MaximumAlignment should actually be moved to the type system context if we decide to go the instance field route.

MaximumAlignment is an optimization for universal shared generic code. The moment it becomes too big like what we are doing here it becomes useless. We can just drop it completely.

MichalStrehovsky · 2018-12-11T18:23:02Z

Sorry, I'm still not certain what this means. I'm trying to determine if this just matches the CoreCLR behavior (structures will be packed appropriately and fields will be at the correct/expected offsets; but structures may not be allocated at the "correct" address), or if it also fixes the alignment of these structs when allocated

This doesn't do anything for Vector64/128/256 - it's not getting special cased anywhere. This pull request just opens the opportunity to have something with alignment higher than 8 in the system.

tannergooding · 2018-12-11T18:27:07Z

This pull request just opens the opportunity to have something with alignment higher than 8 in the system.

Right. So my question is: When we do actually add types that have higher alignments (such as special-casing the HWIntrinsic types) will it actually impact the allocation alignment or just the struct packing?

MichalStrehovsky · 2018-12-11T18:34:24Z

So my question is: When we do actually add types that have higher alignments (such as special-casing the HWIntrinsic types) will it actually impact the allocation alignment or just the struct packing?

CPAOT compiler (that this change is mostly for) generates ready to run code to run on top of CoreCLR. Do you expect we would need to do something for allocation alignment here? My expectation is that the ready to run helper generated by the runtime will deal with that. Struct packing for vectors will be dealt with after we settle on a design for this max alignment extensibility point.

(If and when we do something for hardware intrinsic on the CoreRT runtime and the flavor of the compiler that targets that runtime is up in the air. It will quite likely have to be one of my weekend projects.)

tannergooding · 2018-12-11T18:46:23Z

The ABI requirements for __m64, __m128, and __m256 specify that these types have specific packing and alignment.

On CoreCLR, we currently respect the packing (which ensures layout is correct) but do not respect the alignment requirement (due to limitations with the GC, and the fact that adding the support is not currently thought to be easily possible in a "pay to play" manner).

I think for the "ready to run" scenario, just ensuring that the layout is correct would be sufficient. I think we also need to ensure the layout is correct for ProjectN (otherwise you end up in a scenario where ProjectN vs everything else has a different struct layout).

I also think that we should be special-casing the layout of Vector64<T>, Vector128<T>, and Vector256<T> now (rather than later) as that should be much simpler and will avoid any breaking changes in the future.

jkotas · 2018-12-12T05:13:32Z

Will ProjectN never support hardware intrinsics?

We may want to have issue opened for this so that it is not forgotten once/if these Vector types ship in ProjectN.

davidwrighton

This change looks good. The discussion around ProjectN will need to be done as part of .NET Native, and not as part of the CoreRT repo especially as we'll need to update the underlying compiler to be aware of this sort of change as well.

This will allow us to do layout of types that have `Vector256` in them. I'm making this conditional and not enabled in Project N for now, since the Project N code generator doesn't support the hardware intrinsics anyway and increasing this value means that we have fewer places where we know field placement when universal shared generics are involved.

xUnit sniffs into properties otherwise and we assert/throw in some.

MichalStrehovsky · 2019-01-11T11:21:39Z

I've rebased this against latest to get rid of a merge conflict in a R2R file.

I'm keeping the no merge flag because I would like to do an integration to nmirror first. I'm expecting some ToF failure fun from all the changes that accumulated over time and would like to limit the delta.

MichalStrehovsky requested a review from davidwrighton December 11, 2018 16:03

tannergooding reviewed Dec 11, 2018

View reviewed changes

MichalStrehovsky commented Dec 11, 2018

View reviewed changes

jkotas approved these changes Dec 12, 2018

View reviewed changes

tannergooding mentioned this pull request Dec 13, 2018

Ensure that the Vector64<T>, Vector128<T>, and Vector256<T> types are properly handled with regards to struct packing/layout #6685

Closed

jkotas added the * NO MERGE * label Dec 17, 2018

MichalStrehovsky mentioned this pull request Dec 20, 2018

Enable System.Runtime.Intrinsics intrinsics #6173

Open

4 tasks

davidwrighton approved these changes Jan 10, 2019

View reviewed changes

MichalStrehovsky added 3 commits January 11, 2019 11:49

Add ToString override for TargetDetails

54ec7b4

xUnit sniffs into properties otherwise and we assert/throw in some.

Make MaximumAlignment an instance property

376dc3a

MichalStrehovsky force-pushed the allow32bytesAlignment branch from 47bacbd to 376dc3a Compare January 11, 2019 11:19

MichalStrehovsky merged commit 3c31103 into dotnet:master Jan 15, 2019

MichalStrehovsky deleted the allow32bytesAlignment branch January 15, 2019 13:44

Conversation

MichalStrehovsky commented Dec 11, 2018

Uh oh!

tannergooding commented Dec 11, 2018

Uh oh!

tannergooding Dec 11, 2018

Choose a reason for hiding this comment

Uh oh!

MichalStrehovsky Dec 11, 2018

Choose a reason for hiding this comment

Uh oh!

tannergooding Dec 11, 2018

Choose a reason for hiding this comment

Uh oh!

jkotas Dec 11, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tannergooding Dec 11, 2018

Choose a reason for hiding this comment

Uh oh!

tannergooding Dec 11, 2018

Choose a reason for hiding this comment

Uh oh!

MichalStrehovsky commented Dec 11, 2018

Uh oh!

tannergooding commented Dec 11, 2018

Uh oh!

MichalStrehovsky Dec 11, 2018

Choose a reason for hiding this comment

Uh oh!

jkotas Dec 12, 2018

Choose a reason for hiding this comment

Uh oh!

MichalStrehovsky commented Dec 11, 2018

Uh oh!

tannergooding commented Dec 11, 2018

Uh oh!

MichalStrehovsky commented Dec 11, 2018

Uh oh!

tannergooding commented Dec 11, 2018

Uh oh!

jkotas commented Dec 12, 2018

Uh oh!

davidwrighton left a comment

Choose a reason for hiding this comment

Uh oh!

MichalStrehovsky commented Jan 11, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

jkotas Dec 11, 2018 •

edited

Loading