Improve DateTime.ParseExact perf for invariant culture by stephentoub · Pull Request #82877 · dotnet/runtime

stephentoub · 2023-03-02T04:56:55Z

Speed up the handling of ddd, dddd, MMM, and MMMM parts of a date time format string when using the invariant culture, which is very commonly used in parsing. Today, when one of these is encountered, the relevant array of comparison strings is retrieved from the DateTimeFormatInfo, and each is compared as a prefix against the current position in the input, using a linguistic ignore-case comparison. But for the invariant culture, we don't need to consult any arrays, and can do the comparison much more quickly. These parts dominate the processing of a format like that for RFC1123.

Method	Toolchain	Mean	Error	StdDev	Ratio
ParseExact	\main\corerun.exe	997.4 ns	19.97 ns	17.70 ns	1.00
ParseExact	\pr\corerun.exe	359.9 ns	2.99 ns	2.65 ns	0.36

const string Format = "ddd, dd MMM yyyy HH':'mm':'ss 'GMT'";

private string _s = new DateTime(1955, 11, 5, 6, 0, 0, DateTimeKind.Utc).ToString(Format, CultureInfo.InvariantCulture);

[Benchmark]
public void ParseExact() => DateTimeOffset.ParseExact(_s, Format, CultureInfo.InvariantCulture, DateTimeStyles.AllowInnerWhite | DateTimeStyles.AssumeUniversal);

ghost · 2023-03-02T04:57:08Z

Tagging subscribers to this area: @dotnet/area-system-globalization
See info in area-owners.md if you want to be subscribed.

Issue Details

Speed up the handling of ddd, dddd, MMM, and MMMM parts of a date time format string when using the invariant culture, which is very commonly used in parsing. Today, when one of these is encountered, the relevant array of comparison strings is retrieved from the DateTimeFormatInfo, and each is compared as a prefix against the current position in the input, using a linguistic ignore-case comparison. But for the invariant culture, we don't need to consult any arrays, and can do the comparison much more quickly. These parts dominate the processing of a format like that for RFC1123.

Method	Toolchain	Mean	Error	StdDev	Ratio
ParseExact	\main\corerun.exe	997.4 ns	19.97 ns	17.70 ns	1.00
ParseExact	\pr\corerun.exe	359.9 ns	2.99 ns	2.65 ns	0.36

const string Format = "ddd, dd MMM yyyy HH':'mm':'ss 'GMT'";

private string _s = new DateTime(1955, 11, 5, 6, 0, 0, DateTimeKind.Utc).ToString(Format, CultureInfo.InvariantCulture);

[Benchmark]
public void ParseExact() => DateTimeOffset.ParseExact(_s, Format, CultureInfo.InvariantCulture, DateTimeStyles.AllowInnerWhite | DateTimeStyles.AssumeUniversal);

Author:	stephentoub
Assignees:	-
Labels:	`area-System.Globalization`, `tenet-performance`
Milestone:	8.0.0

Speed up the handling of ddd, dddd, MMM, and MMMM parts of a date time format string when using the invariant culture, which is very commonly used in parsing. Today, when one of these is encountered, the relevant array of comparison strings is retrieved from the DateTimeFormatInfo, and each is compared as a prefix against the current position in the input, using a linguistic ignore-case comparison. But for the invariant culture, we don't need to consult any arrays, and can do the comparison much more quickly. These parts dominate the processing of a format like that for RFC1123.

src/libraries/System.Private.CoreLib/src/System/Globalization/DateTimeParse.cs

danmoseley · 2023-03-02T20:48:41Z

src/libraries/System.Private.CoreLib/src/System/Globalization/DateTimeParse.cs

+                }
+                else
+                {
+                    // Scan the month names (note that some calendars has 13 months) and find


Suggested change

// Scan the month names (note that some calendars has 13 months) and find

// Scan the month names (note that some calendars have 13 months) and find

Thanks. These are both pre-existing. If I have to restart CI for some reason, I'll take the comment fixes.

danmoseley · 2023-03-02T20:49:03Z

src/libraries/System.Private.CoreLib/src/System/Globalization/DateTimeParse.cs

+                {
+                    // Scan the month names (note that some calendars has 13 months) and find
+                    // the matching month name which has the max string length.
+                    // We need to do this because some cultures (e.g. "cs-CZ") which have


Suggested change

// We need to do this because some cultures (e.g. "cs-CZ") which have

// We need to do this because some cultures (e.g. "cs-CZ") have

stephentoub · 2023-03-03T02:46:26Z

Failures are known

stephentoub added area-System.Globalization tenet-performance Performance related issue labels Mar 2, 2023

stephentoub added this to the 8.0.0 milestone Mar 2, 2023

ghost assigned stephentoub Mar 2, 2023

stephentoub force-pushed the dtinvariant branch from 8d3f1c8 to 73e1fe2 Compare March 2, 2023 04:59

runfoapp bot mentioned this pull request Mar 2, 2023

Test failure: System.Security.Cryptography.X509Certificates.Tests.CertificateCreation.CertificateRequestChainTests/CreateChain_Hybrid #25979

Closed

tarekgh reviewed Mar 2, 2023

View reviewed changes

src/libraries/System.Private.CoreLib/src/System/Globalization/DateTimeParse.cs Outdated Show resolved Hide resolved

tarekgh reviewed Mar 2, 2023

View reviewed changes

src/libraries/System.Private.CoreLib/src/System/Globalization/DateTimeParse.cs Outdated Show resolved Hide resolved

tarekgh approved these changes Mar 2, 2023

View reviewed changes

Merge branch 'main' into dtinvariant

02377c5

build-analysis bot mentioned this pull request Mar 2, 2023

DataContractSerializerTests.DCS_MyPersonSurrogate_Stress failing in CI #35066

Open

Address PR feedback

5012178

stephentoub force-pushed the dtinvariant branch from 546ed73 to 5012178 Compare March 2, 2023 20:05

danmoseley reviewed Mar 2, 2023

View reviewed changes

stephentoub merged commit eb6b81c into dotnet:main Mar 3, 2023

stephentoub deleted the dtinvariant branch March 3, 2023 02:46

This was referenced Mar 3, 2023

Alpine System.Net.Security.Tests failing because of "Cannot load library libgssapi_krb5.so.2" #82945

Closed

System.Net.Quic.Tests.QuicStreamTests.WriteCanceled_NextWriteThrows test failure #76831

Closed

ghost locked as resolved and limited conversation to collaborators Apr 2, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve DateTime.ParseExact perf for invariant culture#82877

Improve DateTime.ParseExact perf for invariant culture#82877
stephentoub merged 3 commits intodotnet:mainfrom
stephentoub:dtinvariant

stephentoub commented Mar 2, 2023

Uh oh!

ghost commented Mar 2, 2023

Uh oh!

Uh oh!

Uh oh!

danmoseley Mar 2, 2023

Uh oh!

stephentoub Mar 2, 2023

Uh oh!

danmoseley Mar 2, 2023

Uh oh!

stephentoub commented Mar 3, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	// Scan the month names (note that some calendars has 13 months) and find
	// Scan the month names (note that some calendars have 13 months) and find

	// We need to do this because some cultures (e.g. "cs-CZ") which have
	// We need to do this because some cultures (e.g. "cs-CZ") have

Conversation

stephentoub commented Mar 2, 2023

Uh oh!

ghost commented Mar 2, 2023

Uh oh!

Uh oh!

Uh oh!

danmoseley Mar 2, 2023

Choose a reason for hiding this comment

Uh oh!

stephentoub Mar 2, 2023

Choose a reason for hiding this comment

Uh oh!

danmoseley Mar 2, 2023

Choose a reason for hiding this comment

Uh oh!

stephentoub commented Mar 3, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants