Fix Number.ParseNumber to not assume '\0' at the end of a span #17808

stephentoub · 2018-04-27T05:57:42Z

This routine was written for parsing strings, which are implicitly null-terminated. It doesn't factor in string length; instead it uses tricks to exit loops when the next character is null. Now that the routine is also used for spans, this is a real problem: spans need not be null-terminated, and generally aren't when they represent slices, so expecting a null terminator like this can result in walking off the end of valid memory.

I would like to see all of this code rewritten to use span. In the interim, though, as a short-term fix I've changed all dereferences of the current position to compare against the length of the span (or, rather, a pointer to the end), and pretend that a null terminator was found if we've hit the end.
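To illustrate the short-term fix described above, here is a minimal sketch, not the actual code from Number.Parsing.cs. It uses span indexing rather than raw pointers so that it compiles without unsafe code, and the names `BoundedReadSketch`, `CharOrNull`, and `CountLeadingDigits` are hypothetical. The idea is the same as comparing the current position against a pointer to the end: every read checks the bound first and reports '\0' once the end is reached.

```csharp
using System;

static class BoundedReadSketch
{
    // Hypothetical helper: returns the character at index i, or '\0' once i
    // reaches the end of the span. This mirrors `p < pEnd ? *p : '\0'`.
    static char CharOrNull(ReadOnlySpan<char> s, int i) => i < s.Length ? s[i] : '\0';

    // Hypothetical digit loop written in the "stop when the next char is '\0'"
    // style, but with every read going through the bounded helper.
    public static int CountLeadingDigits(ReadOnlySpan<char> s)
    {
        int i = 0;
        char ch = CharOrNull(s, i);
        while (ch >= '0' && ch <= '9')
        {
            ch = CharOrNull(s, ++i);
        }
        return i;
    }
}
```

For example, `BoundedReadSketch.CountLeadingDigits("12345".AsSpan(1, 3))` only sees the slice `234`, even though a `5` follows it in the underlying string. A loop that waited for a real '\0' would keep reading past the slice.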

Contributes to https://github.com/dotnet/corefx/issues/29343
cc: @jkotas, @danmosemsft

danmoseley · 2018-04-27T14:22:51Z

src/mscorlib/shared/System/Number.Parsing.cs

+            while (true)
            {
+                char cp = p < pEnd ? *p : '\0';
+                if (cp != *str && (*str != '\u00a0' || cp != '\u0020'))


This is a case where I think !(*str == '\u00a0' && cp == '\u0020') would be much clearer.
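The two spellings are equivalent by De Morgan's law. A small sketch of that, assuming the suggestion replaces only the parenthesized sub-expression and that the condition acts as a "characters do not match" test (the diff excerpt shows only the `if` line, so that reading is an assumption, and the names below are hypothetical):

```csharp
using System;

static class MismatchSketch
{
    // Form used in the diff: s stands in for *str, cp for the current input char.
    public static bool MismatchOriginal(char cp, char s) =>
        cp != s && (s != '\u00a0' || cp != '\u0020');

    // Form suggested in the review comment, with the special case stated positively:
    // a no-break space in s is allowed to match an ordinary space in the input.
    public static bool MismatchSuggested(char cp, char s) =>
        cp != s && !(s == '\u00a0' && cp == '\u0020');
}
```

Both forms return false when the characters are equal or when `s` is '\u00a0' and `cp` is '\u0020', and true otherwise.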

danmoseley

Ugh...

danmoseley · 2018-04-27T14:31:18Z

In general, it is hard to prove the code in this file is safe by reading it. For example, NumberBuffer is assumed to be null-terminated above. It seems it always is, but that is established several frames up, where the ReadOnlySpan<char> always originates from a string before it is parsed.

stephentoub · 2018-04-27T15:52:07Z

> In general, it is hard to prove the code in this file is safe by reading it.

Yes, hence my comment about wanting to see all of this rewritten. But that's too big a change to make right now.

danmoseley · 2018-05-29T00:26:27Z

> I would like to see all of this code rewritten to use span.

@stephentoub do we need an issue to track this? I can't find one.

stephentoub · 2018-05-29T00:39:34Z

I don't believe we have one.

dotnet-bot added the 2 - In Progress label Apr 27, 2018

stephentoub mentioned this pull request Apr 27, 2018

Add more Span-based parsing tests for Int32 and friends dotnet/corefx#29363

Merged

jkotas approved these changes Apr 27, 2018

danmoseley reviewed Apr 27, 2018

danmoseley approved these changes Apr 27, 2018

Address PR feedback

d15e866

stephentoub merged commit d0a55af into dotnet:master Apr 27, 2018

stephentoub deleted the numberparsing branch April 27, 2018 22:37

stephentoub mentioned this pull request Apr 27, 2018

[release/2.1] Fix Number.ParseNumber to not assume '\0' at the end of a span (#17808) #17820

Merged

stephentoub mentioned this pull request Feb 8, 2019

Fix BigInteger parsing of substring span dotnet/corefx#35185

Merged

danmoseley mentioned this pull request Jan 31, 2020

Remove unsafe code from number parsing dotnet/runtime#10397

Open
