Issue with multibyte chars in source_text() computation

`source_text()` treats the span's `lo` as a byte index, but it is actually a character index:

https://github.com/dtolnay/proc-macro2/blob/7f5533d6cc9ff783d174aec4e3be2caa202b62ca/src/fallback.rs#L367

I expect this test to pass, but it does not:

```rust
#[cfg(span_locations)]
#[test]
fn source_text() {
    let input = "    𓀕 c    ";
    let mut tokens = input
        .parse::<proc_macro2::TokenStream>()
        .unwrap()
        .into_iter();

    let ident1 = tokens.next().unwrap();
    assert_eq!("𓀕", ident1.span().source_text().unwrap());

    let ident2 = tokens.next().unwrap();
    assert_eq!("c", ident2.span().source_text().unwrap());
}
```

Instead it panics, because `𓀕` is a 4-byte character occupying bytes 4..8, so byte index 6 falls inside it:
```
---- source_text stdout ----
thread 'source_text' panicked at 'byte index 6 is not a char boundary; it is inside '𓀕' (bytes 4..8) of `    𓀕 c   `', src/fallback.rs:367:25
```
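
One possible direction for a fix, sketched under the assumption that the span's `lo` and `hi` are character offsets into the file's text: convert them to byte offsets before slicing. The helper name `slice_by_chars` is hypothetical and not part of proc-macro2.

```rust
/// Hypothetical helper (not part of proc-macro2): slice `text` by the
/// character offsets `lo..hi` instead of byte offsets.
/// Returns `None` if the range is inverted or runs past the end of `text`.
fn slice_by_chars(text: &str, lo: usize, hi: usize) -> Option<&str> {
    if lo > hi {
        return None;
    }
    // Byte offset of every char boundary, including the end of the string.
    let mut boundaries = text
        .char_indices()
        .map(|(byte_idx, _)| byte_idx)
        .chain(std::iter::once(text.len()));

    // `nth(lo)` yields the byte offset of the `lo`-th character.
    let start = boundaries.nth(lo)?;
    // The iterator now sits just past `lo`, so step the remaining distance.
    let end = if hi == lo {
        start
    } else {
        boundaries.nth(hi - lo - 1)?
    };
    Some(&text[start..end])
}
```

With the input from the test above, `slice_by_chars(input, 4, 5)` selects `𓀕` and `slice_by_chars(input, 6, 7)` selects `c`, without landing inside a multibyte character.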
