Lexer accidentally(?) does not use is_ascii_whitespace for literal whitespace in string continuations

https://github.com/rust-lang/rust/pull/108403 proposed to fix this, but it was claimed that the current behavior was [documented in the reference](https://doc.rust-lang.org/reference/tokens.html#string-literals) in [this comment](https://github.com/rust-lang/rust/pull/108403#issuecomment-1459144787). Incorrectly, as far as I can see, as that page only describes *whitespace escapes* as being \r, \t, and \n and the fix was about literal whitespace in string continuations. Now https://doc.rust-lang.org/reference/expressions/literal-expr.html#string-continuation-escapes does describe this behavior, but this [was added later in Jan 2024](https://github.com/rust-lang/reference/pull/1452/files#diff-3a341cecbd9e3b210860ddef44cc4d4f0ed0e8a7a4a20b0242578fc767277573R83). Indeed, [this PR](https://github.com/rust-lang/reference/pull/1042) shows the reference documented skipping all whitespace, until  Jun 13, 2022.

Current behavior has this [ui test](https://github.com/rust-lang/rust/blob/master/tests/ui/str/str-escape.rs). It seems like this behavior was once implemented like it is now, then got claimed to be canon then got documented as canon. Anyway, I'm not sure why not all unicode whitespace is skipped, but just almost all ascii whitespace, but it seems important to pick an existing whitespace set, instead of using an old bad manual implementation of is_ascii_whitespace...

Perhaps we can see a crater run at least...




Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Lexer accidentally(?) does not use is_ascii_whitespace for literal whitespace in string continuations #136600

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Lexer accidentally(?) does not use is_ascii_whitespace for literal whitespace in string continuations #136600

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions