Why is there a "not greedy" comment here and what does that mean? #1831

historydev · 2025-05-17T21:04:04Z

https://doc.rust-lang.org/1.87.0/reference/tokens.html#raw-byte-string-literals

This seems to mean that not all of the x00-x7f range is allowed, the "non-greedy" comment refers to an invalid character in the pair, namely the carriage return (CR) - x0D.

It only confused, it's already clear that the carriage return can't be used, since the ASCII_FOR_RAW description has "except IsolatedCR".

I was told the following:
"This is a standard concept for regular expressions.
Greedy matching takes the maximum possible number of characters of the string to match the mask, non-greedy - the minimum possible.
For example, for the string axxxbxxxb greedy /a.*b/ will capture the entire string, and non-greedy /a.*?b/ only up to the first b."

The text was updated successfully, but these errors were encountered:

ehuss · 2025-05-17T21:40:33Z

It's non-greedy in a sense that if you have something like:

let a = br#"example a"#;
let b = br#"example b"#;

while it is matching the ASCII_FOR_RAW bytes in example a, it knows to stop before the "# in the first line. If it was greedy (the default for the * repetition), it would continue gobbling up characters until the end of example b, and thus ASCII_FOR_RAW would match example a"#;\nlet b = br#"example b.

https://en.wikipedia.org/wiki/Regular_expression#Lazy_matching contains a little more of a description.

historydev · 2025-05-17T21:50:59Z

It's non-greedy in a sense that if you have something like:

let a = br#"example a"#;
let b = br#"example b"#;
while it is matching the ASCII_FOR_RAW bytes in example a, it knows to stop before the "# in the first line. If it was greedy (the default for the * repetition), it would continue gobbling up characters until the end of example b, and thus ASCII_FOR_RAW would match example a"#;\nlet b = br#"example b.

https://en.wikipedia.org/wiki/Regular_expression#Lazy_matching contains a little more of a description.

Thx u very much!

Why isn't this described in the table? Is this such obvious information?

It doesn't even specify whether regular expression symbols are used, I didn't even think about it:
https://doc.rust-lang.org/1.87.0/reference/notation.html#string-table-productions

mattheww · 2025-05-18T11:36:12Z

To take your questions in reverse:

No, this isn't so obvious it doesn't need documenting.

The fact that the formalism being used isn't documented is a bug in the Reference (there's isn't an open issue explicitly about this, but I suppose it comes under #567).

The reason it isn't documented is that the Reference has evolved gradually from a Rust "manual" which described the lexical structure only in English.

In 2017 a contributor was kind enough to submit a form of the current "Lexer" blocks, and the editors at the time thought that was valuable enough to include without an explicit desciption of how they need to be interpreted.

(As I understand it he was using Antlr4 in its "lexer grammar" mode.)

historydev · 2025-05-18T14:50:09Z

To take your questions in reverse:

No, this isn't so obvious it doesn't need documenting.

The fact that the formalism being used isn't documented is a bug in the Reference (there's isn't an open issue explicitly about this, but I suppose it comes under #567).

The reason it isn't documented is that the Reference has evolved gradually from a Rust "manual" which described the lexical structure only in English.

In 2017 a contributor was kind enough to submit a form of the current "Lexer" blocks, and the editors at the time thought that was valuable enough to include without an explicit desciption of how they need to be interpreted.

(As I understand it he was using Antlr4 in its "lexer grammar" mode.)

Thx u very much!

ehuss added Language Cleanup Improvements to existing language which is correct but not clear, or missing examples, or the like. A-grammar Area: Syntax and parsing labels May 17, 2025

historydev closed this as completed May 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why is there a "not greedy" comment here and what does that mean? #1831

Why is there a "not greedy" comment here and what does that mean? #1831

historydev commented May 17, 2025

ehuss commented May 17, 2025

historydev commented May 17, 2025

mattheww commented May 18, 2025

historydev commented May 18, 2025

Why is there a "not greedy" comment here and what does that mean? #1831

Why is there a "not greedy" comment here and what does that mean? #1831

Comments

historydev commented May 17, 2025

ehuss commented May 17, 2025

historydev commented May 17, 2025

mattheww commented May 18, 2025

historydev commented May 18, 2025