`DelimTokenTree`-based parsing #3791

powerboat9 · 2025-05-10T03:51:39Z

For a future project, it would be nice to convert the parser to run on DelimTokenTree, rather than a stream of tokens. It would make parsing simpler and help catch edge cases like #3790.

Implementation-wise, we'd probably want to do a first pass through the list of tokens, verifying that there aren't any mismatched delimiters. During that pass we could also cache the distance between each pair of matching delimiters (the number of tokens strictly between them), sorted by left delimiter location. For example, `[ ( { a } b ) c d ]` would produce the distance cache `[8, 4, 1]`. Then we could have something like a `ManagedTokenSource`, say `TreeSource`, that gives out either tokens or delimited token trees (via new instances of `TreeSource`). It would only have to store an iterator into the list of tokens, an ending iterator into the list of tokens, and an iterator into the distance cache.
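A minimal sketch of that first pass, for illustration only. `Token` is a hypothetical stand-in (a plain string) rather than the real token type, and `build_distance_cache`, `is_open`, `is_close`, and `matches` are made-up names. The distance is taken to be the number of tokens strictly between a pair, which is what the `[8, 4, 1]` example implies.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a lexed token: just its spelling.
using Token = std::string;

static bool is_open (const Token &t) { return t == "(" || t == "[" || t == "{"; }
static bool is_close (const Token &t) { return t == ")" || t == "]" || t == "}"; }

static bool matches (const Token &open, const Token &close)
{
  assert (is_open (open) && is_close (close));
  return (open == "(" && close == ")") || (open == "[" && close == "]")
         || (open == "{" && close == "}");
}

// Single pass over the tokens. Returns false on mismatched or unclosed
// delimiters. On success, `cache` holds one entry per delimiter pair,
// ordered by the position of the opening delimiter; each entry is the
// number of tokens strictly between the two delimiters.
bool build_distance_cache (const std::vector<Token> &tokens,
                           std::vector<std::size_t> &cache)
{
  cache.clear ();
  // Each stack entry: (token index of the opener, its slot in the cache).
  std::vector<std::pair<std::size_t, std::size_t>> open_stack;

  for (std::size_t i = 0; i < tokens.size (); ++i)
    {
      if (is_open (tokens[i]))
        {
          // Reserve the slot now so entries stay sorted by opener position.
          open_stack.emplace_back (i, cache.size ());
          cache.push_back (0);
        }
      else if (is_close (tokens[i]))
        {
          if (open_stack.empty ()
              || !matches (tokens[open_stack.back ().first], tokens[i]))
            return false;
          cache[open_stack.back ().second] = i - open_stack.back ().first - 1;
          open_stack.pop_back ();
        }
    }
  return open_stack.empty ();
}
```

Reserving the cache slot when the opener is seen, and filling it in when the closer arrives, keeps the entries in left-delimiter order without a separate sort.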

Producing an initial TreeSource would be simple. Additionally, spinning off TreeSource instances for nested delimited pairs would be cheap, and the distance cache would make advancing the original TreeSource instance past the nested delimited pair cheap as well.
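A rough sketch of what such a `TreeSource` might look like, not a proposal for the actual interface. `Token` is again a hypothetical string stand-in, and the method names are invented. One detail left open above is how the parent's cache iterator skips the entries that belong to a nested tree; this sketch assumes each cache entry also records how many delimiter pairs are nested inside it (`inner_pairs`), which goes beyond the plain distance list described earlier.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for a lexed token: just its spelling.
using Token = std::string;

struct PairInfo
{
  std::size_t inner_tokens; // tokens strictly between the two delimiters
  std::size_t inner_pairs;  // assumption: delimiter pairs nested inside
};

class TreeSource
{
public:
  using TokIt = std::vector<Token>::const_iterator;
  using PairIt = std::vector<PairInfo>::const_iterator;

  TreeSource (TokIt begin, TokIt end, PairIt first_pair)
    : cur (begin), end (end), pair (first_pair)
  {}

  bool at_end () const { return cur == end; }

  // True if the cursor sits on an opening delimiter.
  bool at_tree () const
  {
    return !at_end () && (*cur == "(" || *cur == "[" || *cur == "{");
  }

  // Consume one plain (non-delimiter) token.
  const Token &next_token ()
  {
    assert (!at_end () && !at_tree ());
    return *cur++;
  }

  // Hand out the interior of the delimited tree at the cursor as a new
  // TreeSource, and move this source past the closing delimiter. Both
  // steps are constant time thanks to the cache entry.
  TreeSource next_tree ()
  {
    assert (at_tree ());
    const PairInfo info = *pair;
    TreeSource inner (cur + 1, cur + 1 + info.inner_tokens, pair + 1);
    cur += info.inner_tokens + 2;  // opener + interior + closer
    pair += info.inner_pairs + 1;  // this pair + everything nested in it
    return inner;
  }

private:
  TokIt cur;
  TokIt end;
  PairIt pair;
};
```

For `[ ( { a } b ) c d ]`, the cache under this assumption would be `{8, 2}, {4, 1}, {1, 0}`. Calling `next_tree()` on the top-level source yields a source over `( { a } b ) c d` and leaves the top-level source at its end.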

powerboat9 · 2025-05-10T04:01:34Z

It looks like this would require storing a file's worth of lexed tokens in a vector, instead of single-passing them, but that doesn't seem like too much of a drawback.

powerboat9 · 2025-05-10T04:07:10Z

It could also allow us to move away from having Parser as a template, by providing some indirection away from Lex and ProcMacroInvocLexer.

powerboat9 · 2025-05-10T04:09:00Z

Not going to work on it right now though, since I'm more focused on getting nr2.0 through

powerboat9 self-assigned this May 10, 2025
